Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a demonstration application capable of full-text search, eDiscovery, and information governance.
ASPseek is a full-featured, medium-to-large-scale, SQL-based Internet search engine. It consists of an indexing robot, a search daemon, and a search frontend (CGI program). These programs are written in C++ using the STL.
Proof-of-concept sample application that uses IBM Watson and IBM Bluemix services to automatically index video files by analyzing video frames and audio speech.
Our framework can be used for producing heapfiles that correspond to R#-Tree hierarchies of arbitrary dimensionality and block size, and then performing complex operations on top of them using our RESTful interface, #QL (or not).
Indexes Wuzzuf data (which consists of two tables: job vacancies and candidates' applications) into Elasticsearch and uses this data to recommend other vacancies that match candidates' interests.
The library has a bunch of pretty bad things:
For the 2nd one, see e.g.: https://github.com/stiang/remove-markdown/blob/master/index.js#L33
An example:
Ideally, the library should use a markdown parser, but remove-markdown is pretty far from it, so I'd sugge