Sign Up (1k chunks free) | HN Search and RAG |Documentation | Meeting Link | Discord | Matrix
This repository contains the code for an engine which provides sub-100ms vector search (SPLADE, dense vector, cross-encoder re-rank'ed hybrid), recommendations, RAG, and analytics for Hacker News.
Issues and contributions are welcome! There are three main folders for this project and they contain the code for their respective components of the application.
actix-frontend
for the nojs version at hnnojs.trieve.ai built using Rust actix-web and Minijinjasolidjs-spa
for the power tool JS version at hn.trieve.aiingest
for the various scripts relying on Redis queue's for pulling all of the data from the API and indexing it into a Trieve instance for search and RAG
-
Dense vector semantic search, re-rankers, SPLADE, and other techniques have gotten a lot of hype recently, but it's hard to figure out where each technique is best applied. We're hackers and built hacker'y features into Trieve to make it easier to test and experiment with this new tech, and needed a dataset us and our friends would have knowledge of to mess with. We're hoping to build lots of cool shareable projects on top of this demo to learn and collect feedback with.
-
Make HN search more explorable with recommendations plus RAG and more open with public analytics. We added a feedback button under the search bar and are collecting CTR data such that we can continously fine-tune our models to improve the search experience. If you have some time, try out our blind comparison at hn-comparison.trieve.ai to help us collect higher quality training data.
-
Open up a more fully-featured discovery API for others who want to build on top of HN data. The firebase HN API is great, but storing all the data yourself is expensive and time-consuming. We are hoping to lower that barrier in the old Octopart HNSearch spirit.
-
Show what's possible with Trieve, work out scaling/stability bugs, and pressure test.
-
HN is a cool place where we like to spend time and showing what we are working on to this community we feel apart is something we're excited about!