Yang HW4 by xyanglu · Pull Request #13 · inference-ai-course/Homework4-Submission

xyanglu · 2025-11-26T23:52:07Z

This week's task was to create a RAG pipeline using recent arXiv cs.CL papers, converting them into searchable chunks, embedding them, and indexing them with FAISS. We then implement a simple query interface that takes a user question, retrieves the top relevant chunks, and displays them for further processing.

The deliverables are described below and can be found in the following files:

Code Notebook / Script: see main.py
Data & Index: see faiss_index.bin and chunks.json
Retrieval Report: see
Example 1.png
Example 2.png
Example 3.png
Example 4.png
Example 5.png
FastAPI Service: To run, use uvicorn main:app --reload --port 5000 and make requests by calling /search with param q as your query

This script implements a FastAPI application that extracts text from PDF files, chunks the text, and uses a SentenceTransformer model to create embeddings for a FAISS index. It also provides a search endpoint to retrieve the top-3 passages based on a query.

xyanglu added 4 commits November 26, 2025 18:33

Add files via upload

d34c031

Add deliverables section to README

4615b74

Add files via upload

e4fdab2

xyanglu assigned xyanglu and ScottLL and unassigned xyanglu Nov 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Yang HW4#13

Yang HW4#13
xyanglu wants to merge 4 commits intoinference-ai-course:mainfrom
xyanglu:main

xyanglu commented Nov 26, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

xyanglu commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

xyanglu commented Nov 26, 2025 •

edited

Loading