In March 2023, I developed a retrieval augmented generation (RAG) platform called AlignmentSearch, a conversational agent that curates the Alignment Research Dataset (ARD). When asked a query, we source relevant passages from a dataset of academic papers, books, articles, and other resources on AI safety and alignment. We then prompt GPT-3 to synthesize a response while citing back to these passages.
I set up a database for efficient querying of sources using Pinecone, optimized semantic similarity model by fine-tuning terminology-specific embeddings layer, developed automated MySQL database updates for multiple sources, and engineered question-answering assistant prompt for maximum accuracy and helpfulness.