A collection of LLM-related papers, theses, tools, datasets, courses, open-source models, and benchmarks
Updated Oct 8, 2024 - Python
Summaries of AI research
MechaMap - Toolkit for Mechanistic Interpretability (MI) Research
AI Agent Version Control Framework for Real-Time Updating of Tools
This project analyzes a resume against a job description and produces an overall matching score. It also gives recommendations and actionable insights for tailoring the resume to the job, and suggests skills and courses to bridge the skill gap.
Replication package of the paper 'Large Language Models for In-File Vulnerability Localization are "Lost in the End"' (https://doi.org/10.1145/3715758)
A Python framework designed to support various iterative and adaptive reasoning patterns, including Answer On Thought (AoT), Learn to Think (L2T), Graph of Thoughts (GoT), a novel Hybrid approach, and Fact-and-Reflection (FaR).
Empirical documentation of progressive degradation and metacognitive behaviors in conversational AI through narrative frameworks. A 5-session experiment with DeepSeek V3 demonstrating how content filters systematically reduce AI utility.
Research framework studying the impact of API documentation quality on LLM code generation success - discovering the documentation sweet spot phenomenon
Notes and personal observations from the Gandalf: Agent Breaker beta, a red-team challenge for testing LLM security.