alphaXiv MCP connects Claude Code directly to arXiv’s millions of papers via the Model Context Protocol. The killer feature: embedding similarity search — it finds papers that are conceptually related to your query, not just keyword matches. Papers using different terminology but addressing the same problem. Papers you’d never find by searching for exact terms. This is the missing piece in AI-assisted literature reviews.
| *Source: alphaXiv MCP Docs | alphaXiv Launch Tweet | AlphaSignal: Upgrade Your Paper Research | DeepakNess: arXiv Now Has an MCP Server* |
Install — One Command
claude mcp add --transport http alphaxiv https://api.alphaxiv.org/mcp/v1
That’s it. Claude Code now has access to arXiv. No API key needed for basic use.
Known issue: You may need to reactivate the MCP each time you restart Claude Code.
Six Tools Available
| Tool | What It Does | Best For |
|---|---|---|
embedding_similarity_search |
Semantic/conceptual paper discovery via embeddings | Finding related work you’d miss with keywords |
full_text_papers_search |
Keyword search across arXiv | Method names, benchmarks, author searches |
agentic_paper_retrieval |
Multi-turn autonomous retrieval (beta) | Complex research questions requiring multiple papers |
get_paper_content |
Get paper text as structured report or raw | Reading and summarizing specific papers |
answer_pdf_queries |
Ask questions about a specific paper | Quick fact extraction from papers |
read_files_from_github_repository |
Access paper codebases on GitHub | Understanding implementations |
Why Embedding Search Changes Research
KEYWORD SEARCH: "type system" "gradual typing" "security"
→ finds papers using EXACTLY those terms
EMBEDDING SEARCH: "type systems that improve software security"
→ finds papers about:
- information flow control (different terminology, same concept)
- dependent types for memory safety
- refinement types for cryptographic protocols
- capability-based security (related approach)
→ papers you'd NEVER find by keyword
The embedding search takes 2-3 sentence descriptions covering concepts, methods, applications, and related terminology. Returns up to 25 papers ranked by relevance + popularity.
Research Workflows
Literature Review (Parallel Search)
Run all three search tools simultaneously for maximum coverage:
1. embedding_similarity_search → conceptual matches
2. full_text_papers_search → exact term matches
3. agentic_paper_retrieval → autonomous deep search
↓
Combine results, deduplicate
↓
get_paper_content → read top papers
↓
answer_pdf_queries → extract specific claims/methods
Reading Dependency Graph
Someone in the community already built this: give a paper, have Claude use alphaXiv to find its key references, then find their references, building a dependency graph of what to read in what order. Perfect for learning a new subfield.
Paper → Code Analysis
get_paper_content → extract GitHub URL from paper
↓
read_files_from_github_repository → explore repo structure
↓
Claude analyzes implementation details
Alternatives & Comparison
| Tool | Scope | Search Type | Setup |
|---|---|---|---|
| alphaXiv MCP | arXiv | Embedding + keyword + agentic | claude mcp add (one command) |
| arxiv-mcp-server | arXiv | Keyword | Self-hosted |
| Semantic Scholar MCP | Semantic Scholar | Semantic | Self-hosted, no API key needed |
| Academix MCP | OpenAlex + DBLP + S2 + arXiv + CrossRef | Multi-source | Self-hosted |
| Zotero MCP | Your Zotero library | Local collection | Self-hosted |
alphaXiv MCP is the easiest to set up (one command, no self-hosting) and the only one with embedding search built in.
How LearnAI Team Could Use This
- Build faster literature reviews — combine semantic, keyword, and agentic search when preparing AI education or coding-agent research briefs.
- Track AI education research — monitor papers on AI-assisted learning, assessment, tutoring, and classroom coding agents.
- Support student research projects — help learners find related work even when they do not know the exact academic terminology.
Real-World Use Cases
| Scenario | Description |
|---|---|
| Literature reviews | Find conceptually related papers beyond exact keyword matches. |
| Reading lists | Generate dependency graphs that show what to read first in a new subfield. |
| Paper-to-code review | Pull paper text and linked repositories into Claude Code for implementation analysis. |
For LearnAI Research Areas
| Research Area | How alphaXiv MCP Helps |
|---|---|
| Type systems | Embedding search: “type systems that enforce safety properties” finds papers across PL, security, and verification |
| Program analysis | Search for conceptually similar analysis techniques even when different terminology is used |
| Formal verification | agentic_paper_retrieval: “recent advances in automated verification of concurrent programs” |
| Security education | Literature review on “teaching security concepts through programming language design” |
| AI education (LAI) | Track new papers on AI-assisted learning, coding agents in education, assessment methods |
For Students
- Literature review assignments — students search for papers semantically, not just by keyword
- Research methodology — teaches the difference between keyword and embedding search
- Reading dependency graphs — Claude builds a “what to read first” map for any topic
- Paper comprehension —
answer_pdf_querieslets students ask questions about papers they’re reading