arXiv recommender
Find ML papers worth your time. Five algorithms, side by side.

Pick any arXiv paper from the ML / AI / NLP / CV corpus on OpenAlex. Four algorithms each return their own top-10 most-similar papers: a hybrid blend, a sentence-transformer content tower, classic TF-IDF, and an ALS model fit on the citation graph. Latency badges report live calls against the FastAPI service. The leaderboard at the bottom is the offline evaluation of these four plus a popularity baseline, with bootstrap 95% confidence intervals on every metric.

Best MAP@10 (hybrid): 0.167, 16x the popularity baseline
Citation graph: 46.4k edges (in-subset, citing to cited)
p95 latency (hybrid): 75.4 ms, top-10 from a 28,000-paper catalogue
Catalogue: 28,424 CS arXiv papers since 2019 via OpenAlex

Seed selection

Type any title or author. The FastAPI service does the lookup.

Recommendations, side by side

Hybrid

Neural + ALS + TF-IDF + popularity blend, cold-paper aware
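
The page does not state how the four signals are combined, so the following is only a minimal sketch of one plausible blend. The weights in `WEIGHTS`, the min-max normalisation, and the rule of dropping the ALS term for a cold seed (one with no citation edges) are all assumptions made for illustration, not the service's actual logic.

```python
from typing import Dict, List

# Hypothetical blend weights; the page does not publish the real ones.
WEIGHTS = {"neural": 0.4, "als": 0.3, "tfidf": 0.2, "popularity": 0.1}


def min_max(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale one model's candidate scores to [0, 1] so models are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {pid: 0.0 for pid in scores}
    return {pid: (s - lo) / (hi - lo) for pid, s in scores.items()}


def blend(per_model: Dict[str, Dict[str, float]],
          seed_is_cold: bool, k: int = 10) -> List[str]:
    """Weighted sum of normalised scores; the ALS term is skipped for cold seeds."""
    weights = dict(WEIGHTS)
    if seed_is_cold:
        # Assumption: a seed with no citation edges has no useful ALS factors.
        weights.pop("als")
    total = sum(weights.values())  # renormalise so the weights sum to 1
    combined: Dict[str, float] = {}
    for model, w in weights.items():
        for pid, s in min_max(per_model.get(model, {})).items():
            combined[pid] = combined.get(pid, 0.0) + (w / total) * s
    # Highest score first; ties broken by paper id for determinism.
    return sorted(combined, key=lambda pid: (-combined[pid], pid))[:k]


scores = {
    "neural": {"a": 1.0, "b": 0.8, "c": 0.0},
    "als": {"a": 0.0, "b": 1.0, "c": 0.0},
}
warm_top = blend(scores, seed_is_cold=False, k=2)
cold_top = blend(scores, seed_is_cold=True, k=2)
```

In this toy input the citation signal lifts paper `b` above `a` for a warm seed, while a cold seed falls back to the content ranking alone.
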

Neural

MiniLM sentence-transformer over title + abstract, cosine
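
The ranking step of this panel can be sketched as a cosine top-k over precomputed vectors. This example assumes the title + abstract embeddings already exist (here replaced by toy 2-d vectors) and uses plain Python for clarity; a real service would more likely hold the vectors in a matrix and use a single matrix product. The function names are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def top_k_similar(seed_id: str,
                  embeddings: Dict[str, Sequence[float]],
                  k: int = 10) -> List[Tuple[str, float]]:
    """Rank every other paper by cosine similarity to the seed's embedding."""
    seed = embeddings[seed_id]
    scored = [(pid, cosine(seed, vec))
              for pid, vec in embeddings.items() if pid != seed_id]
    # Highest similarity first; ties broken by paper id for determinism.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:k]


# Toy stand-ins for sentence-transformer vectors (assumption: real ones
# would have a few hundred dimensions).
toy = {"p1": [1.0, 0.0], "p2": [0.9, 0.1], "p3": [0.0, 1.0], "p4": [-1.0, 0.0]}
neighbours = top_k_similar("p1", toy, k=2)
```
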

Content (TF-IDF)

Title + abstract + authors + topic, sparse cosine
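
As a rough illustration of sparse cosine over TF-IDF, the sketch below builds L2-normalised term-weight dictionaries and compares them with a sparse dot product. It assumes each paper's fields have already been concatenated into one string, and it uses a smoothed IDF and a simple alphanumeric tokenizer; the page does not say which weighting or tokenization the service uses, so both are illustrative choices.

```python
import math
import re
from collections import Counter
from typing import Dict, List


def tokenize(text: str) -> List[str]:
    """Lowercase alphanumeric tokens (an assumed, deliberately simple tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def fit_tfidf(docs: Dict[str, str]) -> Dict[str, Dict[str, float]]:
    """Return L2-normalised sparse TF-IDF vectors keyed by paper id."""
    counts = {pid: Counter(tokenize(text)) for pid, text in docs.items()}
    # Document frequency: each paper contributes each of its terms once.
    df = Counter(term for c in counts.values() for term in c)
    n = len(docs)
    vectors: Dict[str, Dict[str, float]] = {}
    for pid, c in counts.items():
        # Smoothed IDF: log((1 + n) / (1 + df)) + 1
        vec = {t: tf * (math.log((1 + n) / (1 + df[t])) + 1.0)
               for t, tf in c.items()}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        vectors[pid] = {t: w / norm for t, w in vec.items()} if norm else {}
    return vectors


def sparse_cosine(u: Dict[str, float], v: Dict[str, float]) -> float:
    """Dot product of two unit-length sparse vectors, iterating the shorter one."""
    if len(u) > len(v):
        u, v = v, u
    return sum(w * v.get(t, 0.0) for t, w in u.items())


docs = {
    "a": "graph neural networks for citation recommendation",
    "b": "citation recommendation with graph embeddings",
    "c": "image segmentation with convolutional networks",
}
vecs = fit_tfidf(docs)
sim_ab = sparse_cosine(vecs["a"], vecs["b"])
sim_ac = sparse_cosine(vecs["a"], vecs["c"])
```

Here `a` and `b` share three terms while `a` and `c` share one, so `sim_ab` comes out larger than `sim_ac`.
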

Citation ALS

Implicit ALS over citing -> cited edges, 96 factors

Leaderboard

Held-out evaluation on 2,000 citing-paper seeds, top-10, bootstrap 95% confidence intervals

Algorithm      MAP@10   NDCG@10  Recall@10  Precision@10  Coverage  p95 lat (ms)
Popularity     0.0101   0.0155   0.0320     0.0033        0.000     0.1
TF-IDF         0.1546   0.1925   0.3110     0.0317        0.387     66.4
Neural         0.1279   0.1623   0.2709     0.0277        0.366     3.7
Citation ALS   0.0392   0.0588   0.1225     0.0127        0.184     0.7
Hybrid (best)  0.1665   0.2101   0.3465     0.0353        0.333     75.4
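
For readers who want to reproduce numbers of this kind, here is a minimal sketch of a per-seed AP@10 and a percentile bootstrap over seeds. It assumes AP@k is normalised by min(|relevant|, k) and that the interval is a plain percentile bootstrap of the mean; the page does not specify either convention, so treat both as assumptions. The function names are hypothetical.

```python
import random
from typing import Sequence, Set, Tuple


def average_precision_at_k(ranked: Sequence[str], relevant: Set[str],
                           k: int = 10) -> float:
    """AP@k: precision at each rank holding a relevant item, averaged."""
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for rank, pid in enumerate(ranked[:k], start=1):
        if pid in relevant:
            hits += 1
            total += hits / rank
    # Assumed normalisation: the best achievable number of hits in the top k.
    return total / min(len(relevant), k)


def bootstrap_ci(values: Sequence[float], n_boot: int = 1000,
                 alpha: float = 0.05, seed: int = 0) -> Tuple[float, float, float]:
    """Mean of per-seed values with a percentile bootstrap (1 - alpha) interval."""
    rng = random.Random(seed)
    n = len(values)
    # Resample seeds with replacement and record the mean of each resample.
    means = sorted(sum(rng.choices(values, k=n)) / n for _ in range(n_boot))
    lo_idx = int(n_boot * alpha / 2)
    hi_idx = n_boot - 1 - lo_idx
    return sum(values) / n, means[lo_idx], means[hi_idx]


# One toy seed: relevant papers appear at ranks 1 and 3.
ap = average_precision_at_k(["a", "b", "c"], {"a", "c"})
# Toy per-seed AP values; MAP@10 is their mean.
map_at_10, ci_low, ci_high = bootstrap_ci([0.0, 1.0] * 50)
```

Resampling at the level of seeds, as here, keeps each seed's ranked list intact, which is the usual choice when the metric is averaged per query.
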