Joined 10/20/2014, 6:35:24 PM; has 4706 karma
Applied ML @ Voyage AI (https://voyageai.com).
I love chess, deadlifting, and Elden Ring.
Twitter: https://x.com/frankzliu
LinkedIn: https://www.linkedin.com/in/fzliu
Biomolecular shifts occur in our 40s and 60s (2024)
Text Embedding Benchmark
Watermarking Autoregressive Image Generation
Agentic Search for Dummies
Benchmark for Evaluating Text Embeddings
Does RL Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Emergence of Diffusion Models from Associative Memory
MAIR: A Benchmark for Evaluating Instructed Retrieval
Wanting to Be Understood Explains the Meta-Problem of Consciousness
The Gumbel-Softmax Distribution
The Value of Chess Pieces
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions
CURE: A Dataset for Clinical Understanding and Retrieval Evaluation
Extracting memorized pieces of books from open-weight language models
The FCC Builds a Firewall Around US-Bound Electronics
Towards Understanding Sycophancy in Language Models
Extended Thinking Tips
Embedding Benchmark for Retrieval
Torch Backends
Hideo Kojima's Boss Fight with Time
AI Risk Repository
RAG for Contract Analysis
tau²-bench
Video Search with Multimodal AI
Untether AI Shuts Down, Engineering Team Joins AMD
Norway Chess 2025 in 7 Graphs
Retrieval Embedding Benchmark
General-Purpose vs. Domain-Specific Embedding Models