Joined 4/3/2021, 1:02:35 PM has 744 karma
Machine Learning Conferences Should Establish "Refutations and Critiques" Track
SuperGPQA: Scaling LLM Evaluation Across 285 Graduate Disciplines
SepLLM: Accelerate LLMs by Compressing One Segment into One Separator
Step-Video-T2V: The Practice, Challenges, and Future of Video Foundation Model
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
Libnginx: Nginx as a Shared Library
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]
Fast vectorizable algorithms of binary searching for floating point numbers
New OpenAI Feature: Predicted Outputs
Collaborative Filtering Is Wrong and Here Is Why
REST: A Plug-and-Play Method for Accelerating LLM Without Additional Training
Smoke 'em if you got 'em: Hacker gains root access using cigarette lighter
O1 Replication Journey: A Strategic Progress Report
Failures of Gradient-Based Deep Learning (2017) [pdf]
Qwen2-VL
Qwen2-Math
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention
MiniCPM-v2.6: GPT-4V Level MLLM for Single/Multi Image and Video on Your Phone
MindSearch: LLM-Based Web Search Engine Similar to Perplexity.ai and SearchGPT
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters
PowerInfer-2: Fast Large Language Model Inference on a Smartphone
Large-scale photonic chiplet Taichi empowers 160TOPS/W AI
Asterinas: OS kernel written in Rust and providing Linux-compatible ABI
Mq-deadline scalability improvements (with more than 100% improvement)
Researchers Create First Functional Semiconductor Made from Graphene
Wayland Enjoyed Many Successes in 2023
Improving our safety with a physical quantities and units library
Nesting chinstrap penguins sleep by seconds-long microsleeps
PowerInfer: High-Speed Large Language Model Serving on Consumer-Grade GPUs