Joined 12/17/2023, 5:21:25 AM; has 567 karma
Founder and CEO @ Dragonscale Industries Inc. Past: Data science platform @ Apple, CEO @ Tuplejump Inc.
Show HN: CodePrism – an AI-generated code analysis engine as MCP
VILA: On Pre-Training for Visual Language Models
Flame: Factuality-Aware Alignment for Large Language Models
LoRA Land: 310 Fine-Tuned LLMs That Rival GPT-4, a Technical Report
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
ResearchAgent: Iterative Research Idea Generation Using LLMs
CodecLM: Aligning Language Models with Tailored Synthetic Data
Viking – a family of models for the Nordic languages
Mixtral-8x22B on HuggingFace
Implementation of Google's Griffin Architecture – RNN LLM
Griffin: RNN for Efficient Language Models
Categorical Deep Learning: An Algebraic Theory of Architectures
Training LLMs over Neurally Compressed Text
TinyTimeMixer: Open-source time series LLM by IBM
White House Announces Open Science Recognition Challenge Winners
Mixture-of-Depths: Dynamically allocating compute in transformers
Arizona State University – Can Large Language Models Reason and Plan?
Can LLMs Ever Reason?
Berkeley Function-Calling Leaderboard
Language models as compilers: Simulating pseudocode execution
GLM: Genome Language Model – Deep learning to predict gene function
Evolution of RAG
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Mistral 7B v0.2