On-device, AI memory runtime for continual learning.
We’re in the prototype stage and seeking design partners among early-stage companies. Connect with us in the #tiles channel on the User & Agents Discord, or reach out via [email protected]. Subscribe to our blog Neuron Analysis for updates on on-device AI and personalization research.
Below is a living index of resources that inform and inspire our work.
- ✨ Unternet Kernel
- Ollama JavaScript library
- ✨ Modelfile Reference - Ollama English Documentation
- ✨ Introducing Gemma 3n: The developer guide
- Foundation Models adapter training - Apple Intelligence - Apple Developer
- ✨ Unsloth AI - Open Source Fine-tuning & RL for LLMs
- ✨ Mistral.rs, a cross-platform, highly-multimodal inference engine
- Osmosis, Unlocking AI self-improvement at production scale
- Supermemory MCP
- ✨ Introducing the v0 composite model family, Vercel
- Agent Reinforcement Trainer, OpenPipe
- Universal Quantized File Format: UQFF
- GGUF Tool Suite
- uqff_maker
- Minions, Big & Small LLMs working together
- ✨ The Kaitchup Index: A Leaderboard for Quantized LLMs
- Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily
- Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber
- 📏RULER: Easy Mode for RL Rewards
- ART·E: How We Built an Email Research Agent That Beats o3
- OpenBench, Provider-agnostic, open-source evaluation infrastructure for language models
- ✨ LoRA's Limitations: Head-to-Head with Full RL
- A DSPy rewrite to Rust
- ✨ A case for client-side machine learning, Christopher Fleetwood
- ✨ Ratchet Architecture
- ✨ How Tailscale works
- Democratizing Al: The Psyche Network Architecture, Nous Research
- ✨ The Bitter Lesson is coming for Tokenization
- On the Way to LLM Personalization: Learning to Remember User Conversations, Apple Machine Learning Research
- ✨ Text-to-LoRA: Instant Transformer Adaption, Sakana AI
- ✨ Small Language Models are the Future of Agentic AI, NVIDIA Research
- ✨ Defeating Prompt Injections by Design, Google Deepmind
- Introducing FlexOlmo: a new paradigm for language model training and data collaboration, Allen AI
- WhisperKit: On-device Real-time ASR with Billion-Scale Transformers, Argmax
- ✨ Towards Large-scale Training on Apple Silicon, Exo Labs
- Kinetics: Rethinking Test-Time Scaling Laws
- Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search
- LoFT: Low-Rank Adaptation That Behaves Like Full Fine-Tuning
- AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air
- Comparative Analysis of Retrieval Systems in the Real World
- FedVLM: Scalable Personalized Vision-Language Models through Federated Learning
- On the Way to LLM Personalization: Learning to Remember User Conversations
- A Preliminary Report On Edge-Verified Machine Learning, Exo Labs
- ✨ Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities
- ✨ Intent-Based Architecture and Their Risks
- ✨ r/LocalLLaMA
- ✨ The State of On-Device LLMs
- ✨ An Analogy for Understanding Transformers
- ✨ Neural networks, 3Blue1Brown
- MCP 201: The power of protocol, Anthropic
- GGUF Quantization Docs (Unofficial)
- Reverse-engineering GGUF | Post-Training Quantization
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine
- H100 PCIe vs SXM vs NVL: Which H100 GPU Is Fastest and Most Cost-Effective for Fine-Tuning LLMs?
- Programming as theory building
- ✨ The Use of Knowledge in (AGI) Society, Luke Drago
- ✨ Workshop Labs Mission, Workshop Labs
- ✨ Empowering humans in the age of AI, Imbue
- ✨ Everything is ugly, so go build something that isn't — Raiza Martin, Huxe (ex NotebookLM)
- ✨ Responsive Software, Osmosis
- Our designer built an operating system with Cursor
- ✨ Why Tool Als Want to Be Agent Als, Gwern
- Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai
- ✨ Machines of Buying and Selling Grace - Adam Behrens, New Generation
- ✨ Andrej Karpathy: Software Is Changing (Again)
- ✨ I Remastered Facebook's Little Red Book
- ✨ The Rise of Personal LLM "Cognitive Core", Andrej Karpathy
- Why I hope Apple keeps investing in on-device AI
- Fun stories from building OpenRouter and where all this is going - Alex Atallah, OpenRouter
- Arcee AI Conductor
- ✨ Announcing the Arcee Model Engine Public Beta
- Inference by Sequoia
- ✨ Something Pretty Right: A History of Visual Basic | Retool
© 2025 TilesHQ. All rights reserved.