ai - Tags - Luis Quintanilla

Luis Quintanilla

Home
About
Profile Contact Uses Colophon
Feeds
Main Responses Blog
Subscribe
Starter Packs
Blogroll Podroll Forums YouTube
Collections
Radio Books Tags
Knowledgebase
Snippets Wiki Presentations
Live
Stream Recordings
Events

ai

A list of posts tagged ai

Blogs

How do I keep up with AI?
Vibe-Specing - From concepts to specification
Llama's Turn On: Tuning In to AI's Quest for Higher Consciousness in MOOs
Digitize Analog Bookmarks using AI, .NET, and GitHub Models
Configure Ollama on Dev Containers and VS Code
Getting started with Ollama on Windows
Using Generative AI to produce Spotify Clips
AI like it's 1999 or 1899
Quick thoughts about Snapdragon Summit 2023
Deploy ML.NET Machine Learning Model in Blazor WebAssembly Static Website
Use machine learning to categorize web links with F# and ML.NET
Restaurant Inspections ETL & Data Enrichment with Spark.NET and ML.NET Automated (Auto) ML
The Case for Doing Machine Learning with F#
Operationalizing Machine Learning with ML.NET, Azure DevOps and Azure Container Instances
Serverless Machine Learning with ML.NET and Azure Functions
Deploy .NET Machine Learning Models with ML.NET, ASP.NET Core, Docker and Azure Container Instances

Notes

Ollama Adds Mistral 3.1 Support
Tinkering with DeepSeek R1, GitHub Models, and .NET on stream
Spotify Wrapped 2024 AI Generated Podcast
.NET Conf 2024 Bound
Clock Tables - Org Mode, Plain Text, and AI
These models are too damn big!
Use AI to generate a blogroll others can subscribe to
New Era of Work - Windows / Surface Event Blog (March 21, 2024)
Book Review - Agency
Down the weird web
What about instrumentals? AI Generated Spotify Clips Addendum
AI abundance after scarcity cycles
Quick Thoughts Snapdragon Summit 2023 Addendum
New AI generated phone wallpaper
Dall-E Outpainting and generative models
Vertigo AI
Web Neural Network API - Working Draft
Next gen stick figures using AI and NVIDIA Canvas

Responses

State-Of-The-Art Prompting For AI Agents
Introducing ElevenLabs Conversational AI 2.0
The Darwin Gödel Machine - AI that improves itself by rewriting its own code
Agent Network Protocol - The HTTP of the Agentic Web Era
Claude Artifacts
Introducing AutoRound
Parakeet TDT 0.6B V2 (En)
Introducing Locate 3D
ZeroSearch - Incentivize the Search Capability of LLMs without Searching
A Survey of AI Agent Protocols
TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching
CORG: Generating Answers from Complex, Interrelated Contexts
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks
GPT Image 1 - Image Generation API
The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation
Introducing Gemma 3
Introducing Command A
Simon Willison's AI-Generated Tools Colophon
Mistral Small 3.1
Introducing AX: Why Agent Experience Matters
Introducing Mercury, the first commercial-scale diffusion large language model
s1: Simple test-time scaling
Nomic Embed Text V2: An Open Source, Multilingual, Mixture-of-Experts Embedding Model
Claude 3.7 Sonnet and Claude Code
The Ultra-Scale Playbook: Training LLMs on GPU Clusters
Dream Job - Google Super Bowl 2025 Ad
Languages & Runtime Community Standup - Tensors in .NET
Generative AI for Beginners (.NET) is now available
AI Dev Gallery now in the Microsoft Store
HuggingFace AI Agents Course
The Illustrated DeepSeek-R1
Introducing deep research
Semantic Search and On-Device ML in Emacs
Swarm navigation of cyborg-insects in unknown obstructed soft terrain
AI Subtitles Are Coming to VLC
Agents
Agents Whitepaper
LangChain State of AI 2024 Report
MarkItDown - Convert files to Markdown
Build a YouTube chat app with .NET
Sora is here
Day 1 of .NET Conf
Microsoft Research: Introducing DRIFT Search
Microsoft Research Focus: Week of October 28, 2024
In-Context LoRA for Diffusion Transformers
Thinking LLMs: General Instruction Following with Thought Generation
NotebookLlama: An Open Source version of NotebookLM
Join me at DEVintersection in Las Vegas - September 10-12
Bringing Llama 3 to life
Transformers in music recommendation
Transformer Explainer: Interactive Learning of Text-Generative Models
HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction
GPT-4o System Card
Reddit says companies must pay for data access
LongROPE: Extending LLM Context Window Beyond 2 Million Tokens
Tensors from scratch series
Meta AI's Segment Anything Model (SAM) 2
Zombie Internet
Meta Large Language Model Compiler: Foundation Models of Compiler Optimization
Deep Questions - Debunking AI Model Capabilities / Distributed Webs of Trust
Mapping the Mind of a Large Language Model
Claude's Character
Ultravox - An open, fast, and extensible multimodal LLM
Apple - Private Cloud Compute
Introducing Apple’s On-Device and Server Foundation Models
Introducing Apple Intelligence
The Verge - Apple WWDC 2024 keynote in 18 minutes
Andrej Karpathy - Let's reproduce GPT-2 (124M)
TinyAgent: Function Calling at the Edge
Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20
Introducing Snowflake Arctic
SAMMO: A general-purpose framework for prompt optimization
Google Penzai
Introducing Phi-3
Introducing Llama 3
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
DE-COP: Detecting Copyrighted Content in Language Models Training Data
Using LLM to select the right SQL Query from candidates
RecurrentGemma - Open weights language model from Google DeepMind, based on Griffin.
ARAGOG: Advanced RAG Output Grading
Large Language Models Are Zero-Shot Time Series Forecasters
ReALM: Reference Resolution As Language Modeling
Introducing Stable Audio 2.0
Start using ChatGPT instantly
Announcing DBRX: A new standard for efficient open source LLMs
One-step Diffusion with Distribution Matching Distillation
Stability CEO Resigns
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Releasing Common Corpus: the largest public domain dataset for training LLMs
Machine Learning for Games Course - HuggingFace
Quanto: a PyTorch quantization toolkit
Demystifying Embedding Spaces using Large Language Models
The Tokenizer Playground
Introducing Stable Video 3D: Quality Novel View Synthesis and 3D Generation from Single Images
Nvidia reveals Blackwell B200 GPU
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
LaVague - Large Action Model framework
Spreadsheets are all you need
Building Meta’s GenAI Infrastructure
OpenAI Transformer Debugger
Diffusion Models From Scratch
Diffusion models from scratch, from a new theoretical perspective
Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators
Grok-1
Ollama now supports AMD graphics cards
What I learned from looking at 900 most popular open source AI tools
You can now train a 70b language model at home
Levels of Complexity: RAG Applications
Inflection-2.5: meet the world's best personal AI
Training great LLMs entirely from ground up in the wilderness as a startup
Stable Diffusion 3: Research Paper
Wix’s new AI chatbot builds websites in seconds based on prompts
Gemma PyTorch
Introducing the next generation of Claude
GGUF, the long way around
Predictive Human Preference: From Model Ranking to Model Routing
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Tumblr and WordPress to Sell Users’ Data to Train AI Tools
The latest Microsoft Copilot update on Android makes me mourn the death of Cortana
Announcing Mistral Large
GPT in 500 lines of SQL
Jim Cramer says McDonald’s embracing AI at drive-thrus is good news for Nvidia
Stable Diffusion 3 - Early Preview
The AI Study Guide: Azure Machine Learning Edition
The killer app of Gemini Pro 1.5 is video
HuggingChat
Gemma: Introducing new state-of-the-art open models
Cosmopedia v0.1
MLX Swift - On-device ML research with MLX and Swift
Ollama - Windows Preview
HuggingFace - Open Source AI Cookbook
V-JEPA: The next step toward Yann LeCun’s vision of advanced machine intelligence (AMI)
Magic.dev
OpenAI Sora - Creating video from text
Introducing Gemini 1.5
The text file that runs the internet
GraphRAG: Unlocking LLM discovery on narrative private data
NVIDIA Chat with RTX
Stable Cascade
Memory and new controls for ChatGPT
Introducing Nomic Embed: A Truly Open Embedding Model
LangChain - OpenGPTs
Eagle 7B : Soaring past Transformers with 1 Trillion Tokens Across 100+ Languages (RWKV-v5)
Ollama - Python & JavaScript Libraries
Google’s Hugging Face deal puts ‘supercomputer’ power behind open-source AI
FOSDEM 2024 Schedule
OpenAI Microscope
NightShade
Introducing Stable LM 2 1.6B
Stable Code 3B - Coding on the Edge
Sampling for Text Generation
Talking about Open Source LLMs on Oxide and Friends
SingSong - Generating musical accompaniments from singing
LeftoverLocals: Listening to LLM responses through leaked GPU local memory
AI for economists - prompts and resources
Every - Daily Newsletter
More than an OpenAI Wrapper: Perplexity Pivots to Open Source
My AI Timelines Have Sped Up (Again)
Introducing the GPT Store
Ferret: Refer and Ground Anything Anywhere at Any Granularity
Conor McGregor pitching Zune and Windows Phone to Cristiano Ronaldo
VideoPoet: A large language model for zero-shot video generation
Midjourney v6
LangChain State of AI 2023
Phi-2 now on HuggingFace
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
OpenAI - Prompt engineering
Solo - an AI website builder for solopreneurs
MemoryCache - Augmenting Local AI with Browser Data
Mozilla Innovation Week - Explore the Future of AI with Mozilla
Bash One-Liners for LLMs
Mixtral 8x7B on Apple Silicon with MLX
Steering at the Frontier: Extending the Power of Prompting
promptbase
LLM360: Towards Fully Transparent Open-Source LLMs
Phi-2: The surprising power of small language models
Answer.AI - A new old kind of R&D lab
Introducing Stable LM Zephyr 3B
The Geometry of Truth: Dataexplorer
State of AI Report - 2023
OnnxStream - Stable Diffusion XL 1.0 Base on a Raspberry Pi Zero 2
Evaluating LLMs is a minefield
MemGPT - Towards LLMs as Operating Systems
Best Practices for LLM Evaluation of RAG Applications
MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications
Metadata-Curated Language-Image Pre-training (MetaCLIP) - Demystifying CLIP Data
OpenAgents: An Open Platform for Language Agents in the Wild
The Foundation Model Transparency Index
The New Kings of Open Source AI (Oct 2023 Recap)
Mixtral of experts
SatCLIP - A Global, General-Purpose Geographic Location Encoder
Long context prompting for Claude 2.1
Introducing Gemini
Introducing llamafile
AI and Mass Spying
AI Alliance Launches
Understanding Deep Learning
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Chain-of-Verification Reduces Hallucination in Large Language Models
Multimodality and Large Multimodal Models (LMMs)
HuggingFace: Text Embeddings Inference
Scaffolded LLMs as natural language computers
Creating the First Confidential GPUs
The AI Attack Surface Map v1.0
Mistral 7B Model
Meta announces AI experiences in Facebook, Instagram, WhatsApp
Carton - Run any ML model from any programming language
FlowiseAI - Drag and drop for LLM flows
Why Open Source AI Will Win
Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes
vim + llm = 🔥
Next-Gen CPU Acceleration: AVX For Generative AI
ChatGPT can now see, hear, and speak
Amazon and Anthropic announce strategic collaboration to advance generative AI
DALL·E 3
Optimizing your LLM in production
Software²
PointLLM: Empowering Large Language Models to Understand Point Clouds
How consumers are using Generative AI
Coqui 🐸 XTTS
Introducing Würstchen: Fast Diffusion for Image Generation
Efficient Controllable Generation for SDXL with T2I-Adapters
Spread Your Wings: Falcon 180B is here
Modular: Mojo🔥 - It’s finally here!
Generative AI and .NET - Part 2 SDK
Rethinking trust in direct messages in the AI era
Perplexity: Interactive LLM visualization
Can LLMs learn from a single example?
Teaching with AI
Generative AI and .NET - Part 1 Intro
Llama 2 7B/13B are now available in Web LLM
Supporting the Open Source AI Community
Making Large Language Models Work For You
Consciousness is a Big Suitcase
Introducing Code Llama, a state-of-the-art large language model for coding
Announcing Python in Excel
Large Language Models with Semantic Search
Patterns for Building LLM-based Systems and Products
HuggingFace Candle
Open challenges in LLM research
Jupyter AI Brings Generative AI to Notebooks
Announcing xAI
Introducing Keras Core: Keras for TensorFlow, JAX, and PyTorch.
GPT-4 API general availability
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Gorilla: Large Language Model Connected with Massive APIs
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
StarCoder: A State-of-the-Art LLM for Code
LoRA: Low-Rank Adaptation of Large Language Models
Shap-E: Generating Conditional 3D Implicit Functions
Massively Multilingual Speech (MMS)
PaLM 2
Transformers Agent
Copilot for Docs
ChatGPT Prompt Engineering for Developers
ImageBind: One Embedding Space To Bind Them All
Language models can explain neurons in language models
How generative AI is changing the way developers work
PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware
Wikipedia embeddings dataset
LLaVA: Large Language and Vision Assistant
WebGPU API
Consistency Models
Free Dolly
Generative Agents: Interactive Simulacra of Human Behavior
Koala: A Dialogue Model for Academic Research