ai
A list of posts tagged ai
Blogs
- Configure Ollama on Dev Containers and VS Code
- Getting started with Ollama on Windows
- Using Generative AI to produce Spotify Clips
- AI like it's 1999 or 1899
- Quick thoughts about Snapdragon Summit 2023
- Deploy ML.NET Machine Learning Model in Blazor WebAssembly Static Website
- Use machine learning to categorize web links with F# and ML.NET
- Restaurant Inspections ETL & Data Enrichment with Spark.NET and ML.NET Automated (Auto) ML
- The Case for Doing Machine Learning with F#
- Operationalizing Machine Learning with ML.NET, Azure DevOps and Azure Container Instances
- Serverless Machine Learning with ML.NET and Azure Functions
- Deploy .NET Machine Learning Models with ML.NET, ASP.NET Core, Docker and Azure Container Instances
Notes
- Clock Tables - Org Mode, Plain Text, and AI
- These models are too damn big!
- Use AI to generate a blogroll others can subscribe to
- New Era of Work - Windows / Surface Event Blog (March 21, 2024)
- Book Review - Agency
- Down the weird web
- What about instrumentals? AI Generated Spotify Clips Addendum
- AI abundance after scarcity cycles
- Quick Thoughts Snapdragon Summit 2023 Addendum
- New AI generated phone wallpaper
- Dall-E Outpainting and generative models
- Vertigo AI
- Web Neural Network API - Working Draft
- Next gen stick figures using AI and NVIDIA Canvas
Responses
- Join me at DEVintersection in Las Vegas - September 10-12
- Bringing Llama 3 to life
- Transformers in music recommendation
- Transformer Explainer: Interactive Learning of Text-Generative Models
- HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction
- GPT-4o System Card
- Reddit says companies must pay for data access
- LongROPE: Extending LLM Context Window Beyond 2 Million Tokens
- Tensors from scratch series
- Meta AI's Segment Anything Model (SAM) 2
- Zombie Internet
- Meta Large Language Model Compiler: Foundation Models of Compiler Optimization
- Deep Questions - Debunking AI Model Capabilities / Distributed Webs of Trust
- Mapping the Mind of a Large Language Model
- Claude's Character
- Ultravox - An open, fast, and extensible multimodal LLM
- Apple - Private Cloud Compute
- Introducing Apple’s On-Device and Server Foundation Models
- Introducing Apple Intelligence
- The Verge - Apple WWDC 2024 keynote in 18 minutes
- Andrej Karpathy - Let's reproduce GPT-2 (124M)
- TinyAgent: Function Calling at the Edge
- Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20
- Introducing Snowflake Arctic
- SAMMO: A general-purpose framework for prompt optimization
- Google Penzai
- Introducing Phi-3
- Introducing Llama 3
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
- DE-COP: Detecting Copyrighted Content in Language Models Training Data
- Using LLM to select the right SQL Query from candidates
- RecurrentGemma - Open weights language model from Google DeepMind, based on Griffin.
- ARAGOG: Advanced RAG Output Grading
- Large Language Models Are Zero-Shot Time Series Forecasters
- ReALM: Reference Resolution As Language Modeling
- Introducing Stable Audio 2.0
- Start using ChatGPT instantly
- Announcing DBRX: A new standard for efficient open source LLMs
- One-step Diffusion with Distribution Matching Distillation
- Stability CEO Resigns
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
- Releasing Common Corpus: the largest public domain dataset for training LLMs
- Machine Learning for Games Course - HuggingFace
- Quanto: a PyTorch quantization toolkit
- Demystifying Embedding Spaces using Large Language Models
- The Tokenizer Playground
- Introducing Stable Video 3D: Quality Novel View Synthesis and 3D Generation from Single Images
- Nvidia reveals Blackwell B200 GPU
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
- LaVague - Large Action Model framework
- Spreadsheets are all you need
- Building Meta’s GenAI Infrastructure
- OpenAI Transformer Debugger
- Diffusion Models From Scratch
- Diffusion models from scratch, from a new theoretical perspective
- Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators
- Grok-1
- Ollama now supports AMD graphics cards
- What I learned from looking at 900 most popular open source AI tools
- You can now train a 70b language model at home
- Levels of Complexity: RAG Applications
- Inflection-2.5: meet the world's best personal AI
- Training great LLMs entirely from ground up in the wilderness as a startup
- Stable Diffusion 3: Research Paper
- Wix’s new AI chatbot builds websites in seconds based on prompts
- Gemma PyTorch
- Introducing the next generation of Claude
- GGUF, the long way around
- Predictive Human Preference: From Model Ranking to Model Routing
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
- Tumblr and WordPress to Sell Users’ Data to Train AI Tools
- The latest Microsoft Copilot update on Android makes me mourn the death of Cortana
- Announcing Mistral Large
- GPT in 500 lines of SQL
- Jim Cramer says McDonald’s embracing AI at drive-thrus is good news for Nvidia
- Stable Diffusion 3 - Early Preview
- The AI Study Guide: Azure Machine Learning Edition
- The killer app of Gemini Pro 1.5 is video
- HuggingChat
- Gemma: Introducing new state-of-the-art open models
- Cosmopedia v0.1
- MLX Swift - On-device ML research with MLX and Swift
- Ollama - Windows Preview
- HuggingFace - Open Source AI Cookbook
- V-JEPA: The next step toward Yann LeCun’s vision of advanced machine intelligence (AMI)
- Magic.dev
- OpenAI Sora - Creating video from text
- Introducing Gemini 1.5
- The text file that runs the internet
- GraphRAG: Unlocking LLM discovery on narrative private data
- NVIDIA Chat with RTX
- Stable Cascade
- Memory and new controls for ChatGPT
- Introducing Nomic Embed: A Truly Open Embedding Model
- LangChain - OpenGPTs
- Eagle 7B : Soaring past Transformers with 1 Trillion Tokens Across 100+ Languages (RWKV-v5)
- Ollama - Python & JavaScript Libraries
- Google’s Hugging Face deal puts ‘supercomputer’ power behind open-source AI
- FOSDEM 2024 Schedule
- OpenAI Microscope
- NightShade
- Introducing Stable LM 2 1.6B
- Stable Code 3B - Coding on the Edge
- Sampling for Text Generation
- Talking about Open Source LLMs on Oxide and Friends
- SingSong - Generating musical accompaniments from singing
- LeftoverLocals: Listening to LLM responses through leaked GPU local memory
- AI for economists - prompts and resources
- Every - Daily Newsletter
- More than an OpenAI Wrapper: Perplexity Pivots to Open Source
- My AI Timelines Have Sped Up (Again)
- Introducing the GPT Store
- Ferret: Refer and Ground Anything Anywhere at Any Granularity
- Conor McGregor pitching Zune and Windows Phone to Cristiano Ronaldo
- VideoPoet: A large language model for zero-shot video generation
- Midjourney v6
- LangChain State of AI 2023
- Phi-2 now on HuggingFace
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
- OpenAI - Prompt engineering
- Solo - an AI website builder for solopreneurs
- MemoryCache - Augmenting Local AI with Browser Data
- Mozilla Innovation Week - Explore the Future of AI with Mozilla
- Bash One-Liners for LLMs
- Mixtral 8x7B on Apple Silicon with MLX
- Steering at the Frontier: Extending the Power of Prompting
- promptbase
- LLM360: Towards Fully Transparent Open-Source LLMs
- Phi-2: The surprising power of small language models
- Answer.AI - A new old kind of R&D lab
- Introducing Stable LM Zephyr 3B
- The Geometry of Truth: Dataexplorer
- State of AI Report - 2023
- OnnxStream - Stable Diffusion XL 1.0 Base on a Raspberry Pi Zero 2
- Evaluating LLMs is a minefield
- MemGPT - Towards LLMs as Operating Systems
- Best Practices for LLM Evaluation of RAG Applications
- MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications
- Metadata-Curated Language-Image Pre-training (MetaCLIP) - Demystifying CLIP Data
- OpenAgents: An Open Platform for Language Agents in the Wild
- The Foundation Model Transparency Index
- The New Kings of Open Source AI (Oct 2023 Recap)
- Mixtral of experts
- SatCLIP - A Global, General-Purpose Geographic Location Encoder
- Long context prompting for Claude 2.1
- Introducing Gemini
- Introducing llamafile
- AI and Mass Spying
- AI Alliance Launches
- Understanding Deep Learning
- The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
- Chain-of-Verification Reduces Hallucination in Large Language Models
- Multimodality and Large Multimodal Models (LMMs)
- HuggingFace: Text Embeddings Inference
- Scaffolded LLMs as natural language computers
- Creating the First Confidential GPUs
- The AI Attack Surface Map v1.0
- Mistral 7B Model
- Meta announces AI experiences in Facebook, Instagram, WhatsApp
- Carton - Run any ML model from any programming language
- FlowiseAI - Drag and drop for LLM flows
- Why Open Source AI Will Win
- Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes
- vim + llm = 🔥
- Next-Gen CPU Acceleration: AVX For Generative AI
- ChatGPT can now see, hear, and speak
- Amazon and Anthropic announce strategic collaboration to advance generative AI
- DALL·E 3
- Optimizing your LLM in production
- Software²
- PointLLM: Empowering Large Language Models to Understand Point Clouds
- How consumers are using Generative AI
- Coqui 🐸 XTTS
- Introducing Würstchen: Fast Diffusion for Image Generation
- Efficient Controllable Generation for SDXL with T2I-Adapters
- Spread Your Wings: Falcon 180B is here
- Modular: Mojo🔥 - It’s finally here!
- Generative AI and .NET - Part 2 SDK
- Rethinking trust in direct messages in the AI era
- Perplexity: Interactive LLM visualization
- Can LLMs learn from a single example?
- Teaching with AI
- Generative AI and .NET - Part 1 Intro
- Llama 2 7B/13B are now available in Web LLM
- Supporting the Open Source AI Community
- Making Large Language Models Work For You
- Consciousness is a Big Suitcase
- Introducing Code Llama, a state-of-the-art large language model for coding
- Announcing Python in Excel
- Large Language Models with Semantic Search
- Patterns for Building LLM-based Systems and Products
- HuggingFace Candle
- Open challenges in LLM research
- Jupyter AI Brings Generative AI to Notebooks
- Announcing xAI
- Introducing Keras Core: Keras for TensorFlow, JAX, and PyTorch.
- GPT-4 API general availability
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4
- Gorilla: Large Language Model Connected with Massive APIs
- Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
- StarCoder: A State-of-the-Art LLM for Code
- LoRA: Low-Rank Adaptation of Large Language Models
- Shap-E: Generating Conditional 3D Implicit Functions
- Massively Multilingual Speech (MMS)
- PaLM 2
- Transformers Agent
- Copilot for Docs
- ChatGPT Prompt Engineering for Developers
- ImageBind: One Embedding Space To Bind Them All
- Language models can explain neurons in language models
- How generative AI is changing the way developers work
- PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware
- Wikipedia embeddings dataset
- LLaVA: Large Language and Vision Assistant
- WebGPU API
- Consistency Models
- Free Dolly
- Generative Agents: Interactive Simulacra of Human Behavior
- Koala: A Dialogue Model for Academic Research