Responses

lqdev👽06/30/2025

https://www.theverge.com/news/695124/tumblr-wordpress-automattic-fediverse-integration-on-hold-decoder

That's unfortunate. Having Tumblr as your frontend for posting and curating feeds, with WordPress as the backend and integrated with decentralized networks like the Fediverse for distribution would've been amazing. It sounds like Tumblr is still strategic to Automattic and as more people move away from the centralized social platforms and towards their own corners of the internet, having these differente building blocks in place will be even more important.

Permalink: /feed/tumblr-wordpress-fediverse-integration-pause/

Tags: #tumblr #wordpress #fediverse #automattic

lqdev👽06/29/2025

https://marginalrevolution.com/marginalrevolution/2025/06/privatize-federal-land.html

I’ve long advocated selling off some federal land...Most of this “public land” is never used by the public. Selling some of it would actually make it more accessible and useful to real people.

How about no. If privatization means, John and Jane Doe can purchase some land for their homestead is one thing. We've tried that before by the way - Homestead Act. In reality, it means some private equity firm or hedge fund buying the land, building expensive and shoddy cookit-cutter condos, and selling you a subscription for a roof over your head. I say condos because it's hot and there's not a lot of water in Vegas, so keeping those brand new AI data centers cool might be a problem.

Also, if you haven't seen that area that Alex highlights in his post, it seems like a deserted landscape. And yes, the terrain can be rough, but actually, it's a beautiful landscape that is best left undisturbed.

Permalink: /feed/do-not-privatize-federal-land/

Tags: #bureaulandmanagement #uspol #land

lqdev👽06/16/2025

https://jyopari.github.io/posts/seal

Large language models (LLMs) are powerful but static; they lack mechanisms to adapt their weights in response to new tasks, knowledge, or examples. We introduce Self-Adapting LLMs (SEAL) 🦭, a framework that enables LLMs to self-adapt by generating their own finetuning data and update directives. Given a new input, the model produces a self-edit — a generation that may restructure the information in different ways, specify optimization hyperparameters, or invoke tools for data augmentation and gradient-based updates. Through supervised finetuning (SFT), these self-edits result in persistent weight updates, enabling lasting adaptation. To train the model to produce effective self-edits, we use a reinforcement learning loop, using the downstream performance of the updated model as the reward signal. Unlike prior approaches that rely on separate adaptation modules or auxiliary networks, SEAL directly uses the model's generation to parameterize and control its own adaptation process. Experiments on knowledge incorporation and few-shot generalization show that SEAL is a promising step toward language models capable of self-directed adaptation in response to new data.

We demonstrate SEAL in two domains: (1) Knowledge Incorporation, where the model integrates new factual information by generating logical implications as synthetic data, and (2) Few-Shot Learning, where the model autonomously selects data augmentations and training hyperparameters to adapt to new abstract reasoning tasks.

Code
Paper

Permalink: /feed/self-adapting-language-models-seal/

Tags: #agent #ai #llm #mit #research

lqdev👽06/16/2025

https://www.anthropic.com/engineering/built-multi-agent-research-system

Great article.

TLDR: These are largely engineering problems.

The journey of this multi-agent system from prototype to production taught us critical lessons about system architecture, tool design, and prompt engineering. A multi-agent system consists of multiple agents (LLMs autonomously using tools in a loop) working together. Our Research feature involves an agent that plans a research process based on user queries, and then uses tools to create parallel agents that search for information simultaneously. Systems with multiple agents introduce new challenges in agent coordination, evaluation, and reliability.

Benefits of a multi-agent system

The essence of search is compression: distilling insights from a vast corpus. Subagents facilitate compression by operating in parallel with their own context windows, exploring different aspects of the question simultaneously before condensing the most important tokens for the lead research agent. Each subagent also provides separation of concerns—distinct tools, prompts, and exploration trajectories—which reduces path dependency and enables thorough, independent investigations.

Once intelligence reaches a threshold, multi-agent systems become a vital way to scale performance. For instance, although individual humans have become more intelligent in the last 100,000 years, human societies have become exponentially more capable in the information age because of our collective intelligence and ability to coordinate. Even generally-intelligent agents face limits when operating as individuals; groups of agents can accomplish far more.

Our internal evaluations show that multi-agent research systems excel especially for breadth-first queries that involve pursuing multiple independent directions simultaneously.

Multi-agent systems work mainly because they help spend enough tokens to solve the problem...Multi-agent architectures effectively scale token usage for tasks that exceed the limits of single agents.

We’ve found that multi-agent systems excel at valuable tasks that involve heavy parallelization, information that exceeds single context windows, and interfacing with numerous complex tools.

Architecture overview

Our Research system uses a multi-agent architecture with an orchestrator-worker pattern, where a lead agent coordinates the process while delegating to specialized subagents that operate in parallel.

Traditional approaches using Retrieval Augmented Generation (RAG) use static retrieval. That is, they fetch some set of chunks that are most similar to an input query and use these chunks to generate a response. In contrast, our architecture uses a multi-step search that dynamically finds relevant information, adapts to new findings, and analyzes results to formulate high-quality answers.

Prompt engineering and evaluations for research agents

Think like your agents. To iterate on prompts, you must understand their effects. To help us do this, we built simulations using our Console with the exact prompts and tools from our system, then watched agents work step-by-step.

Teach the orchestrator how to delegate. In our system, the lead agent decomposes queries into subtasks and describes them to subagents. Each subagent needs an objective, an output format, guidance on the tools and sources to use, and clear task boundaries. Without detailed task descriptions, agents duplicate work, leave gaps, or fail to find necessary information.

Scale effort to query complexity. Agents struggle to judge appropriate effort for different tasks, so we embedded scaling rules in the prompts...

Tool design and selection are critical. Agent-tool interfaces are as critical as human-computer interfaces. Using the right tool is efficient—often, it’s strictly necessary...Bad tool descriptions can send agents down completely wrong paths, so each tool needs a distinct purpose and a clear description.

Let agents improve themselves....When given a prompt and a failure mode, they are able to diagnose why the agent is failing and suggest improvements.

Start wide, then narrow down. Search strategy should mirror expert human research: explore the landscape before drilling into specifics.

Guide the thinking process....Our testing showed that extended thinking improved instruction-following, reasoning, and efficiency. Subagents also plan, then use interleaved thinking after tool results to evaluate quality, identify gaps, and refine their next query. This makes subagents more effective in adapting to any task.

Parallel tool calling transforms speed and performance....For speed, we introduced two kinds of parallelization: (1) the lead agent spins up 3-5 subagents in parallel rather than serially; (2) the subagents use 3+ tools in parallel.

Our prompting strategy focuses on instilling good heuristics rather than rigid rules. We studied how skilled humans approach research tasks and encoded these strategies in our prompts—strategies like decomposing difficult questions into smaller tasks, carefully evaluating the quality of sources, adjusting search approaches based on new information, and recognizing when to focus on depth (investigating one topic in detail) vs. breadth (exploring many topics in parallel).

Effective evaluation of agents

...evaluating multi-agent systems presents unique challenges...Because we don’t always know what the right steps are, we usually can't just check if agents followed the “correct” steps we prescribed in advance. Instead, we need flexible evaluation methods that judge whether agents achieved the right outcomes while also following a reasonable process.

Start evaluating immediately with small samples....it’s best to start with small-scale testing right away with a few examples, rather than delaying until you can build more thorough evals.

LLM-as-judge evaluation scales when done well....We used an LLM judge that evaluated each output against criteria in a rubric: factual accuracy (do claims match sources?), citation accuracy (do the cited sources match the claims?), completeness (are all requested aspects covered?), source quality (did it use primary sources over lower-quality secondary sources?), and tool efficiency (did it use the right tools a reasonable number of times?)...[we] found that a single LLM call with a single prompt outputting scores from 0.0-1.0 and a pass-fail grade was the most consistent and aligned with human judgements.

Human evaluation catches what automation misses....Adding source quality heuristics to our prompts helped resolve this issue. Even in a world of automated evaluations, manual testing remains essential.

the best prompts for these agents are not just strict instructions, but frameworks for collaboration that define the division of labor, problem-solving approaches, and effort budgets. Getting this right relies on careful prompting and tool design, solid heuristics, observability, and tight feedback loops.

Production reliability and engineering challenges

Agents are stateful and errors compound....Agents can run for long periods of time, maintaining state across many tool calls. This means we need to durably execute code and handle errors along the way. Without effective mitigations, minor system failures can be catastrophic for agents. When errors occur, we can't just restart from the beginning...We combine the adaptability of AI agents built on Claude with deterministic safeguards like retry logic and regular checkpoints.

Debugging benefits from new approaches...Adding full production tracing let us diagnose why agents failed and fix issues systematically. Beyond standard observability, we monitor agent decision patterns and interaction structures—all without monitoring the contents of individual conversations, to maintain user privacy.

Deployment needs careful coordination...we use rainbow deployments to avoid disrupting running agents, by gradually shifting traffic from old to new versions while keeping both running simultaneously.

Synchronous execution creates bottlenecks. Currently, our lead agents execute subagents synchronously, waiting for each set of subagents to complete before proceeding. This simplifies coordination, but creates bottlenecks in the information flow between agents. Asynchronous execution would enable additional parallelism: agents working concurrently and creating new subagents when needed. But this asynchronicity adds challenges in result coordination, state consistency, and error propagation across the subagents. As models can handle longer and more complex research tasks, we expect the performance gains will justify the complexity.

Conclusion

For all the reasons described in this post, the gap between prototype and production is often wider than anticipated.

Multi-agent research systems can operate reliably at scale with careful engineering, comprehensive testing, detail-oriented prompt and tool design, robust operational practices, and tight collaboration between research, product, and engineering teams who have a strong understanding of current agent capabilities.

Permalink: /feed/anthropic-how-we-built-multi-agent-system/

Tags: #agent #ai #anthropic #engineering

lqdev👽06/14/2025

https://swicg.github.io/activitypub-e2ee/mls

Messaging Layer Security (MLS) is an IETF standard for end-to-end encrypted (E2EE) messaging. It lets people on laptops and phones communicate with each other in a secure way that no one in between can see.

MLS is designed to use pluggable lower-level protocols. This specification defines an envelope format for distributing MLS messages through the network, and an Activity Streams 2.0 profile for the packets of application data stored inside the messages.

This specification is ready for review from both ActivityPub developers and security analysts. It’s time to start making proof-of-concept implementations and testing interoperability.

Source: https://socialwebfoundation.org/2025/06/13/mls-over-activitypub-draft/

Permalink: /feed/mls-over-activitypub-draft/

Tags: #protocol #socialweb #activitypub #w3c

lqdev👽06/05/2025

https://simonrepp.com/faircamp/

This is such a cool project. Looks like it's super simple to get started with and it's packed with a ton of features so creators get to stay in their flow.

I also like how creators are showcased.

Permalink: /feed/faircamp-static-site-audio-producers/

Tags: #faircamp #music #socialweb #indieweb #staticsite #technology #tools #openweb #opensource

lqdev👽06/03/2025

https://www.youtube.com/watch?v=DL82mGde6wo

Permalink: /feed/state-of-the-art-prompting-ai-agents-y-combinator/

Tags: #ai #agents #prompting

lqdev👽06/03/2025

https://elevenlabs.io/blog/conversational-ai-2-0

Conversational AI 2.0 launches with advanced features and enterprise readiness.

Natural turn-taking to understand the flow of conversation.
Multilingual communication with integrated language detection
Integrated RAG: knowledgeable agents, minimum latency, maximum privacy
Multimodality
Batch calls

Permalink: /feed/elevenlabs-conversational-ai-2-0/

Tags: #ai #speech #elevenlabs

lqdev👽06/03/2025

https://sakana.ai/dgm/

A longstanding goal of AI research has been the creation of AI that can learn indefinitely. One tantalizing path toward that goal is an AI that improves itself by rewriting its own code, including any code responsible for learning. That idea, known as a Gödel Machine, proposed by Jürgen Schmidhuber decades ago, is a hypothetical self-improving AI. It optimally solves problems by recursively rewriting its own code when it can mathematically prove a better strategy, making it a key concept in meta-learning or “learning to learn.”

While the theoretical Gödel Machine promised provably beneficial self-modifications, its realization relied on an impractical assumption: that the AI could mathematically prove that a proposed change in its own code would yield a net improvement before adopting it. We, in collaboration with Jeff Clune’s lab at UBC, propose something more feasible: a system that harnesses the principles of open-ended algorithms like Darwinian evolution to search for improvements that empirically improve performance.

We call the result the Darwin Gödel Machine (full technical report). DGMs leverage foundation models to propose code improvements, and use recent innovations in open-ended algorithms to search for a growing library of diverse, high-quality AI agents. Our experiments show that DGMs improve themselves the more compute they are provided. In line with the clear trend that AI systems that rely on learning ultimately outperform those designed by hand, there is a potential that DGMs could soon outperform hand-designed AI systems.

Technical Report

Permalink: /feed/sakana-ai-gwm/

Tags: #ai #agent #sakana

lqdev👽05/30/2025

https://discourse.32bit.cafe/t/resources-list-for-the-personal-web/49

This list is to help guide and help those who are in every stage of their web-building journey on the personal web. This list is not meant to overwhelm you, but rather give you options and find tools, graphics, utilities, codes, and everything in between to help get you creating more on the independent web.

Permalink: /feed/resource-list-personal-web/

Tags: #32bitcafe #personalweb #indieweb #openweb #community #retro

lqdev👽05/26/2025

https://cheapskatesguide.org/articles/small-social-growth.html

Last month during Meta's antitrust trial, Mark Zuckerberg admitted that his company's focus has changed radically from where it began...[his] testimony implies that Facebook (Meta) may no longer be as much of an option if we actually want to talk to each other on line. But, neither are many of the other large social media sites.

...if companies allowed us to discover great new small blogs and small social media sites, many of us would decide to spend more time there. Then, companies would be less able to advertise to us on their own sites.

...we have been told by the mainstream media for well over a decade that small social media is dead. They say the same about small blogs, yet small blogs are actually more plentiful than ever, with over 600 million blogs on the Internet and about 7 million new blog articles published each day.

If you decide to create your own forum, do yourself a favor. The way to protect yourself and your users is by running your social media site at a domain that you own and on a server that you control, and don't agree to hand over any of your rights to anyone for any reason.

I believe large corporate-run social media sites are vulnerable in a way they have not been for twenty years. Their users have reached a state of extreme dissatisfaction with their "enshittified" platforms, and they are looking for something, anything, better.

But each of you who are reading these words can help end this situation by creating your own small social media site.

All you need to start your own small site is a domain name, a server that you can buy, rent, or set up on an old computer, perhaps social media software (see above), patience, a willingness to work, and ideas for ways of notifying potential users of the existence of your site.

Run your new site as you and your users see fit, and make it far superior to Facebook, Twitter (X), Reddit, or any of the other major sites whose owners think you have no other options.

Permalink: /feed/stage-set-growth-small-social-media-cheapskate/

Tags: #social #openweb #personalweb

lqdev👽05/26/2025

https://audmcname.com/comics/rss-is-not-dead-yet/

Illustrating the history of RSS. Created in collaboration with A. Service, and debuted at VanCAF 2023

Cover of Rss is not dead yet comic — Source: *audmcname.com*

Permalink: /feed/rss-not-dead-yet-mcnamee/

Tags: #rss #comic #art

lqdev👽05/22/2025

https://support.mozilla.org/en-US/kb/future-of-pocket

I haven't used Pocket in a long time but sad to hear it's shutting down. It's great they're offering the option of letting you export your data and platforms like Micro.blog are making it easy to host that content on your own site (assuming you're using Micro.blog).

For bookmarking solutions, I've been using my website as well as messages-to-self on Element. What's on my website is the content that I really want to make sure I archive, whereas the messages to self I treat more as a read-it-later solution. Eventually, some of those make it to my website. I'm working on my mobile publishing flow to simplify my bookmarking process but overall, I'm happy with my current system.

This is also another great reminder why owning your content is important.

Permalink: /feed/pocket-shutting-down/

Tags: #pocket #mozilla #bookmark

lqdev👽05/17/2025

https://agent-network-protocol.com/

Comparison of MCP and ANP: What Kind of Communication Protocol Do Agents Need?

MCP and ANP differ significantly in protocol architecture, identity authentication, and information organization.

MCP is a typical Client-Server (CS) architecture, while ANP is a typical Peer-to-Peer (P2P) architecture.

MCP's identity authentication is based on the OAuth standard, facilitating client access to current internet resources. ANP's identity authentication is based on the W3C DID standard, focusing on cross-platform interoperability among agents, enabling seamless connectivity.

MCP organizes information using JSON-RPC technology, essentially API calls. ANP uses semantic web Linked-Data technology to build a data network that is easily accessible and understandable by AI.

ANP is agent-centric, where each agent has equal status, forming a decentralized agent collaboration network.

Permalink: /feed/agent-network-protocol/

Tags: #agent #ai #protocol

lqdev👽05/16/2025

https://www.nbcchicago.com/news/local/satellite-imagery-shows-large-dust-storm-over-chicago-area-from-space/3748009/

Never change Chicago.

What felt like the first week of good weather since forever ends with a dust storm.

Permalink: /feed/chicago-dust-storm/

Tags: #chicago #weather #duststorm

lqdev👽05/11/2025

https://www.manton.org/2025/05/10/openai-rolls-out-new-things.html

💯💯💯. Artifacts are great. In ChatGPT when I find myself trying to format outputs as markdown, the UI gets confused and some of the content is rendered in markdown while the rest gets rendered as part of the UI which makes it unusable and hard to copy. The best way I've found to get around it is to ask it to format in org-mode or asciidoc. Claude Artifacts make this a non-issue. I also love that in addition to copying and downloading the artifact, you can also publish it as a URL.

Permalink: /feed/claude-artifacts-reece/

Tags: #claude #chatgpt #plaintext #ai #org #emacs #asciidoc #markdown

lqdev👽05/10/2025

https://hypha.coop/dripline/announcing-dp-social-inbox/

I hadn't heard of Distributed Press and this is an older post (from 2023), but the idea is interesting.

Hypha and Sutty are thrilled to announce the release of the Social Inbox, a new feature of Distributed.Press that integrates a website’s comment section with federated social media platforms like Mastodon. With the Social Inbox enabled, websites obtain their own account on the Fediverse, allowing it to automatically send out new posts to followers at the time of publication. When other users reply to posts, you can approve them to be published to the site as comments. The Social Inbox allows readers to directly engage with your posts where they already are, and gives publishers the ability to incorporate public dialogue into their websites.

Permalink: /feed/announcing-distributed-press-social-inbox/

Tags: #decentralization #indieweb #social #fediverse #dweb

lqdev👽05/09/2025

https://blog.discourse.org/2025/04/discourse-and-the-fediverse/

Two years ago, we started working on a plugin that brings Discourse and the Fediverse closer together. Discourse communities are online spaces that facilitate open collaboration and communication. The Fediverse offers ways to expand the reach of Discourse communities and help them build bridges with people active in other spaces, all while keeping the conversation civil, meaningful and focused. This post will describe how the ActivityPub plugin works and how you can enable your Discourse community to connect with other communities or Fediverse users.

Permalink: /feed/discourse-fediverse/

Tags: #discourse #fediverse #social #openweb #opensource #forum

lqdev👽05/09/2025

https://bukmark.club/

The BUKMARK.CLUB is a collection of websites from across the Internet.

Permalink: /feed/bukmark-club/

Tags: #bookmark #internet #web #community

lqdev👽05/09/2025

https://osteophage.neocities.org/writing/in-praise-of-links

Hyperlinks deserve more recognition in light of all the ways their value has been sidelined and denied.

To that end, I present a link compilation in praise of links. It includes things I agree with entirely and things I don't, spanning from the 2020s to the early 2000s, to supply a tapestry of perspectives, context, and examples on the value and importance of links. Links may have their downsides, challenges, and vulnerabilities, but my hope is that this compilation will (re)invigorate your appreciation for linking as a technology and a social practice, all the better to understand what's at stake when links are discarded and devalued.

Permalink: /feed/in-praise-of-links-osteophage/

Tags: #openweb #internet #indieweb #decentralization #web

lqdev👽05/09/2025

https://thehistoryoftheweb.com/the-evolution-of-blogging/

...around 1997, when Jorn Barger launched his website, titling it “The Robot Wisdom Weblog.” Barger’s plan was to curate his favorite links from around the web, and add bits of commentary to each one. In other words, he meant to keep a log of his web experience. Hence weblog.

In their earliest days, webloggers stood as gatekeepers to the web’s ever-growing well of content. Each day, these URL pioneers would post a few new links and sprinkle in their own commentary. Blogs acted as a signpost for web users, and following a few key blogs was enough to keep track of just about everything new on the web. Many began to look at the blogging community as a brand new type of media, one that often stood far closer to an impartial truth than traditional mediums would allow for.

1999, Peter Merholz threw up his own weblog, and in the sidebar added the tagline: "I’ve decided to pronounce the word “weblog” as wee’- blog. Or “blog” for short."

In the beginning, most bloggers hand coded their sites, adding a new HTML page to their server each time they had a new entry, or just updating the homepage to include the newest links. But soon enough, tools showed up to help bloggers with the process.

...on the fringes, a new type of blog was emerging. The personal blog. These sites ditched the curated links and focused exclusively on commentary. Bloggers used their site to chronicle their personal journey, from the almost boring and banal to the weird and wonderful. This new type of blog was less an alternative media source and more akin to an online journal or diary. And these writers saw themselves not as gatekeepers to the web, but as sharers of their own identity.

New tools soon caught up with this shifting perspective. The first of the bunch to do so was LiveJournal...

Blogger came next. And it would prove to be the most popular of the early blogging tools. The platform first launched in August of 1999, but picked up steam the following year when it introduced the concept of a permalink.

These platforms made blogging open, in every sense of the word. Visitors were greeted with a single, open textarea on a free platform that was open to all to participate.

Blogger became the platform choice for a lot of writers out there. It certainly was for Mena Trott

Trott had the feeling that platforms like Blogger did not go nearly far enough.

Trott wanted her own site to stand out from the crowd. To be quintessentially hers. Blogger wouldn’t let her do that. So together with her husband, a Perl programmer, she built Movable Type...And with that move, Trott ushered in a third generation of blogging.

Movable Type allowed users to set up their blog on their own server, and have complete control over its look and feel.

Blog (the noun and verb) became a word familiar not just to web geeks, but to everyone. Lots of imitators and innovators followed in the wake of Moveable Type. Open source platforms like WordPress rose to meet the rising demand. But for a long time, Moveable Type was the standard. It fed its community with a constant stream of new features that promoted openness and freedom.

Blogging, as a community and a practice, continues to be refined to this day thanks to the dedication of web users everywhere. But its sharpest point was made the day Mena Trott decided that a blog could be more than just some text on a page. It could be one of a kind.

Permalink: /feed/evolution-of-blogging-history-web/

Tags: #blogging #history #web #internet

lqdev👽05/09/2025

https://socialweb.network/

Your gateway to decentralized social networking protocols, resources, and community

Permalink: /feed/socialweb-network/

Tags: #fediverse #social #openweb #decentralized #community #indieweb

lqdev👽05/09/2025

https://huggingface.co/blog/autoround

As large language models (LLMs) and vision-language models (VLMs) continue to grow in size and complexity, deploying them efficiently becomes increasingly challenging. Quantization offers a solution by reducing model size and inference latency. Intel's AutoRound emerges as a cutting-edge quantization tool that balances accuracy, efficiency, and compatibility.

AutoRound is a weight-only post-training quantization (PTQ) method developed by Intel. It uses signed gradient descent to jointly optimize weight rounding and clipping ranges, enabling accurate low-bit quantization (e.g., INT2 - INT8) with minimal accuracy loss in most scenarios. For example, at INT2, it outperforms popular baselines by up to 2.1x higher in relative accuracy.

Paper
Repo

Permalink: /feed/intel-autoround/

Tags: #quantization #intel #ai

lqdev👽05/09/2025

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2

parakeet-tdt-0.6b-v2 is a 600-million-parameter automatic speech recognition (ASR) model designed for high-quality English transcription, featuring support for punctuation, capitalization, and accurate timestamp prediction.

This XL variant of the FastConformer [1] architecture integrates the TDT [2] decoder and is trained with full attention, enabling efficient transcription of audio segments up to 24 minutes in a single pass.

Permalink: /feed/nvidia-parakeet-v2/

Tags: #nvidia #ai #transcription

lqdev👽05/09/2025

https://about.flipboard.com/fediverse/fediverse-house-2025-roundup/

No Walls, Just Vibes at SXSW’s Fediverse House

Fediverse House was SXSW’s first physical gathering dedicated entirely to the constellation of interoperable social platforms powered by protocols like ActivityPub and AT Protocol. (We know “Fediverse” isn’t quite accurate to describe the whole ecosystem; in this case, it was just a convenient, fun way to name the space.)

It didn’t matter which platform you preferred or which protocol you were building on; everyone was focused on the singular goal of building a better internet.

...the real power move for creators is ownership and control of their work and livelihoods.

...identity portability is a radical shift, and...decentralization could lead to more humane social spaces.

Permalink: /feed/fediverse-house-2025-roundup/

Tags: #fediverse #conference #indieweb #openweb

lqdev👽05/09/2025

https://locate3d.atmeta.com/

Locate 3D transforms how AI understands physical space, delivering state-of-the-art object localization in complex 3D environments.

Operating directly on standard sensor data, Locate 3D brings spatial intelligence to robotics and augmented reality applications in real-world settings.

Paper

We present Locate 3D, a model for localizing objects in 3D scenes from referring expressions like “the small coffee table between the sofa and the lamp.” Locate 3D sets a new state-of-the-art on standard referential grounding benchmarks and showcases robust generalization capabilities. Notably, Locate 3D operates directly on sensor observation streams (posed RGB-D frames), enabling real-world deployment on robots and AR devices. Key to our approach is 3D-JEPA, a novel self-supervised learning (SSL) algorithm applicable to sensor point clouds. It takes as input a 3D pointcloud featurized using 2D foundation models (CLIP, DINO). Subsequently, masked prediction in latent space is employed as a pretext task to aid the self-supervised learning of contextualized pointcloud features. Once trained, the 3D-JEPA encoder is finetuned alongside a language-conditioned decoder to jointly predict 3D masks and bounding boxes. Additionally, we introduce Locate 3D Dataset, a new dataset for 3D referential grounding, spanning multiple capture setups with over 130K annotations. This enables a systematic study of generalization capabilities as well as a stronger model.

Permalink: /feed/meta-locate3d/

Tags: #meta #ai #3d #research

lqdev👽05/09/2025

https://alibaba-nlp.github.io/ZeroSearch/

Effective information searching is essential for enhancing the reasoning and generation capabilities of large language models (LLMs). Recent research has explored using reinforcement learning (RL) to improve LLMs' search capabilities by interacting with live search engines in real-world environments. While these approaches show promising results, they face two major challenges: (1) Uncontrolled Document Quality: The quality of documents returned by search engines is often unpredictable, introducing noise and instability into the training process. (2) Prohibitively High API Costs: RL training requires frequent rollouts, potentially involving hundreds of thousands of search requests, which incur substantial API expenses and severely constrain scalability. To address these challenges, we introduce ZeroSearch, a reinforcement learning framework that enhances the search capabilities of LLMs without interacting with real search engines. Our approach begins with lightweight supervised fine-tuning to transform the LLM into a retrieval module capable of generating both relevant and noisy documents in response to a query. During RL training, we employ a curriculum-based rollout strategy that incrementally degrades the quality of generated documents, progressively eliciting the model’s reasoning ability by exposing it to increasingly challenging retrieval scenarios. Extensive experiments demonstrate that ZeroSearch effectively incentivizes the search capabilities of LLMs using a 3B LLM as the retrieval module. Remarkably, a 7B retrieval module achieves comparable performance to the real search engine, while a 14B retrieval module even surpasses it. Furthermore, it generalizes well across both base and instruction-tuned models of varying sizes and is compatible with a wide range of RL algorithms.

Permalink: /feed/zero-search-alibaba/

Tags: #ai #search #llm #research

lqdev👽05/09/2025

https://arxiv.org/abs/2504.16736

It'll be interesting to see once many of the exiting protocols begin to converge towards a standard. There's a ton of existing well-adopted protocol standards out there that when stitched together could create a compelling and native experience. I think eventually we'll get to a place where agents are native parts of the OSI stack and the world wide web.

The rapid development of large language models (LLMs) has led to the widespread deployment of LLM agents across diverse industries, including customer service, content generation, data analysis, and even healthcare. However, as more LLM agents are deployed, a major issue has emerged: there is no standard way for these agents to communicate with external tools or data sources. This lack of standardized protocols makes it difficult for agents to work together or scale effectively, and it limits their ability to tackle complex, real-world tasks. A unified communication protocol for LLM agents could change this. It would allow agents and tools to interact more smoothly, encourage collaboration, and triggering the formation of collective intelligence. In this paper, we provide the first comprehensive analysis of existing agent protocols, proposing a systematic two-dimensional classification that differentiates context-oriented versus inter-agent protocols and general-purpose versus domain-specific protocols. Additionally, we conduct a comparative performance analysis of these protocols across key dimensions such as security, scalability, and latency. Finally, we explore the future landscape of agent protocols by identifying critical research directions and characteristics necessary for next-generation protocols. These characteristics include adaptability, privacy preservation, and group-based interaction, as well as trends toward layered architectures and collective intelligence infrastructures. We expect this work to serve as a practical reference for both researchers and engineers seeking to design, evaluate, or integrate robust communication infrastructures for intelligent agents.

Permalink: /feed/survey-ai-agent-protocols/

Tags: #ai #agents #protocols #research

lqdev👽05/05/2025

https://hamatti.org/posts/resisting-the-urge-to-rewrite-the-website/

I have half-jokingly been entertaining an idea of building a new blog: for every blog post published, you get to add one new feature. The first blog post needs to be written in a plain txt file and deployed as-is to a server. With the second post, you can choose to add something you want: maybe a bit of HTML navigation to be able to have links between the blog posts. The third post could come with Markdown support so your posts are not plain text anymore but rendered into HTML. And so on.

The idea would is that you get to “earn” feature development by writing blog posts. If you want a fancy site with bells and whistles and good tooling, you have to write and publish a lot. That way, you cannot get lost into the rabbit hole of new shiny feature development at the expense of writing.

I like this suggestion. My rewrite has been ongoing but slow. I tried vibe-coding my way to success but didn't have much luck there, so it's taking more time to do it manually. If I was doing it from scratch it wouldn't be a challenge. The challenge is making all the posts and current structure fit into what I want to website to eventually look like.

I use different tools for writing and coding. I can write my blog posts on my iPad when I’m in the library or local pub and at that time, there’s no way for me to tinker with the tools or the website itself.

Another great suggestion. A lot of my desire to rewrite stems from frictions in my current workflow as well as being able to support new and different post types. Back when I had my Duo, I didn't mind the mobile VS Code authoring experience, but now that I'm back to a single screen device, not being able to publish from my phone creates a lot of friction.

Permalink: /feed/resisting-urge-rewrite-website-santala/

Tags: #website #indieweb #blogging

lqdev👽05/05/2025

https://arxiv.org/abs/2505.00562

Learning to solve complex tasks with signal temporal logic (STL) specifications is crucial to many real-world applications. However, most previous works only consider fixed or parametrized STL specifications due to the lack of a diverse STL dataset and encoders to effectively extract temporal logic information for downstream tasks. In this paper, we propose TeLoGraF, Temporal Logic Graph-encoded Flow, which utilizes Graph Neural Networks (GNN) encoder and flow-matching to learn solutions for general STL specifications. We identify four commonly used STL templates and collect a total of 200K specifications with paired demonstrations. We conduct extensive experiments in five simulation environments ranging from simple dynamical models in the 2D space to high-dimensional 7DoF Franka Panda robot arm and Ant quadruped navigation. Results show that our method outperforms other baselines in the STL satisfaction rate. Compared to classical STL planning algorithms, our approach is 10-100X faster in inference and can work on any system dynamics. Besides, we show our graph-encoding method's capability to solve complex STLs and robustness to out-distribution STL specifications. Code is available at this https URL

Permalink: /feed/telograf/

Tags: #ai #graphs #neuralnetworks #research

lqdev👽05/05/2025

https://shellsharks.com/just-put-it-on-your-blog

It’s great to have a place to share your thoughts. A place you can go back to when you want to remember something you had written or thought about before. A place you can refer people to when they have questions you’ve answered in the past. A place to be you. So, get a blog, and put all the things there.

It is great to carve out your own space in the interwebs. If you run out of ideas of what to do with your website, here's James' blog with 100+ ideas.

Permalink: /feed/just-post-on-your-blog-shellsharks/

Tags: #blogging #indieweb #website

lqdev👽05/05/2025

https://arxiv.org/abs/2505.00023

In a real-world corpus, knowledge frequently recurs across documents but often contains inconsistencies due to ambiguous naming, outdated information, or errors, leading to complex interrelationships between contexts. Previous research has shown that language models struggle with these complexities, typically focusing on single factors in isolation. We classify these relationships into four types: distracting, ambiguous, counterfactual, and duplicated. Our analysis reveals that no single approach effectively addresses all these interrelationships simultaneously. Therefore, we introduce Context Organizer (CORG), a framework that organizes multiple contexts into independently processed groups. This design allows the model to efficiently find all relevant answers while ensuring disambiguation. CORG consists of three key components: a graph constructor, a reranker, and an aggregator. Our results demonstrate that CORG balances performance and efficiency effectively, outperforming existing grouping methods and achieving comparable results to more computationally intensive, single-context approaches.

Permalink: /feed/corg-context-organizer-framework/

Tags: #rag #ai #knowledge #research

lqdev👽05/05/2025

https://arxiv.org/abs/2505.00234

Many methods for improving Large Language Model (LLM) agents for sequential decision-making tasks depend on task-specific knowledge engineering--such as prompt tuning, curated in-context examples, or customized observation and action spaces. Using these approaches, agent performance improves with the quality or amount of knowledge engineering invested. Instead, we investigate how LLM agents can automatically improve their performance by learning in-context from their own successful experiences on similar tasks. Rather than relying on task-specific knowledge engineering, we focus on constructing and refining a database of self-generated examples. We demonstrate that even a naive accumulation of successful trajectories across training tasks boosts test performance on three benchmarks: ALFWorld (73% to 89%), Wordcraft (55% to 64%), and InterCode-SQL (75% to 79%)--matching the performance the initial agent achieves if allowed two to three attempts per task. We then introduce two extensions: (1) database-level selection through population-based training to identify high-performing example collections, and (2) exemplar-level selection that retains individual trajectories based on their empirical utility as in-context examples. These extensions further enhance performance, achieving 91% on ALFWorld--matching more complex approaches that employ task-specific components and prompts. Our results demonstrate that automatic trajectory database construction offers a compelling alternative to labor-intensive knowledge engineering.

Permalink: /feed/self-generate-incontext-examples-llm-agents-improve-sequential-decision-making/

Tags: #ai #agents #planning #research

lqdev👽05/02/2025

https://devblogs.microsoft.com/ise/running-rag-onnxruntime-genai/

Really great to see these case studies and comparisons.

A while back we published a blog post showcasing how experiences like the AI Dev Gallery make use of ONNX Runtime and the various AI building blocks in .NET to enable a diverse set of scenarios.

For more ONNX Runtime GenAI focused C# content, you can also reference the post Using Phi-3 & C# with ONNX for text and vision samples

Permalink: /feed/rag-onnx-runtime-genai-windows/

Tags: #onnxruntime #windows #dotnet

lqdev👽04/28/2025

https://github.com/AboutRSS/ALL-about-RSS

A list of RSS related stuff: tools, services, communities and tutorials, etc.

Permalink: /feed/all-about-rss/

Tags: #rss #indieweb #feeds

lqdev👽04/24/2025

https://openai.com/index/image-generation-api/

Today, we’re bringing the natively multimodal model that powers this experience in ChatGPT to the API via gpt-image-1, enabling developers and businesses to easily integrate high-quality, professional-grade image generation directly into their own tools and platforms.

The gpt-image-1 model is now available globally via the Images API, with support in the Responses API coming soon.

Permalink: /feed/gpt-image-1-api/

Tags: #openai #ai #gpt

lqdev👽04/08/2025

https://ai.meta.com/blog/llama-4-multimodal-intelligence/

We’re introducing Llama 4 Scout and Llama 4 Maverick, the first open-weight natively multimodal models with unprecedented context length support and our first built using a mixture-of-experts (MoE) architecture. We’re also previewing Llama 4 Behemoth, one of the smartest LLMs in the world and our most powerful yet to serve as a teacher for our new models.

Llama 4 Scout, a 17 billion active parameter model with 16 experts, is the best multimodal model in the world in its class and is more powerful than all previous generation Llama models, while fitting in a single NVIDIA H100 GPU. Additionally, Llama 4 Scout offers an industry-leading context window of 10M and delivers better results than Gemma 3

Llama 4 Maverick, a 17 billion active parameter model with 128 experts, is the best multimodal model in its class, beating GPT-4o and Gemini 2.0 Flash across a broad range of widely reported benchmarks

After tinkering with Mistral Small 3.1, I'm excited to try Llama 4 Scout once it lands in GitHub Models and Ollama.

I don't think that Mistral Small 3.1 uses MoE architecture which should help a ton considering the 10 million token context size.

Permalink: /feed/llama-4-herd/

Tags: #llama #meta #ai #llm

lqdev👽04/08/2025

https://www.fastcompany.com/91310237/the-tumblr-revival-is-real-and-gen-z-is-leading-the-charge

Thanks to Gen Z, the site has found new life. As of 2025, Gen Z makes up 50% of Tumblr’s active monthly users and accounts for 60% of new sign-ups

Don't call it a comeback.

User numbers spiked in January during the near-ban of TikTok and jumped again last year when Brazil temporarily banned X.

To keep up with the momentum, Tumblr introduced Reddit-style Communities in December, letting users connect over shared interests like photography and video games. In January, it debuted Tumblr TV—a TikTok-like feature that serves as both a GIF search engine and a short-form video platform.

But perhaps Tumblr’s greatest strength is that it isn’t TikTok or Facebook.

For its users...that’s part of the appeal.

Tumblr's been through a rough ride, but at the core there's a lot of nice things to like about it. In previous posts, I talked about some opportunities for Tumblr as a result of replatforming on top of WordPress and its expressed intent for ActivityPub integration. In doing so, not only would it enable an intuitive front-end for publishing different kinds of posts, but it would also provide opportunities for self-hosting and a more federated and decentralized social platform.

I'm excited to see where this goes.

Permalink: /feed/gen-z-tumblr-revival/

Tags: #tumblr #automattic #socialmedia #genz #activitypub #fediverse

lqdev👽03/18/2025

https://maho.dev/2025/03/a-guide-to-implementing-activitypub-in-a-static-site-or-any-website-q1-2025-updates/

Awesome post from my friend maho.dev

I’ve introduced customizable templates for note generation. This enhancement allows the full content of a blog post to be included directly in ActivityPub notes, moving beyond mere link sharing—a practice often associated with bots—and leveraging the full potential of the Fediverse.

This is cool. I'd like something better for my POSSE setup. Might give this a try.

Permalink: /feed/bringing-static-sites-fediverse-enhancements-implementations-maho/

Tags: #fediverse #indieweb #activitypub

lqdev👽03/18/2025

https://blog.google/technology/developers/gemma-3/

Today, we're introducing Gemma 3, a collection of lightweight, state-of-the-art open models built from the same research and technology that powers our Gemini 2.0 models. These are our most advanced, portable and responsibly developed open models yet. They are designed to run fast, directly on devices — from phones and laptops to workstations — helping developers create AI applications, wherever people need them. Gemma 3 comes in a range of sizes (1B, 4B, 12B and 27B), allowing you to choose the best model for your specific hardware and performance needs.

Build with the world's best single-accelerator model: Gemma 3 delivers state-of-the-art performance for its size, outperforming Llama3-405B, DeepSeek-V3 and o3-mini in preliminary human preference evaluations on LMArena’s leaderboard. This helps you to create engaging user experiences that can fit on a single GPU or TPU host.

Create AI with advanced text and visual reasoning capabilities: Easily build applications that analyze images, text, and short videos, opening up new possibilities for interactive and intelligent applications.

Handle complex tasks with an expanded context window: Gemma 3 offers a 128k-token context window to let your applications process and understand vast amounts of information.

Create AI-driven workflows using function calling: Gemma 3 supports function calling and structured output to help you automate tasks and build agentic experiences.

Permalink: /feed/introducing-gemma-3/

Tags: #gemma #google #ai

lqdev👽03/18/2025

https://cohere.com/blog/command-a

Today, we’re introducing Command A, a new state-of-the-art generative model optimized for demanding enterprises that require fast, secure, and high-quality AI. Command A delivers maximum performance with minimal hardware costs when compared to leading proprietary and open-weights models, such as GPT-4o and DeepSeek-V3. For private deployments, Command A excels on business-critical agentic and multilingual tasks, while being deployable on just two GPUs, compared to other models that typically require as many as 32.

Its 256k context length (2x most leading models) can handle much longer enterprise documents. Other key features include Cohere’s advanced retrieval-augmented generation (RAG) with verifiable citations, agentic tool use, enterprise-grade security, and strong multilingual performance.

The next generation of Cohere models will help power a range of AI applications for customers across industries like finance, healthcare, manufacturing, energy, and the public sector. In particular, they will seamlessly integrate with North, our secure AI agents platform to unlock the full potential of your company data and people with AI agents.

Permalink: /feed/introducing-command-a-cohere/

Tags: #cohere #ai #llm

lqdev👽03/18/2025

https://tools.simonwillison.net/colophon

The tools on tools.simonwillison.net were mostly built using AI-assisted programming.

This page lists the commit messages for each tool, many of which link to the LLM transcript used to produce the code.

The descriptions for each of the tools were generated using Claude 3.7 Sonnet.

This is such a cool use of a website. Using AI, Simon has generated a set of small utilities that are useful to him. If they're useful to others though, he's made them avaiable on his website.

Permalink: /feed/willison-tools-colophon/

Tags: #colophon #indieweb #ai #llm

lqdev👽03/18/2025

https://mistral.ai/fr/news/mistral-small-3-1

Building on Mistral Small 3, this new model comes with improved text performance, multimodal understanding, and an expanded context window of up to 128k tokens. The model outperforms comparable models like Gemma 3 and GPT-4o Mini, while delivering inference speeds of 150 tokens per second.

Lightweight: Mistral Small 3.1 can run on a single RTX 4090 or a Mac with 32GB RAM. This makes it a great fit for on-device use cases.

Permalink: /feed/mistral-small-3-1/

Tags: #mistral #ai #opensource

lqdev👽03/18/2025

https://ericmigi.com/blog/introducing-two-new-pebbleos-watches

Just pre-ordered my Pebble Core Time 2 Duo! Sadly it won't be here until December.

Go to https://store.repebble.com/ to get yours.

Permalink: /feed/preordered-pebbleos-watch/

Tags: #pebbleos #smartwatch #opensource #pebble

lqdev👽03/05/2025

https://biilmann.blog/articles/introducing-ax/

Interesting post from Matt Biilman.

...the largest disruption from the current evolution of AI will come from bringing agency to computers.

Computers will perceive their environment and take actions autonomously in order to achieve goals.

Companies discovered that improving the DX of their products would empower and incentivize developers to extend their product with new capabilities and lead to huge competitive advantages. For developer tool companies, DX became a key competitive differentiator.

We need to start focusing on AX or “agent experience” — the holistic experience AI agents will have as the user of a product or platform.

Too many companies are focusing on adding shallow AI features all over their products or building yet another AI agent. The real breakthrough will be thinking about how your customers’ favorite agents can help them derive more value from your product. This requires thinking deeply about agents as a persona your team is building and developing for.

As AI agents start becoming useful and commonplace, we’re broadly going to see two approaches enabling agents to interact with the software we depend on:

A closed vertical approach, where companies tightly integrate their own agents into their own software. An open approach, where companies focus on making their software accessible to external agents.

Leaning into AX as a strategy, means embracing a vision of an open agent world. This vision aligns with the original ethos of the open web: a place where many diverse competing agents (built by different people or companies) can seamlessly interact with software on behalf of their users. Prioritizing AX makes it as simple as possible for any agent a user prefers, to deliver outcomes on their behalf.

Software designed for AI agents has the potential to deliver exponential value. As an industry we must collectively focus on building an open agent ecosystem and designing thoughtful AX to create a better, more open, and connected digital world.

Permalink: /feed/ax-why-agent-experience-matters/

Tags: #ai #agents #ax

lqdev👽02/26/2025

https://www.inceptionlabs.ai/news

We are announcing the Mercury family of diffusion large language models (dLLMs), a new generation of LLMs that push the frontier of fast, high-quality text generation.

Mercury is up to 10x faster than frontier speed-optimized LLMs. Our models run at over 1000 tokens/sec on NVIDIA H100s, a speed previously possible only using custom chips.

Permalink: /feed/mercury-first-commercial-scale-llm/

Tags: #diffusion #llm #ai

lqdev👽02/25/2025

https://www.inevitable.live/algorithm/first-post

This is just a place for me to share. I been wanting a lil blog for years. Somewhere to post random shit I fuck with where the audience is way smaller than it is on the social media platforms. Finally pulled the trigger, bare with us as we still developing this page and the layout.

This is cool!

I ran into this while reading the post Blogging for smaller audiences and deeper connections by Herbert Lui.

Looking at other posts, it looks like he or at least someone is actively posting.

Permalink: /feed/inevitable-jcole-blog/

Tags: #music #blog #indieweb #creativity #art

lqdev👽02/25/2025

https://nazhamid.com/journal/your-site-is-a-home/

While going through a backlog of personal blogs I haven't read in a few weeks, I'm noticing a trend in posts that express similar sentiments to those in Naz's post.

I'm happy to see more people choosing to have a place that gives them more autonomy and freedom to express themselves in whichever form makes the most sense for them.

Your site is a home.

Eventually, social networks were created...novelty and the promise of interconnectedness by gathering in a common town square to blast out whatever was going on in our lives eventually won out.

But you might have still had your website, your home to return to...the place you built.

Things started to change and instead of going home, you, and everybody else started to live in the town square.

There is a shape to your physical home. Arranged and organized in the ways that make sense for you, the reward is a space that works for you.

A platform or network doesn’t allow for much configuration. The town square isn’t owned by you.

People became absolutely reliant on the same gathering place...the convenience of seeing friends (sometimes) outweighed your other neighbors spouting garbage and hate. You came to rely on this place for everything.

You can still have a home.

I have very little at the town square, because it’s not a public one. It’s a walled-off town square, whose rules and borders change at the whims of those who created it.

Thanks for visiting my home. I’m glad you dropped by.

I’d love to see yours sometime.

Permalink: /feed/your-site-is-a-home-hamid/

Tags: #personalweb #indieweb #socialmedia

lqdev👽02/25/2025

https://maho.dev/2025/02/how-to-multistream-using-azure-azure-vm-monaserver-2-and-ffmpeg-with-obs-studio/

In this guide, we will set up a multistreaming workflow using Azure, an Azure VM, MonaServer 2, and FFmpeg to broadcast to multiple platforms like LinkedIn Live and YouTube Live. We’ll use OBS Studio as the main streaming software. Let’s get started.

Love this detailed writeup from Maho.

When I originally experimented with something similar using my Owncast instance, I was running ffmpeg locally but I like his solution better.

Permalink: /feed/multistream-azure-ffmpeg-obs-studio-maho/

Tags: #livestream #azure #obs

lqdev👽02/25/2025

https://unplatform.fromthesuperhighway.com/

Unplatform is an interactive guidebook, online library, and recommendations database intended to help you escape social media and join the indie web.

Love this guide.

Permalink: /feed/unplatform-indieweb/

Tags: #indieweb #socialmedia #personalweb

lqdev👽02/24/2025

https://arxiv.org/abs/2501.19393

Test-time scaling is a promising new approach to language modeling that uses extra test-time compute to improve performance. Recently, OpenAI's o1 model showed this capability but did not publicly share its methodology, leading to many replication efforts. We seek the simplest approach to achieve test-time scaling and strong reasoning performance. First, we curate a small dataset s1K of 1,000 questions paired with reasoning traces relying on three criteria we validate through ablations: difficulty, diversity, and quality. Second, we develop budget forcing to control test-time compute by forcefully terminating the model's thinking process or lengthening it by appending "Wait" multiple times to the model's generation when it tries to end. This can lead the model to double-check its answer, often fixing incorrect reasoning steps. After supervised finetuning the Qwen2.5-32B-Instruct language model on s1K and equipping it with budget forcing, our model s1-32B exceeds o1-preview on competition math questions by up to 27% (MATH and AIME24). Further, scaling s1-32B with budget forcing allows extrapolating beyond its performance without test-time intervention: from 50% to 57% on AIME24. Our model, data, and code are open-source at this https URL

Permalink: /feed/s1-simple-test-time-scaling/

Tags: #ai #research #s1

lqdev👽02/24/2025

https://www.nomic.ai/blog/posts/nomic-embed-text-v2

Today we're excited to announce Nomic Embed Text V2, our next-generation embedding model that brings the Mixture of Experts (MoE) architecture to text embeddings on a new expanded multilingual training dataset.

Personally, I found the part on MoE to be the most interesting about this release.

Rather than a dense model which uses all parameters on an input, the MoE architecture dynamically routes to different "experts" - sparse subsets of parameters at each layer - activating, ideally, only the parameters especially needed to process the input. This approach allows for more efficient use of compute when generating embeddings.

In our experiments, we found that alternating MoE layers with 8 experts and top-2 routing provides the optimal balance between performance and efficiency. This results in 475M total parameters in the model, but only 305M active during training and inference.

Research into embedding model architecture has significant practical implications for working with text embeddings in production:

Lower latency for high-volume applications of embeddings like retrieval

Reduced deployment costs through more efficient parameter usage

More accessibility to embeddings in settings with constrained compute

Permalink: /feed/nomic-embed-text-v2/

Tags: #nomic #ai #embeddings

lqdev👽02/24/2025

https://www.anthropic.com/news/claude-3-7-sonnet

Today, we’re announcing Claude 3.7 Sonnet1, our most intelligent model to date and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users also have fine-grained control over how long the model can think for.
Claude 3.7 Sonnet shows particularly strong improvements in coding and front-end web development. Along with the model, we’re also introducing a command line tool for agentic coding, Claude Code. Claude Code is available as a limited research preview, and enables developers to delegate substantial engineering tasks to Claude directly from their terminal.

Claude 3.7 Sonnet and Claude Code mark an important step towards AI systems that can truly augment human capabilities. With their ability to reason deeply, work autonomously, and collaborate effectively, they bring us closer to a future where AI enriches and expands what humans can achieve.

Permalink: /feed/anthropic-claude-3-7-sonnet-code/

Tags: #ai #anthropic #claude

lqdev👽02/19/2025

https://huggingface.co/spaces/nanotron/ultrascale-playbook

Gold.

All the techniques we'll cover in this book tackle one or several of the following three key challenges, which we'll keep bumping into throughout the book:

1. **Memory Usage**: it's a hard limitation - if a training step doesn't fit in memory, training cannot proceed 2. **Compute Efficiency**: we want our hardware to spend most time computing, so we need to reduce time spent on data transfers or waiting for other GPUs to perform work. 3. **Communication overhead**: we want to minimize communication overhead as it keeps GPUs idle. To archieve this we will try to make best use of intra-node (fast) and inter-node (slower) bandwidths as well as overlap communication with compute as much as possible.
In many places we'll see that we can trade one of these (computation, communication, memory) for another (e.g. recomputation or Tensor Parallelism). Finding the right balance is key to scaling training.

Ultra-Scale Playbook Cheatsheet

Permalink: /feed/ultrascale-playbook-training-llms-gpu-clusters/

Tags: #ai #llm #training

lqdev👽02/17/2025

https://www.youtube.com/watch?v=-7e6g11BJc0

I didn't watch the Super Bowl this year so I didn't get a chance to watch the ads until a few days later.

From the ones I saw, I think Google's was my favorite.

Apart from the touching story, what stood out to me the most was its focus on humanity, with technology playing a supporting role rather than being the main character.

Permalink: /feed/google-dream-job-2025-sb-ad/

Tags: #google #ad #superbowl #ai

lqdev👽02/15/2025

https://www.youtube.com/watch?v=VOEeNffChSg

Great session on Tensors in .NET by Tanner

Tensors are the foundational data structure powering man AI workloads today.

Introduced in .NET 9, Tensors builds on top of earlier work like TensorPrimitives, hardware intrinsics, and generic math.

You can check out the recording below to learn more.

Permalink: /feed/dotnet-language-runtime-standup-tensors/

Tags: #ai #tensor #dotnet #runtime

lqdev👽02/15/2025

https://devblogs.microsoft.com/dotnet/announcing-generative-ai-for-beginners-dotnet/

If you're a .NET developer looking to get started with Generative AI, there's a new course for beginners that just launched.

Through a series of self-paced lessons, you'll learn the foundations to help you start using AI to augment the capabilities of your .NET applications.

Check out the course and give us feedback!

Permalink: /feed/generative-ai-for-beginners-dotnet-course/

Tags: #ai #dotnet #course #generativeai

lqdev👽02/15/2025

https://apps.microsoft.com/detail/9n9pn1mm3bd5

So excited to see the AI Dev Gallery is now in the Microsoft Store.

Through various interactive samples, developers get to see how they might use AI for different tasks like text classification, object detection, and many others.

Best of all, the models are all running locally and you get to see and export the C# code powering the samples to Visual Studio so you can continue tinkering on your own and integrate into your own applications.

A few months ago, the team came on the .NET AI Community Standup to showcase the app. Since then, it's only kept improving and introducing new scenarios.

You can check out the recording from that stream here.

Permalink: /feed/ai-dev-gallery-windows-preview-ms-store/

Tags: #ai #windows #dotnet #microsoft #onnx

lqdev👽02/10/2025

https://huggingface.co/learn/agents-course/unit0/introduction

This free course will take you on a journey, from beginner to expert, in understanding, using and building AI agents.

At the end of this course you’ll understand how Agents work and how to build your own Agents using the latest libraries and tools.

Permalink: /feed/huggingface-ai-agents-course/

Tags: #huggingface #ai #agents #course

lqdev👽02/10/2025

https://andysblog.uk/why-blog-if-nobody-reads-it/

There is a hidden value in blogging... You do it not for the applause but because it needs doing.

You write because you think, because you observe, because you need to put it somewhere.

If someone reads it? Bonus. If not? The work still got done.

💯💯💯

I agree with many of the points in Andy's post.

I don't have analytics on my website, so I couldn't tell you who or how many people are visiting it.

And that's okay.

When someone links to content they found on my website and tags me or sends me an e-mail letting me know something on my website is broken is when I know there's at least one person.

Many of the content on here serves as a reminder of how I've solved problems for myself. The microposts and responses are observations that capture a moment in time. If the messages reach someone on the other end and either solve their problem or resonate with them, I'm happy that happens, but I see that as a bonus.

Permalink: /feed/why-blog-nobody-reads-andy/

Tags: #blogging #expression #creativity #writing

lqdev👽02/06/2025

https://ericmigi.com/blog/why-were-bringing-pebble-back

TL;DR We’re making a new Pebble-style smartwatch.

This is amazing news! By the time I decided to get a Pebble, they'd already been sold to Google.

I agree with Eric's points here:

I've tried every single smart watch out there, but none do it for me. No one makes a smartwatch with the core set of features I want:

Always-on e-paper screen (it’s reflective rather than emissive. Sunlight readable. Glanceable. Not distracting to others like a bright wrist)

Long battery life (one less thing to charge. It’s annoying to need extra cables when traveling) Simple and beautiful user experience around a core set of features I use regularly (telling time, notifications, music control, alarms, weather, calendar, sleep/step tracking)

Buttons! (to play/pause/skip music on my phone without looking at the screen)

Hackable (apparently you can’t even write your own watchfaces for Apple Watch? That is wild. There were >16k watchfaces on the Pebble appstore!)

Garmin is the closest I've come to it at least on the battery life and buttons front. Pebble was one-of-a-kind.

I already signed up to stay up to date with the project and plan on being a day one customer. You can too at repebble.com.

P.S. While not surprised, I didn't know Eric was behind another amazing project I'm a fan of, Beeper.

Permalink: /feed/bringing-pebble-back/

Tags: #eink #pebble #smartwatch #google #opensource

lqdev👽02/04/2025

https://xuanwo.io/links/2025/01/link-blog/

Great to see more people sharing knowledge and linking to others from their own websites / platforms.

I decided to follow simon's approach to creating a link blog, where I can share interesting links I find on the internet along with my own comments and thoughts about them.

...this is an excellent way to share knowledge while also keeping a personal record. It’s much better than simply saving something to Readwise to read later and leaving a few highlights or dull comments like "interesting."

Permalink: /feed/build-a-link-blog-xuanwo/

Tags: #blogging #indieweb #linkblog #personalweb #openweb

lqdev👽02/03/2025

https://letsnotdothat.com/

😂😂😂

This website is amazing.

We already finalized our OKRs for H1 but if you throw it in the backlog maybe we can get it prioritized in H2 planning

This could be part of a larger initiative, but let’s hold off for now

We should validate that with users first

Permalink: /feed/master-art-of-product-manager/

Tags: #fun #business #productmanager

lqdev👽02/03/2025

https://newsletter.languagemodels.co/p/the-illustrated-deepseek-r1

Just like most existing LLMs, DeepSeek-R1 generates one token at a time, except it excels at solving math and reasoning problems because it is able to spend more time processing a problem through the process of generating thinking tokens that explain its chain of thought.

DeepSeek R1 Training Recipe — Source: *newsletter.languagemodels.co*

Permalink: /feed/illustrated-deepseek-r1/

Tags: #deepseek #ai #visualization #generativeai #genai #llm #learning

lqdev👽02/03/2025

https://openai.com/index/introducing-deep-research/

An agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks for you.

Powered by a version of the upcoming OpenAI o3 model that’s optimized for web browsing and data analysis, it leverages reasoning to search, interpret, and analyze massive amounts of text, images, and PDFs on the internet, pivoting as needed in reaction to information it encounters.

How it works

Deep research was trained using end-to-end reinforcement learning on hard browsing and reasoning tasks across a range of domains. Through that training, it learned to plan and execute a multi-step trajectory to find the data it needs, backtracking and reacting to real-time information where necessary. The model is also able to browse over user uploaded files, plot and iterate on graphs using the python tool, embed both generated graphs and images from websites in its responses, and cite specific sentences or passages from its sources. As a result of this training, it reaches new highs on a number of public evaluations focused on real-world problems.

Permalink: /feed/openai-deep-research/

Tags: #ai #openai #agent #research

lqdev👽02/02/2025

https://www.maho.dev/2025/01/a-guide-to-implementing-activitypub-in-a-static-site-or-any-website-part-8/

Part 8 of "A guide to implement ActivityPub in a static site (or any website)" by maho.dev recently dropped.

This part goes over how to add replies and comments to your static site.

Permalink: /feed/guide-implementing-activitypub-static-site-pt-8/

Tags: #fediverse #indieweb #activitypub #personalweb #internet #opensource

lqdev👽01/28/2025

https://www.wisn.com/article/bucees-plans-first-wisconsin-travel-center-in-oak-creek/63527062

Buc-ee's, the popular Texas-based travel center chain, is set to open its first Wisconsin location in Oak Creek, city officials announced.

Time to get my banana pudding fix.

Permalink: /feed/bucees-wisconsin/

Tags: #bucees #wisconsin #travel

lqdev👽01/20/2025

https://manuelmoreale.com/short-long-form

This is something I've been dealing with on my site. I separate my long-form from short-form posts. However, I don't have a hard limit that clearly defines which is which. An initial reason for the separation was, I wanted to have a feed that was easy to scroll through. That was harder to do if I was rendering an entire long-form post as part of the feed. In the end though, most of my posts are all markdown files and their main distinction is how they are rendered.

As I work on my website redesign, I want to change how I think about posts. I want to separate the post content from the rendering. A post is a post regardless of content and length. Regardless of whether it's a response, image, video, note, or anything in between, I don't want to constrain the content of my post. I can choose to render posts differently depending on whether it's being viewed in a feed or individual post page. The content driving those different views though shouldn't change.

Permalink: /feed/short-long-form-moreale/

Tags: #instagram #posts #attention

lqdev👽01/20/2025

https://lepisma.xyz/2025/01/17/emacs-on-device-ml/index.html

This is a cool walk-through of how to add semantic search to Emacs using local ML models.

With ONNX.el you can run any ML model inside Emacs, including ones for images, audios, etc. In case you are running embedding models, combine this with sem.el to perform semantic searches. Depending on your input modality, you might have to figure out preprocessing. For text, tokenizers.el should cover all of what you need in modern NLP for preprocessing, though you will have to do something on your own for anything non-text or multimodal.

By default sem runs a local all-MiniLM-L6-v2 model for text embeddings. This model is small and runs fast on CPU. Additionally, we load an O2 optimized variant which helps further. The vectors are stored in a lancedb database on file system which runs without a separate process. I was originally writing the vector db core myself to learn more of Rust, but then stopped doing that since lancedb gives all that I needed, out of the box.

Permalink: /feed/semantic-search-on-device-ml-emacs/

Tags: #onnx #emacs #ai #ml

lqdev👽01/20/2025

https://www.mollywhite.net/micro/entry/202501190959

...you needed to move anything you care about online to a space you control. Digital sovereignty is more important than ever.

💯💯💯💯💯

Best time was yesterday. Next best time is now.

Permalink: /feed/digital-sovereignty-white/

Tags: #personalweb #digitalsoverignty #indieweb

lqdev👽01/18/2025

https://www.figma.com/blog/making-space-for-a-handmade-web/

Love this article.

Before the web was more commercialized, it was personal...A website was a way to represent yourself and connect to others, a space to tend and call your very own.

The imperfect, handmade character of websites has been sanded away in favor of efficiency, and social media platforms have risen in the stead of amateur sites. Many of us conform to these containers and retreat from expressing our authentic selves publicly...

If the idea of a more handcrafted internet resonates with you, the best way to be part of the movement is simply to make your own website. This may seem intimidating if we expect webpages to be a holistic reflection of ourselves, like a resume, portfolio, or blog. We limit the web as a medium when we expect that a website must fulfill a host of needs, or serve any function at all. A website doesn’t need to be anything but your own.

A more personal web becomes possible once we turn away from our preconceived notions of what a website must be. Let’s use our tools to continuously push at the boundaries of the web. Doing so will create a more creative, expansive environment that allows us to reclaim our agency and reinvigorate our collective understanding of software as an artisanal craft. Instead of concentrating on a few, monolithic sites, what if we visited many, more personalized sites? We should explore the far reaches of the web and invite our friends to join in: After all, we make the internet, and only we can build the web we want.

Permalink: /feed/making-space-handmade-web/

Tags: #indieweb #personalweb #internet #websites #handmadeweb

lqdev👽01/16/2025

https://apnews.com/article/david-lynch-dies-9107f3ce0b4dd49dbe3dc2ae3c09ed59

David Lynch, the filmmaker celebrated for his uniquely dark and dreamlike vision in such movies as “Blue Velvet” and “Mulholland Drive” and the TV series “Twin Peaks,” has died just days before his 79th birthday.

I didn't understand all of his work, but I appreciate his originality and overall aesthetic.

Permalink: /feed/rip-david-lynch/

Tags: #davidlynch #twinpeaks

lqdev👽01/15/2025

https://arxiv.org/abs/2403.17392

I'm going to tell future generations, this is how we built multi-agent systems back in my day.

Cyborg insects refer to hybrid robots that integrate living insects with miniature electronic controllers to enable robotic-like programmable control. These creatures exhibit advantages over conventional robots in adaption to complex terrain and sustained energy efficiency. Nevertheless, there is a lack of literature on the control of multi-cyborg systems. This research gap is due to the difficulty in coordinating the movements of a cyborg system under the presence of insects' inherent individual variability in their reactions to control input. Regarding this issue, we propose a swarm navigation algorithm and verify it under experiments. This research advances swarm robotics by integrating biological organisms with control theory to develop intelligent autonomous systems for real-world applications.

Permalink: /feed/swarm-cyborg-insects-agents/

Tags: #robotics #ai #insects #agents

lqdev👽01/15/2025

https://news.itsfoss.com/ai-subtitles-vlc/

Shown off at CES 2025 is VLC's upcoming automatic subtitling feature, which is local in nature, running entirely on a user's machine without needing to connect to some distant server.

According to Jean-Baptiste Kempf, the creator and lead developer of VLC, this new feature is powered by open source AI models, which carry out subtitle generation tasks in two stages.

One aspect is automatic generation of subtitles from the video, and the other is translation of those subtitles into over 100 languages.

Unfortunately, there's no word on when this feature will be arriving on VLC.

Permalink: /feed/ai-subtitles-coming-vlc/

Tags: #vlc #ai #opensource #local

lqdev👽01/14/2025

https://josebriones.substack.com/p/the-future-of-calm-technology-with

Interesting interview and introduction to Calm Technology and the Calm Tech Institute.

Permalink: /feed/calm-tech-interview/

Tags: #digitalminimalism #technology

lqdev👽01/13/2025

https://ellanew.com/ptpl/139-2025-01-13-be-the-landlord-of-your-notes

When you choose to keep your notes in locally stored plain text files, you’re choosing to be your own landlord. Keeping notes in a proprietary app is the same as renting: the land you’re building on belongs to someone else, with no guarantee of continuing access.

This is so true for many other areas of our digital lives as well.

Permalink: /feed/landlord-of-your-notes/

Tags: #localfirst #notetaking #pkm

lqdev👽01/08/2025

https://nesslabs.com/a-year-of-curiosity

When we encounter something new and interesting, our brains release dopamine – the same neurotransmitter associated with reward and pleasure. This creates a simple pattern: the more we learn, the more we want to learn.

This drive to explore and understand isn’t just nice to have – it’s essential to who we are as humans.

There’s no single “right” way to be curious. Learning practical skills like coding or cooking, diving into topics like history or science, joining groups of people who share your interests… These are all great ways to inject more curiosity into your life.

Designing a year of curiosity

Monthly: Design one tiny experiment at the beginning of each month This could be as simple as exploring a topic you know nothing about, trying a new hobby, or doing something that pushes you out of your comfort zone...

Weekly: Every week, take 10 to 15 minutes to conduct a weekly review. What went well this week? What didn’t go as planned? What will you focus on next week? This could mean doubling down on what worked or tweaking something that didn’t...

Daily: Create at least one moment of curiosity in your day, no matter how small. Experiment with a new recipe or tool, have one meaningful conversation, try a journaling prompt, or take a different route to work. Even one minute of curiosity a day can add up to a much richer life.

Permalink: /feed/year-of-curiosity/

Tags: #innovation #exploration #curiosity #growth

lqdev👽01/07/2025

https://huyenchip.com/2025/01/07/agents.html

The unprecedented capabilities of foundation models have opened the door to agentic applications that were previously unimaginable. These new capabilities make it finally possible to develop autonomous, intelligent agents to act as our assistants, coworkers, and coaches. They can help us create a website, gather data, plan a trip, do market research, manage a customer account, automate data entry, prepare us for interviews, interview our candidates, negotiate a deal, etc. The possibilities seem endless, and the potential economic value of these agents is enormous.

This section will start with an overview of agents and then continue with two aspects that determine the capabilities of an agent: tools and planning. Agents, with their new modes of operations, have new modes of failure. This section will end with a discussion on how to evaluate agents to catch these failures.

Permalink: /feed/agents-huyen/

Tags: #ai #agents

lqdev👽01/06/2025

https://www.kaggle.com/whitepaper-agents

Humans are fantastic at messy pattern recognition tasks. However, they often rely on tools - like books, Google Search, or a calculator - to supplement their prior knowledge before arriving at a conclusion. Just like humans, Generative AI models can be trained to use tools to access real-time information or suggest a real-world action. For example, a model can leverage a database retrieval tool to access specific information, like a customer's purchase history, so it can generate tailored shopping recommendations. Alternatively, based on a user's query, a model can make various API calls to send an email response to a colleague or complete a financial transaction on your behalf. To do so, the model must not only have access to a set of external tools, it needs the ability to plan and execute any task in a self- directed fashion. This combination of reasoning, logic, and access to external information that are all connected to a Generative AI model invokes the concept of an agent, or a program that extends beyond the standalone capabilities of a Generative AI model. This whitepaper dives into all these and associated aspects in more detail.

Permalink: /feed/kaggle-agents-whitepaper/

Tags: #agents #ai #whitepaper

lqdev👽01/03/2025

https://blog.archive.org/2025/01/01/welcome-to-the-public-domain-in-2025/

On January 1, 2025, we celebrate published works from 1929 and published sound recordings from 1924 entering the public domain! The passage of these works into the public domain celebrates our shared cultural heritage. The ability to breathe new life into long forgotten works, remix the most popular and enduring works of the time, and to better circulate the oddities we find in thrift stores, attics, and on random pockets of the internet are now freely available for us all.

While not at the same blockbuster level as 2024 with Steamboat Willie’s passage into the public domain, works from 1929 still inhabit strong cultural significance today. The works of 1929 continue to capture the Lost Generation’s voice, the rise of sound film, and the emerging modern moment of the 1920s.

Permalink: /feed/internet-archive-public-domain-2025/

Tags: #internetarchive #publicdomain

lqdev👽12/29/2024

https://blog.langchain.dev/langchain-state-of-ai-2024/

In 2024, developers leaned into complexity with multi-step agents, sharpened efficiency by doing more with fewer LLM calls, and added quality checks to their apps using methods of feedback and evaluation.

Bar Chart Showing Top 10 LLMs Used 2024 — Source: *LangChain Blog*

Graphic showing Top 10 Retrievers / Vector Stores 2024 — Source: *LangChain Blog*

Graphic showing Top LLM Evaluation Metrics 2024 — Source: *LangChain Blog*

Permalink: /feed/langchain-state-of-ai-2024/

Tags: #ai #2024 #report #langchain #llm

lqdev👽12/18/2024

https://github.com/microsoft/markitdown

MarkItDown is a utility for converting various files to Markdown (e.g., for indexing, text analysis, etc). It supports:

PDF

PowerPoint

Word

Excel

Images (EXIF metadata and OCR)

Audio (EXIF metadata and speech transcription)

HTML

Text-based formats (CSV, JSON, XML)

ZIP files (iterates over contents)

Permalink: /feed/microsoft-markitdown/

Tags: #markdown #etl #data #python #ai #microsoft

lqdev👽12/17/2024

https://github.com/muni-town/agentic-fediverse

The "Agentic Fediverse" is the idea of a new kind of network federation, a complementary iteration on the concept of federation currently established with ActivityPub. It's an experiment and something that we in Muni Town are still trying to define more concretely.

Tenets

We are still working on defining the core tenets of the agentic fediverse, but here is what we have so far.

An agentic fediverse is...

A fediverse of agents; agent-centric, as opposed to server-centric.

Local-first; solve local problems for local people.

Accessible by default; inaccessibility is a bug.

Systematically consensual; architected on the basis of informed consent.

Permalink: /feed/agentic-fediverse/

Tags: #fediverse #munitown #agents #agentic #activitypub #protocols #socialweb

lqdev👽12/17/2024

https://www.anew.social/hello-social-web/

We're A New Social, a new non-profit organization focused on building cross-protocol tools and services for the open social web.

Our mission is to liberate people's networks from their platforms, enabling The Last Network Effect and leveling the playing field across the open social web.

The first project we'll take on to accomplish this mission is Bridgy Fed, a service that enables users of ActivityPub-based platforms like Mastodon, ATProto-based platforms like Bluesky, and websites to interact and engage across ecosystems.

Permalink: /feed/hello-social-web/

Tags: #anewsocial #fediverse #socialmedia #protocols #atproto #activitypub #socialweb #personalweb

lqdev👽12/17/2024

https://engineering.tumblr.com/post/722102563011493888/streambuilder-our-open-source-framework-for

Interesting post. Whether building your feeds or plugging into existing platforms like Bluesky, this framework could serve as a good starting point.

Today, we’re abnormally jazzed to announce that we’re open-sourcing the custom framework we built to power your dashboard on Tumblr. We call it StreamBuilder, and we’ve been using it for many years.

StreamBuilder has a lot going on. The primary architecture centers around “streams” of content: whether posts from a blog, a list of blogs you’re following, posts using a specific tag, or posts relating to a search. These are separate kinds of streams, which can be mixed together, filtered based on certain criteria, ranked for relevancy or engagement likelihood, and more.

So, what’s included in the box?

The full framework library of code that we use today, on Tumblr, to power almost every feed of content you see on the platform.

A YAML syntax for composing streams of content, and how to filter, inject, and rank them.

Abstractions for programmatically composing, filtering, ranking, injecting, and debugging streams.

Abstractions for composing streams together—such as with carousels, for streams-within-streams.

An abstraction for cursor-based pagination for complex stream templates.

Unit tests covering the public interface for the library and most of the underlying code.

GitHub Repo

Permalink: /feed/streambuilder-tumblr-framework-building-feed/

Tags: #tumblr #algorithms #feed #socialmedia

lqdev👽12/17/2024

https://openverse.org/

Openverse is a tool that allows openly licensed and public domain works to be discovered and used by everyone.

Openverse searches across more than 800 million images and audio tracks from open APIs and the Common Crawl dataset. We aggregate works from multiple public repositories, and facilitate reuse through features like one-click attribution.

Openverse is the successor to CC Search which was launched by Creative Commons in 2019

Permalink: /feed/openverse/

Tags: #openverse #opensource #creativecommons

lqdev👽12/17/2024

https://www.youtube.com/watch?v=KLybH5YvIPQ

Cool retrospective and announcements from this WordPress' State of the Word 2024.

Focused mode and templates look interesting.

Permalink: /feed/state-of-the-word-2024-wordpress/

Tags: #wordpress #conference #blogging #website #opensource

lqdev👽12/16/2024

https://www.theverge.com/2024/12/13/24320336/tumblr-communities-reddit-topics-groups

Tumblr is introducing a new Community feature — in-app groups organized by topic or interest.

Communities are similar to subreddits or Facebook groups and had previously been in beta.

Given recent announcements like Tumblr replatforming on top of WordPress and WordPress adding support for ActivityPub, I could see communities becoming not only an alternative to Reddit, but also Lemmy in the Fediverse.

Permalink: /feed/tumblr-adding-reddit-like-communities/

Tags: #tumblr #reddit #communities #automattic #wordpress #activitypub #fediverse #lemmy

lqdev👽12/16/2024

https://jordanmatthiesen.me/programming/build-youtube-chatbot-dotnet/

Great post from jordanmatthiesen.me.

Recently on a trip for a tech conference I created a YouTube chat app using .NET and AI. This is part of my exploration into creating a larger app for chatting about .NET AI development (leveraging docs, presentations, and sample code my team has been working on at Microsoft).

Permalink: /feed/youtube-chat-app-ai-dotnet/

Tags: #dotnet #ai #youtube #chat #microsoftextensionsai #openai

lqdev👽12/09/2024

https://openai.com/index/sora-is-here/

Our video generation model is rolling out at sora.com⁠.

Users can generate videos up to 1080p resolution, up to 20 sec long, and in widescreen, vertical or square aspect ratios. You can bring your own assets to extend, remix, and blend, or generate entirely new content from text.

The version of Sora we are deploying has many limitations. It often generates unrealistic physics and struggles with complex actions over long durations.

We’re introducing our video generation technology now to give society time to explore its possibilities and co-develop norms and safeguards that ensure it’s used responsibly as the field advances.

Permalink: /feed/sora-released/

Tags: #sora #openai #ai

lqdev👽12/05/2024

https://www.fastcompany.com/91235471/how-i-turned-my-blog-into-a-social-media-hub

...my blog has become a central hub for almost everything I write online, including nearly all my social media posts, links to my published work on other sites, and the occasional freestanding blog post. While this approach has a few trade-offs, I feel better about posting in the first place knowing that I’m doing it on my own terms.

There’s also just something deeply satisfying about seeing a running feed of my own writing, on my own website, with my own design and presentation. A social network alone can’t replicate that, but the POSSE approach offers the best of both worlds.

I've enjoyed seeing over the past few weeks, as people seek new online platforms, the personal website is being rediscovered.

Permalink: /feed/how-i-turned-website-social-media-hub-newman/

Tags: #indieweb #social #microblog #blogging #internet #web #socialmedia

lqdev👽11/24/2024

https://www.yardbarker.com/college_football/articles/jimmy_the_gambler_made_all_the_difference_for_penn_state/s1_17286_41290206

😂 😂 😂

I don't think I've ever heard James Franklin called a gambler.

The game against Minnesota was too close for comfort, but a win is a win. In many ways, thanks to Jimmy, The Gambler.

Permalink: /feed/jimmy-gambler-psu/

Tags: #pennstate #ncaaf #collegefootball #psu

lqdev👽11/19/2024

https://maho.dev/2024/02/a-guide-to-implement-activitypub-in-a-static-site-or-any-website/

I've been following along with this amazing series from Maho on implementing ActivityPub on static websites.

As I think about what I want out of my website and the way I engage in the Fediverse, there's a lot of overlap.

Today, what I mainly use my Mastodon instance for is:

Having a Fediverse presence on a self-hosted Mastodon instance.
Cross-posting posts on by website using the POSSE pattern.

Although I have learned a ton from self-hosting my own Mastodon instance, neither of the points listed above require me self-hosting or even having an account on someone else's instance.

Yesterday, I took the first step in linking my Fediverse presence to my domain. That fulfills my first requirement.

I rarely post original content on Mastodon and for consumption, I already subscribe to accounts and tags via RSS. Therefore, the second requirement is one that can naturally occur without having to use a Mastodon instance as an intermediary. I can just post on my website and because my posts can show up in my outbox, there's no need to POSSE. I don't want my presence to be limited to Mastodon though, but since I plan on supporting media, reviews, and other types of posts, I expect my posts to be accessible across other platforms on the Fediverse like Pixelfed, Bookwyrm, and many others.

Since my website and website features are built using .NET, Maho's guide simplifies my implementation. Because I'm not using any of the existing static site generators and rolled my own, that means that there's going to be some effort and customizations required on my end. It would no different from my custom Webmentions implementation though. My hope is that I will save time I often spend maintaining the server as well as money since I no longer need to rent a server to host my instance. At the same time, I get to learn and contribute to building a more open, decentralized, and personal web.

Permalink: /feed/implement-activitypub-static-site-series-maho/

Tags: #acvititypub #fediverse #staticsite #blogging #indieweb #openweb #personalweb #dotnet #internet

lqdev👽11/19/2024

https://www.youtube.com/watch?v=KK9ktzbBDPE

I saw a reference to this paper yesterday and added it to my Read-It-Later list.

Here's a link to the original paper: https://philpapers.org/archive/AYLITA.pdf

Chances are, I wasnt going to get to it until way later with the backlog of other things I want to read later.

Lucky for me, that was the subject of the latest Deep Questions episode. Cal does a nice job breaking down the main arguments of the paper.

The main takeaway for me was, technology is a great tool. However, when it robs you of your rational decision making abilities and pushes you to compulsive behaviors that move you away from the habits and practices that fuel your growth, it's a problem. For example, lets say you want to build a habit of reading or working out. Unfortunately, when you , time, attention, and energy that would've otherwise been spent on those habits is now diverted towards the distraction. That is irrational and goes against your desire to grow, yet making the rational choice is where most of us struggle. Although the paper focuses on our digital lives, I can see other ways similar arguments can be applied.

I've also included the audio version as well: https://www.buzzsprout.com/1121972/episodes/16124495-ep-327-would-kant-use-tiktok

Permalink: /feed/digital-minimalism-moral-duty-deep-questions/

Tags: #digitalminimalism #postcast #deepquestions #philosophy #calnewport

lqdev👽11/16/2024

https://youtu.be/uNlZ50b6wSs

This video showed up in my feed yesterday. LuvstarKei makes some great points for building your own space on the web.

With the most recent Twitter / X exodus, it's great to see folks moving to places built on more open protocols like AT (Bluesky) or ActivityPub (Fediverse).

That said, those platforms are not your own, even in cases where you self-host. If tomorrow Threads, Bluesky, or Mastodon stop hosting your instance or stop development altogether, what happens to the content and your community? In the case of the Fediverse, many projects are being built in the open and there are other providers you could switch to. The switching costs though may not make it worth bothering to export and keep your content as is usually the case. That's not to say with a personal website you might not have similar issues (i.e. switching from WordPress to Ghost). However, HTML is HTML and you can rehost that anywhere on the internet.

Everyone is looking for different things from the social platforms, and that's not to say they don't have their value. However, from an ownership, permanence, and creative freedom perspective, nothing beats having your own website.

Permalink: /feed/why-you-should-make-a-website-luvstarkei/

Tags: #website #indieweb #socialmedia #personalweb

lqdev👽11/15/2024

https://www.youtube.com/playlist?list=PLdo4fOcmZ0oXeSG8BgCVru3zQtw_K4ANY

The recordings are out for .NET Conf 2024 in case you missed any of the sessions.

Here are the recordings from the sessions I participated in.

Also, there's a Premier Bonus playlist which has a ton of amazing bonus content.

Permalink: /feed/dotnet-conf-2024-session-playlist/

Tags: #dotnet #dotnetconf #net9 #dotnet9 #programming #dev

lqdev👽11/13/2024

https://www.youtube.com/watch?v=KvR_QlXaAfU

Love this new "making of" video. I was surprised to learn Thundercat is on the new album. I didn't catch which track it's on, but will be listening for it.

Permalink: /feed/thundercat-chromakopia/

Tags: #tylerthecreator #chromakopia #thundercat #music

lqdev👽11/13/2024

https://www.youtube.com/watch?v=hM4ifrqF_lQ

Day 1 of .NET Conf was a lot of fun. Tons of great sessions. Here's the recording of all the sessions.

If you're interested in checking out my session with Jeremy, Building AI Applications From Scratch, here's a link to it.

Tomorrow, I'll also get a chance to share the stage with Tarek and Tanner to talk AI Fundamentals.

Permalink: /feed/dotnetconf-2024-day-1-success/

Tags: #dotnet #dotnetconf #dotnet9 #net9 #conference #ai #aspire #microsoft

lqdev👽11/10/2024

https://leo.fm/2024/11/04/you-might-notice.html

So cool to see you posting more on your own platform.

Permalink: /feed/leo-fm-moves-to-microblog/

Tags: #indieweb #microblog #twit

lqdev👽11/05/2024

https://www.microsoft.com/en-us/research/blog/introducing-drift-search-combining-global-and-local-search-methods-to-improve-quality-and-efficiency/

Introducing DRIFT Search: Combining global and local search methods to improve quality and efficiency

DRIFT search (Dynamic Reasoning and Inference with Flexible Traversal)...builds upon Microsoft’s GraphRAG technique, combining characteristics of both global and local search to generate detailed responses in a method that balances computational costs with quality outcomes.

DRIFT Search: A step-by-step process

Primer: When a user submits a query, DRIFT compares it to the top K most semantically relevant community reports. This generates an initial answer along with several follow-up questions, which act as a lighter version of global search. To do this, we expand the query using Hypothetical Document Embeddings (HyDE), to increase sensitivity (recall), embed the query, look up the query against all community reports, select the top K and then use the top K to try to answer the query. The aim is to leverage high-level abstractions to guide further exploration.

Follow-Up: With the primer in place, DRIFT executes each follow-up using a local search variant. This yields additional intermediate answers and follow-up questions, creating a loop of refinement that continues until the search engine meets its termination criteria, which is currently configured for two iterations (further research will investigate reward functions to guide terminations). This phase represents a globally informed query refinement. Using global data structures, DRIFT navigates toward specific, relevant information within the knowledge graph even when the initial query diverges from the indexing persona. This follow-up process enables DRIFT to adjust its approach based on emerging information.

Output Hierarchy: The final output is a hierarchy of questions and answers ranked on their relevance to the original query. This hierarchical structure can be customized to fit specific user needs. During benchmark testing, a naive map-reduce approach aggregated all intermediate answers, with each answer weighted equally.

Permalink: /feed/introducing-drift-search/

Tags: #ai #search #rag #llm #microsoft #research #msr #graphrag

lqdev👽11/05/2024

https://www.microsoft.com/en-us/research/blog/research-focus-week-of-october-28-2024/

METAREFLECTION: Learning Instructions for Language Agents using Past Reflections

Language agents are AI systems that can understand, reason and respond in natural language to complete various tasks. While the latest LLMs are capable enough to power reasonably good language agents, the closed-API model makes it hard to improve them when they perform sub-optimally. Recent studies have explored using techniques like self-reflection and prompt optimization to improve performance. Unfortunately, self-reflection can be used only during the agent’s current run, while contemporary prompt optimization techniques are designed and tested to work on simple single-step agents.

In a recent paper: METAREFLECTION: Learning Instructions for Language Agents using Past Reflections, researchers from Microsoft introduce a novel offline reinforcement learning technique that enhances the performance of language agents by augmenting a semantic memory based on experiential learnings from past trials. They demonstrate the efficacy of METAREFLECTION across multiple domains, including complex logical reasoning, biomedical semantic similarity, open world question answering, and vulnerability threat detection, in Infrastructure-as-Code, spanning different agent designs. METAREFLECTION boosts language agents’ performance by 4% to 16.82% over the baseline agent implementations and performs on par with existing state-of-the-art prompt optimization techniques while requiring fewer LLM calls.

Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping

Generative AI applications rely on large, foundation models, particularly LLMs. LLMs often have tens to hundreds of billions of parameters, making them too large for a single graphics processing unit (GPU) to handle in terms of both memory and computation. Because of their size, training these models requires distributing the workload across hundreds or even thousands of GPUs. This can lead to significant communication overhead, a challenge that arises when data needs to be shared between different GPUs.

In a recent paper: Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping, researchers from Microsoft introduce a system designed to enhance the efficiency of LLM training by reducing the time lost to communication between GPUs.

Domino breaks down data dependencies in a single batch of training into smaller, independent pieces. These smaller pieces are processed in parallel, and communication between GPUs happens simultaneously with computation, minimizing delays.

Test results comparing Domino to Megatron-LM show that Domino speeds up the training process by up to 1.3x on Nvidia DGX-H100 GPUs.

OmniParser for pure vision-based GUI agent

Large vision-language models (VLMs) such as GPT-4V and GPT-4o show promise in driving intelligent agent systems that operate within user interfaces (UI). However, VLMs’ full potential remains underexplored in real-world applications, particularly when it comes to acting as general agents across diverse operating systems and applications with only vision input. One limiting factor is the absence of a robust technique for screen parsing which is capable of 1) reliably identifying interactable icons within the user interface, and 2) understanding the semantics of various elements in a screenshot and accurately associating the intended action with the corresponding region on the screen.

In a recent article: OmniParser for pure vision-based GUI agent, researchers from Microsoft present a compact screen parsing module that can convert UI screenshots into structured elements. OmniParser can be used with a variety of models to create agents capable of taking actions on UIs. When used with GPT-4V, OmniParser significantly improves the agent capability to generate precisely grounded actions for interface regions.

OmniParser with GPT-4V agent achieved the best performance on the recently released  WindowsAgentArena (opens in new tab) benchmark.

Permalink: /feed/microsoft-research-focus-oct-28-2024/

Tags: #ai #microsoft #research #msr #microsoftresearch

lqdev👽11/05/2024

https://arxiv.org/abs/2410.23775

Recent research arXiv:2410.15027 has explored the use of diffusion transformers (DiTs) for task-agnostic image generation by simply concatenating attention tokens across images. However, despite substantial computational resources, the fidelity of the generated images remains suboptimal. In this study, we reevaluate and streamline this framework by hypothesizing that text-to-image DiTs inherently possess in-context generation capabilities, requiring only minimal tuning to activate them. Through diverse task experiments, we qualitatively demonstrate that existing text-to-image DiTs can effectively perform in-context generation without any tuning. Building on this insight, we propose a remarkably simple pipeline to leverage the in-context abilities of DiTs: (1) concatenate images instead of tokens, (2) perform joint captioning of multiple images, and (3) apply task-specific LoRA tuning using small datasets (e.g., 20∼100 samples) instead of full-parameter tuning with large datasets. We name our models In-Context LoRA (IC-LoRA). This approach requires no modifications to the original DiT models, only changes to the training data. Remarkably, our pipeline generates high-fidelity image sets that better adhere to prompts. While task-specific in terms of tuning data, our framework remains task-agnostic in architecture and pipeline, offering a powerful tool for the community and providing valuable insights for further research on product-level task-agnostic generation systems.

Repo: https://github.com/ali-vilab/In-Context-LoRA

Permalink: /feed/in-context-lora/

Tags: #ai #transformers #diffusers #imagegeneration #lora #finetuning

lqdev👽11/05/2024

https://arxiv.org/abs/2410.10630

LLMs are typically trained to answer user questions or follow instructions similarly to how human experts respond. However, in the standard alignment framework they lack the basic ability of explicit thinking before answering. Thinking is important for complex questions that require reasoning and planning -- but can be applied to any task. We propose a training method for equipping existing LLMs with such thinking abilities for general instruction following without use of additional human data. We achieve this by an iterative search and optimization procedure that explores the space of possible thought generations, allowing the model to learn how to think without direct supervision. For each instruction, the thought candidates are scored using a judge model to evaluate their responses only, and then optimized via preference optimization. We show that this procedure leads to superior performance on AlpacaEval and Arena-Hard, and shows gains from thinking on non-reasoning categories such as marketing, health and general knowledge, in addition to more traditional reasoning & problem-solving tasks.

Permalink: /feed/thinking-llms-instruction-following-thought-generation/

Tags: #ai #llm #research #meta #chainofthought #training #finetuning

lqdev👽11/01/2024

https://notes.jim-nielsen.com/#2024-10-30T2141

The “have to” narrative positions us as repositories of instructions made elsewhere, as if we were just programs following the code we’ve been given...

The “choose to” narrative has no illusions about our power and recognizes that we are small players in a bigger, and certainly unjust, world. But we are not machines. And maybe we don’t like the choices available to us, maybe we wish there were others within reach. But once we accept that there are choices to make, we may notice where we have some room to maneuver, some space to play with, some opportunity or avenue or loophole we can exploit.

Refusing your own agency time and again is like disconnecting from a power source—the energy is still there, latent and ready, but the plug dangles inches from the outlet.

...accepting that you have some agency might hurt: you bump up against the systems that constrain your choices; you see more clearly how other people’s choices limit (or expand) your own. But it keeps you connected to that source, that font of energy that is yours and no one else’s. It keeps you hooked up to who you are, and to what you want.

Interesting excerpts from Mandy Brown's "Haves and Choices" article to think about.

Permalink: /feed/haves-choices-brown-nielsen/

Tags: #life #thoughts

lqdev👽11/01/2024

https://github.com/meta-llama/llama-recipes/tree/main/recipes/quickstart/NotebookLlama

This is cool! Already looking at how I can repurpose this sample for a project I have in mind.

Permalink: /feed/notebook-llama-oss-notebook-lm/

Tags: #ai #sample #postcast #texttospeech #programming #sample #tutorial

lqdev👽10/31/2024

https://bloody-disgusting.com/the-further/3837684/terrifier-takeover-art-the-clown-will-ring-the-nasdaq-closing-bell-in-times-square-on-halloween/

With Terrifier 3 now playing in theaters and just passing $50 million worldwide, Art the Clown’s domination of the Halloween season continues with a takeover of the Nasdaq MarketSite in Times Square!

...Art the Clown and star Lauren LaVera will ring the Nasdaq closing bell!

This is so cool! Hopefully that's all he does.

Permalink: /feed/art-clown-nasdaq-bell-halloween/

Tags: #art #terrifier #halloween #movie #horror #newyork #ny #nasdaq

lqdev👽10/30/2024

https://www.youtube.com/watch?v=E-BbWldXfJk&list=OLAK5uy_kQDB4vsLJy0bmjuYsCXKa-eMO-W33na2w

He's back! A day after Tyler too.

I already loved Garmonbozia but Ajhussi is my favorite track.

Although on this album you hear some of the early FlyLo sound, it still feels new and fresh.

Permalink: /feed/spirit-box-flying-lotus/

Tags: #music #flyinglotus #spiritbox #newmusic #album

lqdev👽10/28/2024

https://www.youtube.com/watch?v=hCcwCv3G1FQ&list=OLAK5uy_nt1Nw4wT6I7VlzNknxTiIz3hfED0ttO8Q

I did a first pass earlier today and I generally enjoyed this new album from Tyler.

Like many previous albums, it's highly introspective. I loved the instrumentals as well.

Some first impressions:

Track 10 is not a double track as it's generally been the case of his previous albums.
"Like Him" adds a new contrasting dimension to Tyler's relationship with his father.

I'll need some more time to digest it and read through the lyrics but overall, great album.

Permalink: /feed/chromakopia-tyler-the-creator-released/

Tags: #chromakopia #music #tylerthecreator #newmusic #album #rap #hiphop

lqdev👽10/25/2024

https://apnews.com/article/famous-grizzly-399-killed-grand-teton-wyoming-3e13c4b5234926cbd799dbb3db1ffac8

Sad news from the Tetons this week. Grizzly 399 passed away after being struck by a vehicle.

Grizzly No. 399 died Tuesday night on a highway in Snake River Canyon south of Jackson...

At 28 years old, No. 399 was the oldest known reproducing female grizzly in the Yellowstone ecosystem. Each spring, wildlife enthusiasts eagerly awaited her emergence from her den to see how many cubs she had birthed over the winter — then quickly shared the news online.

The grizzly lived through a time of strife over her species in the region...

Permalink: /feed/rip-399-queen-tetons/

Tags: #399 #grandteton #grizzly #teton

lqdev👽10/22/2024

https://www.youtube.com/watch?v=x8WTPgeVPjg

I think this is Kamasi's first time on Tiny Desk. Great performance. I didn't realize Brandon Coleman was on keys. The encore, "Asha The First", was really good.

Permalink: /feed/kamasi-washington-tiny-desk-concert-npr/

Tags: #kamasiwashington #music #tinydesk #npr #jazz #brainfeeder

lqdev👽10/21/2024

https://open.spotify.com/track/1tnZxHryc2wWtjUZC1LQw5

I'm ready for this new album. It reminds me of Cherry Bomb in some ways. Love the instrumentals.

Permalink: /feed/noid-tyler-the-creator-chromakopia/

Tags: #music #chromakopia #tylerthecreator #noid #newmusic

lqdev👽10/18/2024

https://www.youtube.com/watch?v=gkZ4dLMH-B8

2024 just keeps delivering. Most of the artists I follow have dropped an album this year. Now I'm just waiting on FlyLo. Hopefully he releases his album by end of year.

I don't know what to expect from Chromakopia, but I can't wait after watching this trailer.

Permalink: /feed/chromakopia-tyler-the-creator/

Tags: #music #tylerthecreator #chromakopia #newmusic

lqdev👽10/18/2024

https://pluralistic.net/2024/10/16/keep-it-really-simple-stupid/#read-receipts-are-you-kidding-me-seriously-fuck-that-noise

Yes, you should!

...the conduit through which I experience Molly's excellent work is totally enshittification-proof, and the more I use it, the easier it is for everyone to be less enshittified.

This conduit is anti-lock-in, it works for nearly the whole internet. It is surveillance-resistant, far more accessible than the web or any mobile app interface. It is my secret super-power.

It's RSS.

...But RSS isn't just good for the news! It's good for everything.

Your RSS reader doesn't (necessarily) have an algorithm. By default, you'll get everything as it appears, in reverse-chronological order.

Now, you sign up to so many feeds that you're feeling overwhelmed and you want an algorithm to prioritize posts – or recommend content. Lots of RSS readers have some kind of algorithm and recommendation system...

But you control the algorithm, you control the recommendations. And if a new RSS reader pops up with an algorithm you're dying to try, you can export all the feeds you follow with a single click, which will generate an OPML file. Then, with one click, you can import that OPML file into any other RSS reader in existence and all your feeds will be seamlessly migrated there. You can delete your old account, or you can even use different readers for different purposes.

RSS basically works like social media should work. Using RSS is a chance to visit a utopian future in which the platforms have no power, and all power is vested in publishers, who get to decide what to publish, and in readers, who have total control over what they read and how, without leaking any personal information through the simple act of reading.

And here's the best part: every time you use RSS, you bring that world closer into being!

Unlike those largely useless, performative boycotts of widely used platforms, switching to RSS doesn't require that you give anything up. Not only does switching to RSS let you continue to follow all the newsletters, webpages and social media accounts you're following now, it makes doing so better: more private, more accessible, and less enshittified.

Using RSS to follow the stuff that matters to you will have an immediate, profoundly beneficial impact on your own digital life – and it will appreciably, irreversibly nudge the whole internet towards a better state.

Great article by Cory Doctorow.

I've written at length on my website as to why I enjoy RSS. My use goes back to the days of Google Reader. Like many, once that was disbanded, my consumption pretty much shifted to social media for a few years. After growing dissatisfaction with the limitations and lack of control over my feed, I started my journey back to RSS readers eventually landing on NewsBlur and elfeed as my readers of choice.

On this website, RSS plays an important role in a few areas.

RSS is how I:

So how can you get started?

Learn about feeds.
Check out the many getting started guides.
Browse through blogrolls and feed catalogs
Explore various feed readers to find the one that works best for you.

Permalink: /feed/you-should-be-using-rss-reader-pluralistic/

Tags: #rss #pluralistic #indieweb #protocols #openweb #freeweb #internet #smallweb

lqdev👽10/10/2024

https://media.defcon.org/

Videos, Slides, Presentations, and assets from DEFCON.

Permalink: /feed/defcon-media-server/

Tags: #defcon #media #persentations #videos #slides

lqdev👽10/10/2024

https://www.citationneeded.news/fighting-for-our-web/

I haven't watched the entire talk, but here are some excerpts from the transcript that resonated with me.

For a lot of people, the web feels worse than it used to. Much of our time is spent on a handful of giant social networks that do everything they can to keep people in their apps for as long as possible, even if it has detrimental effects on the people who are using them, or on the social networks themselves.

Websites outside of these handful of social networks are harder and harder to even find, and partially because of that, they have a harder and harder time sustaining themselves.

What is left for those of us who saw the web as an infinite canvas, a tool to reach those who we never could have dreamed of reaching before, a medium that could stretch the limits of what was even possible in an analog world?

...having said all this, would it surprise you to hear that now more than ever, I feel that same burning feeling of excitement around what’s possible?

That’s because what really sucks about the web these days, what has us feeling despair and anger, has everything to do with the industry that has formed around the web, but not the web itself. The web is still just a substrate on which anything can be built. Most importantly, the web is the people who use it, not the companies that have established themselves around it.

And the widespread disillusionment that we’re seeing may actually be a good thing. More people than ever have realized that the utopian dreams of a web that could only bring about positive and wonderful things might have been misguided.

With this knowledge comes power. The power to shape the web that we want to see, while fighting against the one that we don’t.

My experience in fighting this fight has helped to convince me of just how much power we, everyday normal people, have—even when we’re staring down massive platforms and billion-dollar companies.

I ended up doing what I enjoy doing—building something cool, mostly for the sake of building it, and writing down what I saw.

...this simple act of building something interesting and somewhat different had an impact that I could not have anticipated. It turned out that people were starving for it.

And even though I was a nobody, it had a major impact.

The lesson was clear. The tech industry, even with its billions of dollars, is not an indomitable force. Though they can and will ignore what people want from them, they cannot control what those people think. And the tide can turn against them. And it doesn’t always take a job at The New York Times or a huge pre-established platform to become one of the voices speaking up, helping to turn that tide. Sometimes you just have to make something cool. And using the very same technology that enabled crypto guys to sell their scam tokens, or the boosterist journalists to publish obsequious descriptions of companies that were really selling vaporware, or the crypto community to spam social media with promotions for their NFTs, I was able to do something a little different to push back against that very same phenomenon.

What they don’t seem to realize is that in doing so, by reducing the web only to the types of expression that can happen within their cramped boxes—where you can’t write more than 280 characters, or you can’t publish your cool JavaScript-based art project, or you can’t say the things that you want to say without getting de-boosted by the engagement maximization machine, or you can’t read what your friends are posting without the platform interjecting offensive troll posts or soulless AI-generated meme images—they’re creating a thirst for everything outside of those boxes.

A thirst for what the web really is—a medium, a conduit, a tool that is used by readers and artists and creators and explorers, not a gatekeeper that seems to be in an ever more adversarial relationship with everyone who uses it.

The platforms I talk about are deeply entrenched to the point where many people barely use the web outside of them.

But the platforms do not exist without the people, and there are a lot more of us than there are of them.

...it has never been easier to do cool things on the web. What used to be expensive and require a great deal of technical expertise is now becoming more and more accessible, both financially and technologically, by the day. More people than ever have access to the web, both in terms of access to devices, but also in terms of access to internet connectivity and software. What used to be the realm of the nerds and those with the financial wherewithal to purchase expensive home computers, connectivity, and software packages, is now home to people from all walks of life, who bring new ideas, perspectives, and experiences that we often forget were in short supply during what some of us think of as the “good old days” of the internet.

And so, we—all of us—can build the projects we want to see, write the software that the big platforms won’t, and create the services that people need. We can wire everything together, whether the platforms like it or not, to tear down the walls that they put up. We can modify the software, reverse engineer their systems, and wrestle back control of how we experience the web, even through these very platforms. We can share information and help people understand what is happening around them. We can teach others to build the things they want to build, and share with one another the important and meaningful work that is being done.

We can build the web that we want to see, and we can return to that place where the web is a place of wonder, where all of us feel that same burning feeling of excitement as we push the web back towards the wonderful, beautiful, joyful place it ought to be.

Permalink: /feed/fighting-for-our-web-white/

Tags: #internet #indieweb #protocolsnotplatforms #humanweb #smallweb #web30

lqdev👽09/25/2024

https://www.youtube.com/watch?v=o5TmORitlKk&list=OLAK5uy_krxU85sRSpvmZqiwHyVmYNhKBDVgvj-CE

This is such a beautiful album. It seems I'm not the only one that thinks so since it's rated as the #1 album of all time by Rolling Stone. The powerful messages in the lyrics still resonate to this day and the composition is such a perfect match for Marvin Gaye's voice. 50 years later it feels like this album has transcended space and time.

Over the past few days, I've listened to it from beginning to end at least 3 times.

Of course there's the classics like What's Going On and Mercy Mercy Me that many people know. However, the ones that have quickly become my favorites are:

Save The Children

God Is Love

Permalink: /feed/marvin-gaye-whats-going-on/

Tags: #music #marvingaye #whatsgoingon #soul #rnb #album #war #nature #environment #conservation #love

lqdev👽09/15/2024

https://www.youtube.com/watch?v=2nOxSeBhQAA

Simply amazing.

Permalink: /feed/andre-3000-listening-to-the-sun/

Tags: #andre3000 #music #instrumental #experimental #ambient #jazz #creative #instrumental

lqdev👽09/15/2024

https://www.youtube.com/watch?v=3pxrECZYEAA

I love the visuals on this video.

Permalink: /feed/cosmohedron-duncan-hatch/

Tags: #short #animation #psychidelic #duncanhatch #video #youtube

lqdev👽09/05/2024

https://automattic.com/2024/08/27/shipping-tumblr-and-wordpress/

Since Automattic acquired Tumblr we’ve made it more efficient, grown its revenue, and worked to improve the platform. But there’s one part of the plan that we haven’t yet started, which is to run Tumblr on WordPress. I’m pleased to say we’re kicking off that project now!

A little late, but I can't express how much I love this.

We love Tumblr’s streamlined posting experience and its current product direction. We’re not changing that. We’re talking about running Tumblr’s backend on WordPress. You won’t even notice a difference from the outside.

Although it sounds outside the scope of the original plan, it would be amazing if the shared backend as well as the Tumblr publishing front-end made their way into the open-source version of WordPress.

That would open the door to self-hosted personal websites that also make it significantly easier to post content in smaller chunks like a microblog.

Pair that with ActivityPub integrations and you now have a connected web of individuals where the first place they post content to is their own website and for discovery / broader reach can also federate and post to other platforms that also support protocols like ActivityPub.

Can't wait to see how this project develops.

Do you yearn for the days when people owned their corner of the internet and expressed themselves in wild and wacky ways? Do you want to see an internet focused on creativity, art, and ideas instead of debating and dividing? Do you think content and data should be owned by authors and artists, instead of getting locked behind the closed platform of a mega-corporation? Do you want to build an internet where anyone with a story can tell it, and anyone with a product can sell it, regardless of income, gender, politics, language, or where they live in the world?

The answer to all of those is YES.

Permalink: /feed/shipping-wordpress-tumblr/

Tags: #wordpress #tumblr #personalweb #blogging #indieweb #microblog #posse #smallweb #automattic

lqdev👽09/05/2024

https://devblogs.microsoft.com/dotnet/discover-dotnet-at-dev-intersection-las-vegas-2024/

I'm excited to be at DEVintersection again this year where I'll get a chance to meet old and new friends.

I have a few sessions where I'll be talking about some of my favorite things, .NET and AI.

Other sessions I recommend as well:

There's so many more, but in the interest of not listing them all out, check out the schedule.

See you there!

Permalink: /feed/devintersection-vegas-sept-2024/

Tags: #devintersection #conference #dotnet #ai #aspnet #las #lasvegas #desert

lqdev👽08/27/2024

https://www.youtube.com/watch?v=4l_gUwdPrNY

I came across this gem over the weekend. Mikaela Davis on the harp just takes Bird Song and Ripple to the next level.

Permalink: /feed/npr-tiny-desk-bob-weir-wolf-bros-mikaela-davis/

Tags: #music #tinydesk #gratefuldead #bobweir #npr #wolfbros #mikaeladavis #concert #deadhead

lqdev👽08/22/2024

https://engineering.fb.com/2024/08/21/production-engineering/bringing-llama-3-to-life/

At AI Infra @ Scale 2024, Meta engineers discussed every step of how we built and brought Llama 3 to life, from data and training to inference.

Permalink: /feed/meta-bringing-llama-3-to-life/

Tags: #ai #meta #llama3 #opensource #llm

lqdev👽08/22/2024

https://research.google/blog/transformers-in-music-recommendation/

We present a music recommendation ranking system that uses Transformer models to better understand the sequential nature of user actions based on the current user context.

Permalink: /feed/transformers-music-recommendation-google/

Tags: #google #ai #transformers #deeplearning #neuralneworks #recommendation #research #music

lqdev👽08/21/2024

https://open.spotify.com/track/518ruJoGWraifuVpTBKr5a?si=6e7c5ac2ff7c42f3

As above, so below.

Some new Chicano Batman was a pleasant surprise this morning. Even better, an instrumental track.

You can hear some of their older sound in this track, which I really enjoyed.

Permalink: /feed/new-chicano-batman-tanto-arriba-como-abajo/

Tags: #music #chicanobatman #instrumental #newmusic

lqdev👽08/20/2024

https://www.youtube.com/watch?v=CfB0Uqd2aRE

I've had this song on repeat sinceFriday. Also, I didn't know the name of the song is a Twin Peaks reference. Given his recent focus as a composer, I hope this means there's a bigger Twin Peaks project or collaboration in the works.

Permalink: /feed/pain-and-sorrow-on-repeat/

Tags: #music #flyinglotus #twinpeaks #davidlynch #flylo #tv

lqdev👽08/17/2024

https://x.com/DatDaDatty/status/1824251654813704281

Here's something I didn't know I needed in my life. Perfect way to start a Saturday morning.

Permalink: /feed/thundercat-yo-gabba-gabba/

Tags: #music #thundercat #yogabbagabba #fun

lqdev👽08/17/2024

https://twitter.com/organicmapsapp/status/1824727403580596260

Last night Organic Maps was removed from the Play Store without any warnings or additional details due to "not meeting the requirements for the Family Program". Compared to Google Maps and other maps apps rated for 3+ age, there are no ads or in-app purchases in Organic Maps. We have asked for an appeal.

The app is still available on F-Droid. Much better place for getting apps in my opinion.

Permalink: /feed/organic-maps-removed-play-store/

Tags: #organicmaps #google #play #fdroid #opensource #osm #openstreetmaps #maps #app #store

lqdev👽08/16/2024

https://flyinglotus.bandcamp.com/track/garmonbozia

Good way to start a Friday. New drop from FlyLo. I like it.

Permalink: /feed/new-flying-lotus-garmonbozia/

Tags: #music #flyinglotus #garmonbozia #newmusic #flylo #bandcamp

lqdev👽08/15/2024

https://arxiv.org/abs/2408.04619

Transformers have revolutionized machine learning, yet their inner workings remain opaque to many. We present Transformer Explainer, an interactive visualization tool designed for non-experts to learn about Transformers through the GPT-2 model. Our tool helps users understand complex Transformer concepts by integrating a model overview and enabling smooth transitions across abstraction levels of mathematical operations and model structures. It runs a live GPT-2 instance locally in the user's browser, empowering users to experiment with their own input and observe in real-time how the internal components and parameters of the Transformer work together to predict the next tokens. Our tool requires no installation or special hardware, broadening the public's education access to modern generative AI techniques. Our open-sourced tool is available at this https URL. A video demo is available at this https URL.

Tool
Video

Permalink: /feed/transformer-explainer-interactive-learning-text-generative-models/

Tags: #ai #transformer #neuralnetwork #learning #gpt

lqdev👽08/15/2024

https://arxiv.org/abs/2408.04948

Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to challenges such as domain specific terminology and complex formats of the documents. We introduce a novel approach based on a combination, called HybridRAG, of the Knowledge Graphs (KGs) based RAG techniques (called GraphRAG) and VectorRAG techniques to enhance question-answer (Q&A) systems for information extraction from financial documents that is shown to be capable of generating accurate and contextually relevant answers. Using experiments on a set of financial earning call transcripts documents which come in the form of Q&A format, and hence provide a natural set of pairs of ground-truth Q&As, we show that HybridRAG which retrieves context from both vector database and KG outperforms both traditional VectorRAG and GraphRAG individually when evaluated at both the retrieval and generation stages in terms of retrieval accuracy and answer generation. The proposed technique has applications beyond the financial domain

Permalink: /feed/hybridrag-knowledge-graphs-vector-rag-info-extraction/

Tags: #ai #rag #graph #database #knowledgegraph #kb #vector

lqdev👽08/13/2024

https://apnews.com/article/consumer-protection-ftc-fcc-biden-250f6eece6e2665535019128e8fa38da

Given my recent experience unsubscribing from content, there's definitely room for improvement.

Permalink: /feed/us-govt-fcc-easier-unsubscribe/

Tags: #regulation #fcc #subscriptions #uspol #government #us #darkpatterns

lqdev👽08/09/2024

https://openai.com/index/gpt-4o-system-card/

GPT-4o is an autoregressive omni model, which accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It\u2019s trained end-to-end across text, vision, and audio, meaning that all inputs and outputs are processed by the same neural network.

GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window)2 in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.

In line with our commitment to building AI safely and consistent with our voluntary commitments to the White House3, we are sharing the GPT-4o System Card, which includes our Preparedness Framework(opens in a new window)5 evaluations. In this System Card, we provide a detailed look at GPT-4o\u2019s capabilities, limitations, and safety evaluations across multiple categories, with a focus on speech-to-speech (voice)A while also evaluating text and image capabilities, and the measures we\u2019ve taken to enhance safety and alignment. We also include third party assessments on general autonomous capabilities, as well as discussion of potential societal impacts of GPT-4o text and vision capabilities.

Permalink: /feed/gpt4o-system-card/

Tags: #ai #openai #gpt-4o #documentation #llm

lqdev👽08/08/2024

https://bsky.social/about/blog/08-06-2024-board

Nice! I remember reading Protocols, Not Platforms many years ago and felt inspired to seek out a better web.

Permalink: /feed/mike-masnick-bluesky-board-of-directors/

Tags: #bluesky #protocols #platforms #socialmedia #decentralization #distributedweb #indieweb #smallweb #humanweb #web

lqdev👽08/08/2024

https://www.theverge.com/2024/8/7/24215490/jackson-hole-travel-tourism-board-instagram-filter-animals-national-park-safety

Chapelle Show Meme Modern Problems Require Modern Solutions

Permalink: /feed/jackson-hole-instagram-filter-wildlife/

Tags: #wildlife #wyoming #tourons #instagram #socialmedia #fun #jacksonhole #teton #nationalpark

lqdev👽08/06/2024

https://www.theverge.com/2024/8/6/24214374/cnet-zeff-davis-acquisition-digital-media-100-million

JUST $100 million. That's pocket change 🙃

Red Ventures paid $500 million for the tech property once valued at $1.8 billion.

Given previous valuations though, I see why the headline was phrased that way.

Permalink: /feed/cnet-bought-for-just-100-million/

Tags: #cnet #tech #acquisition #news

lqdev👽08/06/2024

https://notes.jeddacp.com/a-blog-directory/

A Blog Directory is a cool project from JC (Probably) and Lou Plummer. Check it out!

Permalink: /feed/blogroll-club-blog-directory/

Tags: #blog #rss #indieweb #blogroll #community #web #smallweb #personalweb

lqdev👽08/04/2024

https://marco.org/2024/07/16/overcast-rewrite

I no longer have an iPhone, but whenever anyone asks for a podcast app recommendation, Overcast is the first one I mention. Congrats on 10 years. Here's to many more.

Permalink: /feed/congratulations-overcast-ten-years/

Tags: #podcast #app #overcast #anniversary #milestone #apple #ios #rss

lqdev👽07/31/2024

https://www.theverge.com/2024/7/31/24210565/reddit-microsoft-anthropic-perplexity-pay-ai-search

GIF of Snoopy laughing

I'm all for compensation. Remind me again, how much are the communities that create the content and make Reddit what it is getting out of these licensing deals?

Permalink: /feed/reddit-companies-pay-data-access/

Tags: #ai #reddit #data #licensing #microsoft #anthropic #perplexity

lqdev👽07/31/2024

https://kiko.io/post/My-well-known-feeds-and-thoughts-beyond/

This is a clever use of .well-known and OPML.

Definitely something I want to experiment with and implement on my site even if it's not widely adopted.

Permalink: /feed/well-known-feeds/

Tags: #standards #feeds #rss #web #indieweb #smallweb

lqdev👽07/30/2024

https://arxiv.org/abs/2402.13753

Large context window is a desirable feature in large language models (LLMs). However, due to high fine-tuning costs, scarcity of long texts, and catastrophic values introduced by new token positions, current extended context windows are limited to around 128k tokens. This paper introduces LongRoPE that, for the first time, extends the context window of pre-trained LLMs to an impressive 2048k tokens, with up to only 1k fine-tuning steps at within 256k training lengths, while maintaining performance at the original short context window. This is achieved by three key innovations: (i) we identify and exploit two forms of non-uniformities in positional interpolation through an efficient search, providing a better initialization for fine-tuning and enabling an 8x extension in non-fine-tuning scenarios; (ii) we introduce a progressive extension strategy that first fine-tunes a 256k length LLM and then conducts a second positional interpolation on the fine-tuned extended LLM to achieve a 2048k context window; (iii) we readjust LongRoPE on 8k length to recover the short context window performance. Extensive experiments on LLaMA2 and Mistral across various tasks demonstrate the effectiveness of our method. Models extended via LongRoPE retain the original architecture with minor modifications to the positional embedding, and can reuse most pre-existing optimizations.

GitHub Repo

Permalink: /feed/longrope-extending-llm-context-2m-tokens/

Tags: #ai #llm #longrope #microsoft #research #msr

lqdev👽07/30/2024

https://maharshi.bearblog.dev/tensors-from-scratch-part-2/

I've been enjoying reading through this Tensors series.

If you're interested, here's also the link to part 1.

Permalink: /feed/tensors-from-scratch-blog-series/

Tags: #tensors #math #ai #c #machinelearning #ml #artificialintelligence

lqdev👽07/30/2024

https://ai.meta.com/blog/segment-anything-2/

Takeaways

Following up on the success of the Meta Segment Anything Model (SAM) for images, we're releasing SAM 2, a unified model for real-time promptable object segmentation in images and videos that achieves state-of-the-art performance.

In keeping with our approach to open science, we're sharing the code and model weights with a permissive Apache 2.0 license.

We're also sharing the SA-V dataset, which includes approximately 51,000 real-world videos and more than 600,000 masklets (spatio-temporal masks).

SAM 2 can segment any object in any video or image - even for objects and visual domains it has not seen previously, enabling a diverse range of use cases without custom adaptation.

SAM 2 has many potential real-world applications. For example, the outputs of SAM 2 can be used with a generative video model to create new video effects and unlock new creative applications. SAM 2 could also aid in faster annotation tools for visual data to build better computer vision systems.

Permalink: /feed/meta-ai-segment-anything-model-2/

Tags: #meta #ai #computervision #sam2 #segmentanythingmodel #aimodel #cv

lqdev👽07/28/2024

https://nyxt.atlas.engineer/article/emacs-hacks.org

Saving this guide for future reference as I set up my elfeed / Nyxt capture workflows for the website.

Additional articles that might be helpful.

Org capture in Nyxt: Taking Notes While Browsing

Permalink: /feed/nyxt-emacs-hacks/

Tags: #nyxt #emacs #orgmode #capture #templates #workflow #lisp

lqdev👽07/16/2024

https://simonwillison.net/2024/Jul/15/facebook-is-the-zombie-internet/#atom-everything

In my experience, the supermajority of engagement on viral AI Facebook pages is just as artificially-generated as the content they publish.

Whether it's a child transforming into a water bottle cyborg, a three-armed flight attendant rescuing Tiger Jesus from a muddy plane crash, or a hybrid human-monkey baby being stung to death by giant hornets, all tend to have copy+pasted captions, reactions & comments which usually make no sense in the observed context.

I've noticed similar patterns on YouTube. Sometime the comments include timestamp links which makes them seem more credible, but upon further inspection, it's all bot activity.

Permalink: /feed/facebook-zombie-internet-willison/

Tags: #bots #web #comments #ai #facebook #internet #personalweb

lqdev👽07/14/2024

https://orgmode.org/manual/Capture-templates.html

Testing org-capture template generated response file

Permalink: /feed/test-emacs-capture-response-org/

Tags: #org #emacs #blogging #automation #orgmode #templates #capturetemplates #website #personalweb #capture

lqdev👽07/06/2024

https://maggieappleton.com/home-cooked-software

The emerging golden age of home-cooked software, barefoot developers, and why the local-first community should help build it

This is a talk I presented Local-first Conference in Berlin, May 2024. It's specifically directed at the local-first community, but its relevant to anyone involved in building software.

For the last ~year I've been keeping a close eye on how language models capabilities meaningfully change the speed, ease, and accessibility of software development. The slightly bold theory I put forward in this talk is that we're on a verge of a golden age of local, home-cooked software and a new kind of developer – what I've called the barefoot developer.

Permalink: /feed/home-cooked-software-barefoot-developers-appleton/

Tags: #localfirst #smallweb #sofware #llm #programming #talk #indieweb

lqdev👽06/27/2024

https://huggingface.co/collections/facebook/llm-compiler-667c5b05557fe99a9edd25cb

Large Language Models (LLMs) have demonstrated remarkable capabilities across a variety of software engineering and coding tasks. However, their application in the domain of code and compiler optimization remains underexplored. Training LLMs is resource-intensive, requiring substantial GPU hours and extensive data collection, which can be prohibitive. To address this gap, we introduce Meta Large Language Model Compiler (LLM Compiler), a suite of robust, openly available, pre-trained models specifically designed for code optimization tasks. Built on the foundation of Code Llama, LLM Compiler enhances the understanding of compiler intermediate representations (IRs), assembly language, and optimization techniques. The model has been trained on a vast corpus of 546 billion tokens of LLVM-IR and assembly code and has undergone instruction fine-tuning to interpret compiler behavior. LLM Compiler is released under a bespoke commercial license to allow wide reuse and is available in two sizes: 7 billion and 13 billion parameters. We also present fine-tuned versions of the model, demonstrating its enhanced capabilities in optimizing code size and disassembling from x86_64 and ARM assembly back into LLVM-IR. These achieve 77% of the optimising potential of an autotuning search, and 45% disassembly round trip (14% exact match). This release aims to provide a scalable, cost-effective foundation for further research and development in compiler optimization by both academic researchers and industry practitioners.

Permalink: /feed/meta-llm-compiler/

Tags: #ai #compiler #meta #llm

lqdev👽06/24/2024

https://www.youtube.com/watch?v=OvlfCW3Ec1g

Nice job by Cal debunking misconceptions about AI model capabilities. The segment highlights a few points I cover in my unpublished NoLM - Not Only Language Models blog post. Specifically the fact that Language Models on their own can't do much and need to be connected to data sources and other systems. Complex AI systems will be built with more specialized roles and leverage various components for their planning and execution. In the end though, models will require integration into existing systems. Those integrations need to be done by people, meaning humans are still in control of the AI-assisted system capabilities.

Later in the podcast, Cal takes a question about distributed webs of trust. I agree with Cal's point of using existing open standards like RSS for content consumption. It's the reason you often hear the phrase, "or wherever you get your podcasts". Assuming you have a program that can read an RSS feed, you can follow all types of content. On the topic of discovery, Cal makes the suggestion of using distributed webs of trust. Using domain names and linking as ways of discovering content. While blogrolls were not directly called out, it's one of the benefits a curated set of links provides.

Permalink: /feed/deep-questions-debunking-genai-rss-blogroll/

Tags: #ai #deepquestions #rss #calnewport #podcast #blogroll #opml #social #socialmedia #distributedweb

lqdev👽06/11/2024

https://ntietz.com/blog/blogging-affirmations/

The affirmations

Here are the things I've seen and learned. Each of these will be expanded in its own section.

You have things to write about.

Your perspective matters.

You are good enough.

Posts don't have to be novel.

People will read it.

Mistakes are okay!

It's okay to ask for things.

You can get started quickly.

You can write on a schedule.

Meme of man pointing at himself in the mirror

Permalink: /feed/ntietz-blogging-affirmations/

Tags: #blogging #writing #personalweb #socialweb #indieweb

lqdev👽06/11/2024

https://www.npr.org/2024/06/07/nx-s1-4976071/the-cassette-tape-is-making-a-comeback-thanks-to-a-family-run-company-in-missouri

Despite the odds, cassette tapes are making a comeback. And one family-owned company in Springfield, Missouri is a leader in the revival.

Hopefully this also means that players are also making a comeback because you need something to play them on. Having visited a museum recently full of old radio recordings on cassette tapes, I was sad to find out I couldn't listen to them because the tape player was broken. It's hard enough finding a decent MP3 player, I'm sure it must be just as hard if not harder finding a tape player.

Permalink: /feed/npr-cassette-tapes-making-comeback/

Tags: #cassette #tapes #retro #music #comeback

lqdev👽06/11/2024

https://www.modular.com/blog/deep-dive-into-ownership-in-mojo

In the second part of the ownership series in Mojo, we built on the mental model developed in the first part and provided practical examples to illustrate how ownership works in Mojo. We covered the different kinds of values (BValue, LValue, and RValue) and how they propagate through expressions. We also explained the function argument conventions (borrowed, inout, owned) and demonstrated how these conventions help manage memory safely and efficiently. We concluded with three fundamental rules:

Rule 1: Owned arguments take RValue on the caller side but are LValue on the callee side.

Rule 2: Owned arguments own the type if the transfer operator ^ is used; otherwise, they copy the type if it is Copyable.

Rule 3: Copy operations are optimized to move operations if the type is Copyable and Movable and isn’t used anymore, reducing unnecessary overhead.

Lastly, we emphasized that the main goals of ownership in Mojo are:

Memory Safety: Enforcing exclusive ownership and proper lifetimes to prevent memory errors such as use-after-free and double-free.

Performance Optimization: Converting unnecessary copy operations into move operations to reduce overhead and enhance performance.

Ease of Use: Automating memory management through ownership rules and the transfer operator, simplifying development.

Compile-Time Guarantees: Providing strong compile-time guarantees through type-checking and dataflow lifetime analysis, catching errors early in the development process.

Permalink: /feed/deep-dive-ownership-mojo/

Tags: #mojo #python #c #cpp #c++ #pl #programming #programminglanguage

lqdev👽06/11/2024

https://www.eff.org/deeplinks/2024/05/bigfoot

I’m proud to share the first post in a series from our friends, The Encryptids—the rarely-seen enigmas who inspire campfire lore. But this time, they’re spilling secrets about how they survive this ever-digital world. We begin by checking in with the legendary Bigfoot de la Sasquatch...

People say I'm the most famous of The Encryptids, but sometimes I don't want the spotlight. They all want a piece of me: exes, ad trackers, scammers, even the government. A picture may be worth a thousand words, but my digital profile is worth cash (to skeezy data brokers). I can’t hit a city block without being captured by doorbell cameras, CCTV, license plate readers, and a maze of street-level surveillance. It can make you want to give up on privacy altogether. Honey, no. Why should you have to hole up in some dank, busted forest for freedom and respect? You don’t.

Privacy isn't about hiding. It's about revealing what you want to who you want on your terms. It's your basic right to dignity.

Permalink: /feed/eff-the-encryptids-bigfoot/

Tags: #eff #privacy #digitalrights #campaign

lqdev👽06/11/2024

https://www.anthropic.com/research/mapping-mind-language-model

Today we report a significant advance in understanding the inner workings of AI models. We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, production-grade large language model. This interpretability discovery could, in future, help us make AI models safer.

Previously, we made some progress matching patterns of neuron activations, called features, to human-interpretable concepts. We used a technique called "dictionary learning", borrowed from classical machine learning, which isolates patterns of neuron activations that recur across many different contexts. In turn, any internal state of the model can be represented in terms of a few active features instead of many active neurons. Just as every English word in a dictionary is made by combining letters, and every sentence is made by combining words, every feature in an AI model is made by combining neurons, and every internal state is made by combining features.

In October 2023, we reported success applying dictionary learning to a very small "toy" language model and found coherent features corresponding to concepts like uppercase text, DNA sequences, surnames in citations, nouns in mathematics, or function arguments in Python code.

We used the same scaling law philosophy that predicts the performance of larger models from smaller ones to tune our methods at an affordable scale before launching on Sonnet.

We successfully extracted millions of features from the middle layer of Claude 3.0 Sonnet, (a member of our current, state-of-the-art model family, currently available on claude.ai), providing a rough conceptual map of its internal states halfway through its computation. This is the first ever detailed look inside a modern, production-grade large language model.

Permalink: /feed/mapping-mind-large-language-model-anthropic/

Tags: #ai #interpretability #anthropic #llm

lqdev👽06/11/2024

https://www.anthropic.com/research/claude-character

Companies developing AI models generally train them to avoid saying harmful things and to avoid assisting with harmful tasks. The goal of this is to train models to behave in ways that are "harmless". But when we think of the character of those we find genuinely admirable, we don’t just think of harm avoidance. We think about those who are curious about the world, who strive to tell the truth without being unkind, and who are able to see many sides of an issue without becoming overconfident or overly cautious in their views. We think of those who are patient listeners, careful thinkers, witty conversationalists, and many other traits we associate with being a wise and well-rounded person.

AI models are not, of course, people. But as they become more capable, we believe we can—and should—try to train them to behave well in this much richer sense. Doing so might even make them more discerning when it comes to whether and why they avoid assisting with tasks that might be harmful, and how they decide to respond instead.

Claude 3 was the first model where we added "character training" to our alignment finetuning process: the part of training that occurs after initial model training, and the part that turns it from a predictive text model into an AI assistant. The goal of character training is to make Claude begin to have more nuanced, richer traits like curiosity, open-mindedness, and thoughtfulness.

Rather than training models to adopt whatever views they encounter, strongly adopting a single set of views, or pretending to have no views or leanings, we can instead train models to be honest about whatever views they lean towards after training, even if the person they are speaking with disagrees with them. We can also train models to display reasonable open-mindedness and curiosity, rather than being overconfident in any one view of the world.

In order to steer Claude’s character and personality, we made a list of many character traits we wanted to encourage the model to have...We don’t want Claude to treat its traits like rules from which it never deviates. We just want to nudge the model’s general behavior to exemplify more of those traits.

Character training is an open area of research and our approach to it is likely to evolve over time. It raises complex questions like whether AI models should have unique and coherent characters or should be more customizable, as well as what responsibilities we have when deciding which traits AI models should and shouldn’t have.

Permalink: /feed/claudes-character-anthropic/

Tags: #anthropic #claude #alignment #ai

lqdev👽06/11/2024

https://github.com/fixie-ai/ultravox

Ultravox is a new kind of multimodal LLM that can understand text as well as human speech, without the need for a separate Audio Speech Recognition (ASR) stage. Building on research like AudioLM, SeamlessM4T, Gazelle, SpeechGPT, and others, we've extended Meta's Llama 3 model with a multimodal projector that converts audio directly into the high-dimensional space used by Llama 3. This direct coupling allows Ultravox to respond much more quickly than systems that combine separate ASR and LLM components. In the future this will also allow Ultravox to natively understand the paralinguistic cues of timing and emotion that are omnipresent in human speech.

The current version of Ultravox (v0.1), when invoked with audio content, has a time-to-first-token (TTFT) of approximately 200ms, and a tokens-per-second rate of ~100, all using a Llama 3 8B backbone. While quite fast, we believe there is considerable room for improvement in these numbers. We look forward to working with LLM hosting providers to deliver state-of-the-art performance for Ultravox.

Ultravox currently takes in audio and emits streaming text. As we evolve the model, we'll train it to be able to emit a stream of speech tokens that can then be converted directly into raw audio by an appropriate unit vocoder. We're interested in working with interested parties to build this functionality!

Permalink: /feed/ultravox-multimodal-llm/

Tags: #ai #multimodal #llm

lqdev👽06/11/2024

https://security.apple.com/blog/private-cloud-compute/

We set out to build Private Cloud Compute with a set of core requirements:

Stateless computation on personal user data. ...we want a strong form of stateless data processing where personal data leaves no trace in the PCC system.

Enforceable guarantee. Security and privacy guarantees are strongest when they are entirely technically enforceable, which means it must be possible to constrain and analyze all the components that critically contribute to the guarantees of the overall Private Cloud Compute system.

No privileged runtime access. Private Cloud Compute must not contain privileged interfaces that would enable Apple’s site reliability staff to bypass PCC privacy guarantees, even when working to resolve an outage or other severe incident.

Non-targetability. An attacker should not be able to attempt to compromise personal data that belongs to specific, targeted Private Cloud Compute users without attempting a broad compromise of the entire PCC system.

Verifiable transparency. Security researchers need to be able to verify, with a high degree of confidence, that our privacy and security guarantees for Private Cloud Compute match our public promises.

Permalink: /feed/apple-private-cloud-compute/

Tags: #ai #pcc #privacy #cloud #privatecloudcompute #wwdc

lqdev👽06/11/2024

https://machinelearning.apple.com/research/introducing-apple-foundation-models

Apple Intelligence is comprised of multiple highly-capable generative models that are specialized for our users’ everyday tasks, and can adapt on the fly for their current activity. The foundation models built into Apple Intelligence have been fine-tuned for user experiences such as writing and refining text, prioritizing and summarizing notifications, creating playful images for conversations with family and friends, and taking in-app actions to simplify interactions across apps.

In the following overview, we will detail how two of these models — a ~3 billion parameter on-device language model, and a larger server-based language model available with Private Cloud Compute and running on Apple silicon servers — have been built and adapted to perform specialized tasks efficiently, accurately, and responsibly. These two foundation models are part of a larger family of generative models created by Apple to support users and developers; this includes a coding model to build intelligence into Xcode, as well as a diffusion model to help users express themselves visually, for example, in the Messages app.

Pre-training

Our foundation models are trained on Apple's AXLearn framework, an open-source project we released in 2023. It builds on top of JAX and XLA, and allows us to train the models with high efficiency and scalability on various training hardware and cloud platforms, including TPUs and both cloud and on-premise GPUs. We used a combination of data parallelism, tensor parallelism, sequence parallelism, and Fully Sharded Data Parallel (FSDP) to scale training along multiple dimensions such as data, model, and sequence length.

Post-training

We find that data quality is essential to model success, so we utilize a hybrid data strategy in our training pipeline, incorporating both human-annotated and synthetic data, and conduct thorough data curation and filtering procedures. We have developed two novel algorithms in post-training: (1) a rejection sampling fine-tuning algorithm with teacher committee, and (2) a reinforcement learning from human feedback (RLHF) algorithm with mirror descent policy optimization and a leave-one-out advantage estimator. We find that these two algorithms lead to significant improvement in the model’s instruction-following quality.

Optimization

Both the on-device and server models use grouped-query-attention. We use shared input and output vocab embedding tables to reduce memory requirements and inference cost. These shared embedding tensors are mapped without duplications. The on-device model uses a vocab size of 49K, while the server model uses a vocab size of 100K, which includes additional language and technical tokens.

For on-device inference, we use low-bit palletization, a critical optimization technique that achieves the necessary memory, power, and performance requirements. To maintain model quality, we developed a new framework using LoRA adapters that incorporates a mixed 2-bit and 4-bit configuration strategy — averaging 3.5 bits-per-weight — to achieve the same accuracy as the uncompressed models.

Additionally, we use an interactive model latency and power analysis tool, Talaria, to better guide the bit rate selection for each operation. We also utilize activation quantization and embedding quantization, and have developed an approach to enable efficient Key-Value (KV) cache update on our neural engines.

Model Adaptation

Our foundation models are fine-tuned for users’ everyday activities, and can dynamically specialize themselves on-the-fly for the task at hand...

We represent the values of the adapter parameters using 16 bits, and for the ~3 billion parameter on-device model, the parameters for a rank 16 adapter typically require 10s of megabytes. The adapter models can be dynamically loaded, temporarily cached in memory, and swapped — giving our foundation model the ability to specialize itself on the fly for the task at hand while efficiently managing memory and guaranteeing the operating system's responsiveness.

Performance and Evaluation

We compare our models with both open-source models (Phi-3, Gemma, Mistral, DBRX) and commercial models of comparable size (GPT-3.5-Turbo, GPT-4-Turbo)1. We find that our models are preferred by human graders over most comparable competitor models. On this benchmark, our on-device model, with ~3B parameters, outperforms larger models including Phi-3-mini, Mistral-7B, and Gemma-7B. Our server model compares favorably to DBRX-Instruct, Mixtral-8x22B, and GPT-3.5-Turbo while being highly efficient.

To further evaluate our models, we use the Instruction-Following Eval (IFEval) benchmark to compare their instruction-following capabilities with models of comparable size. The results suggest that both our on-device and server model follow detailed instructions better than the open-source and commercial models of comparable size.

Permalink: /feed/introducing-apple-on-device-server-foundational-models/

Tags: #ai #apple #slm #edge #wwdc

lqdev👽06/11/2024

https://www.apple.com/newsroom/2024/06/introducing-apple-intelligence-for-iphone-ipad-and-mac/

Apple today introduced Apple Intelligence, the personal intelligence system for iPhone, iPad, and Mac that combines the power of generative models with personal context to deliver intelligence that’s incredibly useful and relevant.

Apple Intelligence unlocks new ways for users to enhance their writing and communicate more effectively. With brand-new systemwide Writing Tools built into iOS 18, iPadOS 18, and macOS Sequoia, users can rewrite, proofread, and summarize text nearly everywhere they write...

Apple Intelligence powers exciting image creation capabilities to help users communicate and express themselves in new ways. With Image Playground, users can create fun images in seconds, choosing from three styles: Animation, Illustration, or Sketch...All images are created on device, giving users the freedom to experiment with as many images as they want.

Powered by Apple Intelligence, Siri becomes more deeply integrated into the system experience. With richer language-understanding capabilities, Siri is more natural, more contextually relevant, and more personal, with the ability to simplify and accelerate everyday tasks.

A cornerstone of Apple Intelligence is on-device processing, and many of the models that power it run entirely on device. To run more complex requests that require more processing power, Private Cloud Compute extends the privacy and security of Apple devices into the cloud to unlock even more intelligence.

Apple is integrating ChatGPT access into experiences within iOS 18, iPadOS 18, and macOS Sequoia, allowing users to access its expertise — as well as its image- and document-understanding capabilities — without needing to jump between tools.

Permalink: /feed/introducing-apple-intelligence/

Tags: #ai #apple #appleintelligence

lqdev👽06/11/2024

https://www.youtube.com/watch?v=sBXdyUA6A88

I always look forward to watching these recap videos from the Verge.

Permalink: /feed/verge-wwdc-2024-18-minute-recap-video/

Tags: #apple #wwdc #keynote #wwdc2024 #ai #ios #ipad #iphone #watchos #ipados #macos #visionos

lqdev👽06/11/2024

https://www.youtube.com/watch?v=l8pRSuU81PU

Permalink: /feed/karpathy-reproduce-gpt-2-124m-tutorial/

Tags: #ai #gpt #transformer #tutorial

lqdev👽06/11/2024

https://bair.berkeley.edu/blog/2024/05/29/tiny-agent/

High level conceptual diagram of TinyAgent system

The ability of LLMs to execute commands through plain language (e.g. English) has enabled agentic systems that can complete a user query by orchestrating the right set of tools...recent multi-modal efforts such as the GPT-4o or Gemini-1.5 model, has expanded the realm of possibilities with AI agents. While this is quite exciting, the large model size and computational requirements of these models often requires their inference to be performed on the cloud. This can create several challenges for their widespread adoption. First and foremost, uploading data such as video, audio, or text documents to a third party vendor on the cloud, can result in privacy issues. Second, this requires cloud/Wi-Fi connectivity which is not always possible...latency could also be an issue as uploading large amounts of data to the cloud and waiting for the response could slow down response time, resulting in unacceptable time-to-solution. These challenges could be solved if we deploy the LLM models locally at the edge.

...current LLMs like GPT-4o or Gemini-1.5 are too large for local deployment. One contributing factor is that a lot of the model size ends up memorizing general information about the world into its parametric memory which may not be necessary for a specialized downstream application.

...this leads to an intriguing research question:

Can a smaller language model with significantly less parametric memory emulate such emergent ability of these larger language models?

Achieving this would significantly reduce the computational footprint of agentic systems and thus enable efficient and privacy-preserving edge deployment. Our study demonstrates that this is feasible for small language models through training with specialized, high-quality data that does not require recalling generic world knowledge.

Such a system could particularly be useful for semantic systems where the AI agent’s role is to understand the user query in natural language and, instead of responding with a ChatGPT-type question answer response, orchestrate the right set of tools and APIs to accomplish the user’s command. For example, in a Siri-like application, a user may ask a language model to create a calendar invite with particular attendees. If a predefined script for creating calendar items already exists, the LLM simply needs to learn how to invoke this script with the correct input arguments (such as attendees’ email addresses, event title, and time). This process does not require recalling/memorization of world knowledge from sources like Wikipedia, but rather requires reasoning and learning to call the right functions and to correctly orchestrate them.

Our goal is to develop Small Language Models (SLM) that are capable of complex reasoning that could be deployed securely and privately at the edge. Here we will discuss the research directions that we are pursuing to that end. First, we discuss how we can enable small open-source models to perform accurate function calling, which is a key component of agentic systems. It turns out that off-the-shelf small models have very low function calling capabilities. We discuss how we address this by systematically curating high-quality data for function calling, using a specialized Mac assistant agent as our driving application. We then show that fine-tuning the model on this high quality curated dataset, can enable SLMs to even exceed GPT-4-Turbo’s function calling performance. We then show that this could be further improved and made efficient through a new Tool RAG method. Finally, we show how the final models could be deployed efficiently at the edge with real time responses.

Permalink: /feed/TinyAgent-function-calling-edge/

Tags: #agent #ai #slm #function #edge #local #languagemodel #smalllanguagemodel #research

lqdev👽06/08/2024

https://www.youtube.com/watch?v=uRG6IWYVazM

Great performance.

Permalink: /feed/tierra-whack-npr-tiny-desk-2024/

Tags: #music #tinydesk #tierrawhack #hiphop #rap #livemusic #performance #npr

lqdev👽06/06/2024

https://genai-handbook.github.io/

This document aims to serve as a handbook for learning the key concepts underlying modern artificial intelligence systems. Given the speed of recent development in AI, there really isn’t a good textbook-style source for getting up-to-speed on the latest-and-greatest innovations in LLMs or other generative models, yet there is an abundance of great explainer resources (blog posts, videos, etc.) for these topics scattered across the internet. My goal is to organize the “best” of these resources into a textbook-style presentation, which can serve as a roadmap for filling in the prerequisites towards individual AI-related learning goals. My hope is that this will be a “living document”, to be updated as new innovations and paradigms inevitably emerge, and ideally also a document that can benefit from community input and contribution. This guide is aimed at those with a technical background of some kind, who are interested in diving into AI either out of curiosity or for a potential career. I’ll assume that you have some experience with coding and high-school level math, but otherwise will provide pointers for filling in any other prerequisites.

Permalink: /feed/genai-handbook-learning-resource-brown/

Tags: #generativeai #genai #handbook #learning #resource

lqdev👽06/06/2024

https://omakub.org/

Cool project.

Turn a fresh Ubuntu installation into a fully-configured, beautiful, and modern web development system by running a single command.

Omakub is an opinionated take on what Linux can be at its best.

Omakub includes a curated set of applications and tools that one might discover through hours of watching YouTube, reading blogs, or just stumbling around Linux internet. All so someone coming straight from a platform like Windows or the Mac can immediately start enjoying a ready-made system, without having to do any configuration and curation legwork at all.

Permalink: /feed/omakub-ubuntu-developer-desktop/

Tags: #linux #developer #ubuntu #desktop #37signals

lqdev👽05/30/2024

https://rscottjones.com/my-permadomain/

I'm guilty of owning way too many domains but this short-hand (lqdev.me) and other permadomain (luisquintanilla.me) are the ones I use mostly.

A potential happy medium I've seen is using subdomains (i.e. project.lqdev.me).

Jim Nielsen has great examples of how he's doing that today. Though that's an older post, he also addressed it most recently in the post Domain Sins of My Youth.

That somehow feels better than the URL you get when you use free hosting like GitHub Pages. As a bonus, it's already tied to your identity (if using your personal domain). Also, you save some money.

There's a few ways I practice this today. In addition to the previously mentioned domains, I also own lqdev.tech. Services and projects I host live there.

For example:

Mastodon Instance (toot.lqdev.tech)
Matrix server (matrix.lqdev.tech)
Webmentions service (webmentions.lqdev.tech)

So far it's been working well.

Permalink: /feed/my-permadomain-rscottjones/

Tags: #domains #personalweb #permadomain #indieweb #projects #selfhosting #hosting #self-hosting

lqdev👽05/30/2024

https://www.manton.org/2024/05/29/podcast-hosting-for.html

This is cool! The Micro.blog premium plan has tons of great features as well.

Six years ago, we launched our $10/month plan with podcast hosting. Since then we’ve added several big features to the plan...

Today, I want to bring the podcast feature to more people, so we’re moving it down to the standard $5/month plan.

Permalink: /feed/microblog-podcast-hosting/

Tags: #microblog #podcast #indieweb #hosting #blog #blogging

lqdev👽05/28/2024

https://github.com/karpathy/llm.c/discussions/481

...the TLDR is that we're training a 12-layer GPT-2 (124M), from scratch, on 10B tokens of FineWeb, with max sequence length of 1024 tokens.

The 124M model is the smallest model in the GPT-2 series released by OpenAI in 2019, and is actually quite accessible today, even for the GPU poor. With llm.c, which is quite efficient at up to ~60% model flops utilization, reproducing this model on one 8X A100 80GB SXM node takes ~90 minutes. For example, on Lambda this node goes for ~$14/hr, so the total cost of reproducing this model today is about $20. You can train the model with a single GPU too, it would just take proportionally longer (e.g. ~4-24 hours depending on the GPU).

Permalink: /feed/repro-gpt-2-llm-c-90-min-20-dollars-karpathy/

Tags: #ai #llm #gpt #gpt2 #llmc #c #slm

lqdev👽05/27/2024

https://ma.tt/2024/05/wp21/

21 years since Mike and I did the first release of WordPress, forking Michel’s work on b2/cafélog.

I’ve been thinking a lot about elements that made WordPress successful in its early years that we should keep in mind as we build this year and beyond. Here’s 11 opinions:

Simple things should be easy and intuitive, and complex things possible.

...

Wikis are amazing, and our documentation should be wiki-easy to edit.

...

It’s important that we all do support, go to meetups and events, anything we can to stay close to regular end-users of what we make.

Congrats to WordPress and the team on 21 years. I think the following are some good elements to keep building on.

Permalink: /feed/21-years-of-wordpress/

Tags: #wordpress #blogging #web #anniversary #opensource #openweb

lqdev👽05/26/2024

https://www.theverge.com/2024/5/23/24163225/daylight-dc1-tablet-livepaper

Like many in the comments, I like the idea of this device and what it's aiming to do.

The "LivePaper" display technology seems interesting.

At $729 though, it seems a little steep. I paid a fraction of that for my Onyx Boox Nova Air 2 and I have most of the functionality of this device. As an e-reader, note-taking device, and tablet that can install any Android app, it works perfectly fine for my use cases and I'm very happy with it.

Still, I'm interested in seeing whether this takes off or whether it ends up being another Rabbit R1 or Humane Ai Pin.

Permalink: /feed/daylight-dc-1-tablet-computer/

Tags: #eink #digitalminimalism #daylight #tablet #technology #gadgets

lqdev👽05/25/2024

https://tracydurnell.com/2024/05/17/indieweb-next-stage/

🙋

Great post.

One of the challenges I've found, even when using appliances like those from hosting providers like Linode or tools like YunoHost is connecting them to your domain.

Usually, that's specific to your domain name provider and usually a manual process.

In cases where that's easy, you're often overcharged for the convenience. Compared to "free" social media and publishing websites, it makes using your own domain a less desirable option.

Maybe advocating for a multi-staged approach like IndieWebify.Me could make the journey more approachable. More importantly, making it a journey, rather than a destination could help with meeting folks where they are and guide them closer towards their goals.

In any case, the proposed list of goals seem like a great start towards helping people create their own place on the web.

Permalink: /feed/indieweb-next-steps-durnell/

Tags: #indieweb #community #personalweb

lqdev👽05/25/2024

https://www.theverge.com/2024/5/24/24163865/doge-meme-shiba-inu-kabosu-dead-crypto

The face of one of the defining memes of the 2010s, the doge meme, died on Friday. Kabosu, the shiba inu with the knowing face that launched a million internet jokes, was 18 when she died

Permalink: /feed/rip-kabosu-doge/

Tags: #shiba #doge #meme #dog

lqdev👽05/18/2024

https://www.youtube.com/watch?v=AtR1yVmCCvw

I never knew I needed this in my life.

Permalink: /feed/badu-watts-rebillet/

Tags: #badu #watts #rebillet #music

lqdev👽04/28/2024

https://www.theverge.com/24140675/light-phone-2-one-year-retrospect

GIF image with man signing into microphone with caption don't call it a comeback

Between this article and The Dumbphone Boom is Real, I'm seeing more publications on the subject. Far from a comeback, but still cool to see.

Permalink: /feed/one-year-retrospect-lightphone-2-verge/

Tags: #dumbphone #retro #technology #lightphone #minimalism #digitalminimalism

lqdev👽04/28/2024

https://kamasiwashington.ffm.to/fearlessmovement

2024 just keeps getting better in terms of new music releases.

Kamasi Washington has a new album coming out this week called Fearless Movement.

The most recent single, Dream State, with Andre 3000 is great.

Permalink: /feed/new-kamasi-washington-album-fearless-movement/

Tags: #newmusic #kamasiwashington #jazz #album #music

lqdev👽04/25/2024

https://www.newyorker.com/culture/infinite-scroll/the-dumbphone-boom-is-real

Will Stults spent too much time on his iPhone, doom-scrolling the site formerly known as Twitter and tweeting angrily at Elon Musk as if the billionaire would actually notice. Stults’s partner, Daisy Krigbaum, was addicted to Pinterest and YouTube, bingeing videos on her iPhone before going to sleep. Two years ago, they both tried Apple’s Screen Time restriction tool and found it too easy to disable, so the pair decided to trade out their iPhones for more low-tech devices. They’d heard about so-called dumbphones, which lacked the kinds of bells and whistles—a high-resolution screen, an app store, a video camera—that made smartphones so addictive. But they found the process of acquiring one hard to navigate. “The information on it was kind of disparate and hard to get to. A lot of people who know the most about dumbphones spend the least time online,” Krigbaum said. A certain irony presented itself: figuring out a way to be less online required aggressive online digging.

The growing dumbphone fervor may be motivated, in part, by the discourse around child safety online. Parents are increasingly confronted with evidence that sites like Instagram and TikTok intentionally try to hook their children. Using those sites can increase teens’ anxiety and lower their self-esteem, according to some studies, and smartphones make it so that kids are logged on constantly. Why should this situation be any healthier for adults? After almost two decades with iPhones, the public seems to be experiencing a collective ennui with digital life. So many hours of each day are lived through our portable, glowing screens, but the Internet isn’t even fun anymore. We lack the self-control to wean ourselves off, so we crave devices that actively prevent us from getting sucked into them. That means opting out of the prevailing technology and into what Cal Newport, a contributing writer for The New Yorker, has called a more considered “digital minimalism.”

While dumbphones aren't a cure-all for unhealthy technology habits, as a dumbphone user, I can relate to the frustrations that come from the lack of device availability and support. Even when new devices hit the market, they tend to be targeted towards non-US markets.

Permalink: /feed/dumbphone-boom-real/

Tags: #dumbphone #technology #culture #digitalminimalism #minimalism

lqdev👽04/25/2024

https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/

Today, the Snowflake AI Research Team is thrilled to introduce Snowflake Arctic, a top-tier enterprise-focused LLM that pushes the frontiers of cost-effective training and openness. Arctic is efficiently intelligent and truly open.

Efficiently Intelligent: Arctic excels at enterprise tasks such as SQL generation, coding and instruction following benchmarks even when compared to open source models trained with significantly higher compute budgets. In fact, it sets a new baseline for cost-effective training to enable Snowflake customers to create high-quality custom models for their enterprise needs at a low cost.

Truly Open: Apache 2.0 license provides ungated access to weights and code. In addition, we are also open sourcing all of our data recipes and research insights.

Snowflake Arctic is available from Hugging Face, NVIDIA API catalog and Replicate today or via your model garden or catalog of choice, including Snowflake Cortex, Amazon Web Services (AWS), Microsoft Azure, Lamini, Perplexity and Together over the coming days.

Permalink: /feed/snowflake-arctic-enterprise-ai-llm/

Tags: #snowflake #ai #llm #enterprise

lqdev👽04/25/2024

https://www.theverge.com/2024/4/24/24139057/pbs-retro-free-roku-channel-fast-streaming

PBS is making child edutainment classics like Zoboomafoo, Mister Rogers’ Neighborhood, and Reading Rainbow available for free on a new ‘PBS Retro’ channel on Roku.

This is cool! Although I'm kind of bummed the stuff I grew up watching is now considered "retro".

Permalink: /feed/pbs-retro-roku-channel/

Tags: #pbs #roku #tv #fast #retro #education

lqdev👽04/25/2024

https://www.noemamag.com/we-need-to-rewild-the-internet/

The internet has become an extractive and fragile monoculture. But we can revitalize it using lessons learned by ecologists.

Our online spaces are not ecosystems, though tech firms love that word. They’re plantations; highly concentrated and controlled environments...

We all know this. We see it each time we reach for our phones. But what most people have missed is how this concentration reaches deep into the internet’s infrastructure — the pipes and protocols, cables and networks, search engines and browsers. These structures determine how we build and use the internet, now and in the future.

They’ve concentrated into a series of near-planetary duopolies.

Two kinds of everything may be enough to fill a fictional ark and repopulate a ruined world, but can’t run an open, global “network of networks” where everyone has the same chance to innovate and compete.

The internet made the tech giants possible. Their services have scaled globally, via its open, interoperable core. But for the past decade, they’ve also worked to enclose the varied, competing and often open-source or collectively provided services the internet is built on into their proprietary domains. Although this improves their operational efficiency, it also ensures that the flourishing conditions of their own emergence aren’t repeated by potential competitors. For tech giants, the long period of open internet evolution is over. Their internet is not an ecosystem. It’s a zoo.

Up close, internet concentration seems too intricate to untangle; from far away, it seems too difficult to deal with. But what if we thought of the internet not as a doomsday “hyperobject,” but as a damaged and struggling ecosystem facing destruction? What if we looked at it not with helpless horror at the eldritch encroachment of its current controllers, but with compassion, constructiveness and hope?

Rewilding “aims to restore healthy ecosystems by creating wild, biodiverse spaces,” according to the International Union for Conservation of Nature. More ambitious and risk-tolerant than traditional conservation, it targets entire ecosystems to make space for complex food webs and the emergence of unexpected interspecies relations. It’s less interested in saving specific endangered species. Individual species are just ecosystem components, and focusing on components loses sight of the whole. Ecosystems flourish through multiple points of contact between their many elements, just like computer networks. And like in computer networks, ecosystem interactions are multifaceted and generative.

Whatever we do, the internet isn’t returning to old-school then-common interfaces like FTP and Gopher, or organizations operating their own mail servers again instead of off-the-shelf solutions like G-Suite. But some of what we need is already here, especially on the web. Look at the resurgence of RSS feeds, email newsletters and blogs, as we discover (yet again) that relying on one app to host global conversations creates a single point of failure and control. New systems are growing, like the Fediverse with its federated islands, or Bluesky with algorithmic choice and composable moderation.

We don’t know what the future holds. Our job is to keep open as much opportunity as we can, trusting that those who come later will use it. Instead of setting purity tests for which kind of internet is most like the original, we can test changes against the values of the original design. Do new standards protect the network’s “generality,” i.e. its ability to support multiple uses, or is functionality limited to optimize efficiency for the biggest tech firms?

...our internet took off because it was designed as a general-purpose network, built to connect anyone.

Our internet was built to be complex and unbiddable, to do things we cannot yet imagine.

Internet infrastructure is a degraded ecosystem, but it’s also a built environment, like a city. Its unpredictability makes it generative, worthwhile and deeply human.

We need to stop thinking of internet infrastructure as too hard to fix. It’s the underlying system we use for nearly everything we do.

Rewilding the internet connects and grows what people are doing across regulation, standards-setting and new ways of organizing and building infrastructure, to tell a shared story of where we want to go. It’s a shared vision with many strategies. The instruments we need to shift away from extractive technological monocultures are at hand or ready to be built.

Permalink: /feed/rewild-the-internet-farrell-berjon/

Tags: #internet #technology #openweb #personalweb #online

lqdev👽04/25/2024

https://calculusmadeeasy.org/

Calculus Made Easy is a book on calculus originally published in 1910 by Silvanus P. Thompson, considered a classic and elegant introduction to the subject.

Project Gutenberg PDF

Permalink: /feed/calculus-made-easy/

Tags: #calculus #book #math

lqdev👽04/25/2024

https://projects.kwon.nyc/internet-is-fun/

I’ve been meaning to write some kind of Important Thinkpiece™ on the glory days of the early internet, but every time I sit down to do it, I find another, better piece that someone else has already written. So for now, here’s a collection of articles that to some degree answer the question “Why have a personal website?” with “Because it’s fun, and the internet used to be fun.”

This is a great catalog of posts about the personal web. Courtesy of Rachel Kwon

Permalink: /feed/internet-used-to-be-fun-kwon/

Tags: #internet #indieweb #personalweb #online #blogging #blogs

lqdev👽04/25/2024

https://growyourown.services/

This is a site encouraging non-technical people and organisations to create their own online services such as websites, social networks, personal clouds, instant messaging etc.

Permalink: /feed/grow-your-own-services/

Tags: #selfhosting #indieweb #openweb #internet #community #cloud #messaging #website #socialnetworks #websites #online

lqdev👽04/25/2024

https://www.microsoft.com/en-us/research/blog/sammo-a-general-purpose-framework-for-prompt-optimization/

Large language models (LLMs) have revolutionized a wide range of tasks and applications that were previously reliant on manually crafted machine learning (ML) solutions, streamlining through automation. However, despite these advances, a notable challenge persists: the need for extensive prompt engineering to adapt these models to new tasks. New generations of language models like GPT-4 and Mixtral 8x7B advance the capability to process long input texts. This progress enables the use of longer inputs, providing richer context and detailed instructions to language models. A common technique that uses this enhanced capacity is the Retrieval Augmented Generation (RAG) approach. RAG dynamically incorporates information into the prompt based on the specific input example.

To address these challenges, we developed the Structure-Aware Multi-objective Metaprompt Optimization (SAMMO) framework. SAMMO is a new open-source tool that streamlines the optimization of prompts, particularly those that combine different types of structural information like in the RAG example above. It can make structural changes, such as removing entire components or replacing them with different ones. These features enable AI practitioners and researchers to efficiently refine their prompts with little manual effort.

Central to SAMMO’s innovation is its approach to treating prompts not just as static text inputs but as dynamic, programmable entities—metaprompts. SAMMO represents these metaprompts as function graphs, where individual components and substructures can be modified to optimize performance, similar to the optimization process that occurs during traditional program compilation.

The following key features contribute to SAMMO’s effectiveness:

Structured optimization: Unlike current methods that focus on text-level changes, SAMMO focuses on optimizing the structure of metaprompts. This granular approach facilitates precise modifications and enables the straightforward integration of domain knowledge, for instance, through rewrite operations targeting specific stylistic objectives.

Multi-objective search: SAMMO’s flexibility enables it to simultaneously address multiple objectives, such as improving accuracy and computational efficiency. Our paper illustrates how SAMMO can be used to compress prompts without compromising their accuracy.

General purpose application: SAMMO has proven to deliver significant performance improvements across a variety of tasks, including instruction tuning, RAG, and prompt compression.

Permalink: /feed/sammo-prompt-optimization-framework/

Tags: #ai #microsoft #research #prompt #optimization #sammo

lqdev👽04/25/2024

https://amerpie.lol/2024/04/21/in-the-prestreaming.html

Reading this post and attached image brings back so many memories.

I remember in the last few years of Blockbuster and early days of Netflix, for about $20/month you could rent unlimited movies (I think it was up to three at a time).

If you binge watched them or weren't happy with the choices you made, you could just drive down to your local store, return them, and grab a new set of movies.

Initially you had a return period, but towards the end, there were none so you basically got to keep some movies as long as you liked.

Good times.

Permalink: /feed/remembering-90s-video-store-plummer/

Tags: #nostalgia #90s #videos #blockbuster

lqdev👽04/25/2024

https://activitypub.ghost.org/

In 2024, Ghost is adopting ActivityPub and connecting with other federated platforms across the web.

This means that, soon, Ghost publishers will be able to follow, like and interact with one another in the same way that you would normally do on a social network — but on your own website.

The difference, of course, is that you’ll also be able to follow, like, and interact with users on Mastodon, Threads, Flipboard, Buttondown, WriteFreely, Tumblr, WordPress, PeerTube, Pixelfed... or any other platform that has adopted ActivityPub, too. You don’t need to limit yourself to following people who happen to use the same platform as you.

For the past few years the choice has been difficult. Either participate in closed networks at the mercy of algorithms, or set up an independent website at the expense of your growth.

Email gave us private messaging technology that isn’t owned by a single company.

ActivityPub is doing the same for social technology.

The open web is coming back, and with it returns diversity. You can both publish independently and grow faster than ever before with followers from all over the world & the web.

I can't express how much I love this. Personally I don't use Ghost, but given platforms like WordPress and now Ghost are adding support for ActivityPub, it empowers people to build their own platforms.

That said, this still doesn't address the challenges of building your own website, which as mentioned in the post are one of the appealing aspects of current closed networks.

Still though, there is a vast number of creators, businesses, and company websites or blogs that can benefit from this today. When paired with RSS, it gives people choice and autonomy in how they create and consume content as I mentioned in a previous post, Rediscovering the RSS protocol

Permalink: /feed/ghost-activitypub-integration-announcement/

Tags: #ghost #openweb #activitypub #fediverse #blogging #websites #indieweb

lqdev👽04/25/2024

https://github.com/google-deepmind/penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Penzai is a JAX library for writing models as legible, functional pytree data structures, along with tools for visualizing, modifying, and analyzing them. Penzai focuses on making it easy to do stuff with models after they have been trained, making it a great choice for research involving reverse-engineering or ablating model components, inspecting and probing internal activations, performing model surgery, debugging architectures, and more. (But if you just want to build and train a model, you can do that too!)

Screenshot of Google Penzai Neural Network Visualization — Source: *github.com*

Permalink: /feed/google-jax-penzai/

Tags: #google #jax #ai #visualization

lqdev👽04/23/2024

https://azure.microsoft.com/en-us/blog/introducing-phi-3-redefining-whats-possible-with-slms/

Starting today, Phi-3-mini, a 3.8B language model is available on Microsoft Azure AI Studio, Hugging Face, and Ollama.

Phi-3-mini is available in two context-length variants—4K and 128K tokens. It is the first model in its class to support a context window of up to 128K tokens, with little impact on quality.

It is instruction-tuned, meaning that it’s trained to follow different types of instructions reflecting how people normally communicate. This ensures the model is ready to use out-of-the-box.

It is available on Azure AI to take advantage of the deploy-eval-finetune toolchain, and is available on Ollama for developers to run locally on their laptops.

It has been optimized for ONNX Runtime with support for Windows DirectML along with cross-platform support across graphics processing unit (GPU), CPU, and even mobile hardware.

It is also available as an NVIDIA NIM microservice with a standard API interface that can be deployed anywhere. And has been optimized for NVIDIA GPUs.

In the coming weeks, additional models will be added to Phi-3 family to offer customers even more flexibility across the quality-cost curve. Phi-3-small (7B) and Phi-3-medium (14B) will be available in the Azure AI model catalog and other model gardens shortly.

Permalink: /feed/introducing-phi-3/

Tags: #microsoft #phi3 #ai #slm #llm #genai

lqdev👽04/18/2024

https://llama.meta.com/llama3/

Build the future of AI with Meta Llama 3

Now available with both 8B and 70B pretrained and instruction-tuned versions to support a wide range of applications

Llama 3 models take data and scale to new heights. It’s been trained on our two recently announced custom-built 24K GPU clusters on over 15T token of data – a training dataset 7x larger than that used for Llama 2, including 4x more code. This results in the most capable Llama model yet, which supports a 8K context length that doubles the capacity of Llama 2.

Permalink: /feed/introducing-llama-3/

Tags: #meta #ai #llama3 #llama #llm

lqdev👽04/14/2024

http://radiobilingue.org/rb-programas/alterlatino/

While listening to KHOL earlier today, they were rebroadcasting a recording of A Todo Pulmon, a radio show from Radio Bilingue in Fresno, CA. Good stuff.

Permalink: /feed/radio-bilingue-rock-alterlatino/

Tags: #radio #music #rock #alternative #fresno #radiobilingue #khol

lqdev👽04/10/2024

https://proton.me/blog/proton-standard-notes-join-forces

...today, we’re happy to announce that Standard Notes will also join us to advance our shared mission.

Both Proton and Standard Notes share a strong commitment to our communities, so Standard Notes will remain open source, freely available, and fully supported. Prices are not changing, and if you have a current subscription to Standard Notes, it will continue to be honored. Proton aspires to do the right thing and be a responsible home for open-source projects, and just as we did with SimpleLogin, we are committed to preserving what makes Standard Notes special and much loved.

In the coming months, we hope to find ways to make Standard Notes more easily accessible to the Proton community. This way, in addition to protecting your email, calendar, files, passwords, and online activity, you can also protect your notes.

This is another exciting acquisition! I mainly use org-mode in Emacs for note taking. However, I love the ecosystem Proton is building with their security and privacy focused set of collaborative software offerings.

Permalink: /feed/proton-acquires-standard-notes/

Tags: #proton #standardnotes #notetaking #security #privacy #email #notes

lqdev👽04/10/2024

https://arxiv.org/abs/2402.19427

Recurrent neural networks (RNNs) have fast inference and scale efficiently on long sequences, but they are difficult to train and hard to scale. We propose Hawk, an RNN with gated linear recurrences, and Griffin, a hybrid model that mixes gated linear recurrences with local attention. Hawk exceeds the reported performance of Mamba on downstream tasks, while Griffin matches the performance of Llama-2 despite being trained on over 6 times fewer tokens. We also show that Griffin can extrapolate on sequences significantly longer than those seen during training. Our models match the hardware efficiency of Transformers during training, and during inference they have lower latency and significantly higher throughput. We scale Griffin up to 14B parameters, and explain how to shard our models for efficient distributed training.

Permalink: /feed/griffin-mix-linear-recurrence-local-attention-language-models/

Tags: #griffin #ai #research #architecture #rnn #attention #transformers #llm

lqdev👽04/10/2024

https://arxiv.org/abs/2402.09910

How can we detect if copyrighted content was used in the training process of a language model, considering that the training data is typically undisclosed? We are motivated by the premise that a language model is likely to identify verbatim excerpts from its training text. We propose DE-COP, a method to determine whether a piece of copyrighted content was included in training. DE-COP's core approach is to probe an LLM with multiple-choice questions, whose options include both verbatim text and their paraphrases. We construct BookTection, a benchmark with excerpts from 165 books published prior and subsequent to a model's training cutoff, along with their paraphrases. Our experiments show that DE-COP surpasses the prior best method by 9.6% in detection performance (AUC) on models with logits available. Moreover, DE-COP also achieves an average accuracy of 72% for detecting suspect books on fully black-box models where prior methods give ≈ 4% accuracy. Our code and datasets are available at this https URL

Repo

Permalink: /feed/decop-detecting-copyright-llm-training-data/

Tags: #ai #copyright #research #llm #data

lqdev👽04/10/2024

https://arxiv.org/abs/2401.02115

Text-to-SQL models can generate a list of candidate SQL queries, and the best query is often in the candidate list, but not at the top of the list. An effective re-rank method can select the right SQL query from the candidate list and improve the model's performance. Previous studies on code generation automatically generate test cases and use them to re-rank candidate codes. However, automatic test case generation for text-to-SQL is an understudied field. We propose an automatic test case generation method that first generates a database and then uses LLMs to predict the ground truth, which is the expected execution results of the ground truth SQL query on this database. To reduce the difficulty for LLMs to predict, we conduct experiments to search for ways to generate easy databases for LLMs and design easy-to-understand prompts. Based on our test case generation method, we propose a re-rank method to select the right SQL query from the candidate list. Given a candidate list, our method can generate test cases and re-rank the candidate list according to their pass numbers on these test cases and their generation probabilities. The experiment results on the validation dataset of Spider show that the performance of some state-of-the-art models can get a 3.6% improvement after applying our re-rank method.

Permalink: /feed/llm-text-to-sql-right-candidates/

Tags: #ai #research #sql #llm

lqdev👽04/10/2024

https://github.com/google-deepmind/recurrentgemma

RecurrentGemma is a family of open-weights Language Models by Google DeepMind, based on the novel Griffin architecture. This architecture achieves fast inference when generating long sequences by replacing global attention with a mixture of local attention and linear recurrences.

This repository contains the model implementation and examples for sampling and fine-tuning. We recommend most users adopt the Flax implementation, which is highly optimized. We also provide an un-optimized PyTorch implementation for reference.

Permalink: /feed/google-deepmind-recurrent-gemma/

Tags: #ai #gemma #google #llm #opensource #slm #griffin #neuralnetwork

lqdev👽04/09/2024

https://blog.allenai.org/hello-olmo-a-truly-open-llm-43f7e7359222

Today, The Allen Institute for AI (AI2) has released OLMo 7B, a truly open, state-of-the-art large language model released alongside the pre-training data and training code. This empowers researchers and developers to use the best and open models to advance the science of language models collectively.

OLMo-7B on HuggingFace

Permalink: /feed/allen-ai-olmo-llm/

Tags: #allenai #llm #opensource

lqdev👽04/09/2024

https://github.com/karpathy/llm.c

LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython. For example, training GPT-2 (CPU, fp32) is ~1,000 lines of clean code in a single file. It compiles and runs instantly, and exactly matches the PyTorch reference implementation. I chose GPT-2 as the first working example because it is the grand-daddy of LLMs, the first time the modern stack was put together.

Permalink: /feed/llm-c-karpathy/

Tags: #llm #gpt #c #programming #learning #tutorial

lqdev👽04/09/2024

https://www.theverge.com/2024/4/9/24124179/beeper-app-automattic-acquisition-matrix-messaging

Beeper, the upstart messaging app that attempts to corral all your messaging services into one inbox, is being acquired by Automattic, the giant that runs Wordpress.com, Tumblr, and a number of other hugely popular web properties

This is exciting especially given some of the recent developments in the EU. What's most interesting to me is how Beeper leverages open protocols like Matrix and for bridging capabilities where possible to provide secure messaging.

With more people moving to smaller spaces to communicate with their communities, being able to do so in a single place without everyone being on the same platform like in the early days of the internet is a welcome development.

Additional coverage from the Beeper blog.

What we’re announcing today…

No more waitlist – Beeper is now available to everyone!

Beeper has been acquired by Automattic

Our new Android app is out of beta

We’re renaming Beeper Cloud → Beeper (sorry for the confusion)

and Matt Mullenweg's blog

Today the announcement went out that we’re combining the best technology from Beeper and Texts to create a great private, secure, and open source messaging client for people to have control of their communications. We’re going to use the Beeper brand, because it’s fun. This is not unlike how browsers have evolved, where solid tech and encryption on top of an open ecosystem has created untold value for humanity.

A lot of people are asking about iMessage on Android… I have zero interest in fighting with Apple, I think instead it’s best to focus on messaging networks that want more engagement from power-user clients. This is an area I’m excited to work on when I return from my sabbatical next month.

Permalink: /feed/beeper-acquired-automattic/

Tags: #beeper #messaging #automattic #matrix

lqdev👽04/09/2024

https://arxiv.org/abs/2404.01037

Retrieval-Augmented Generation (RAG) is essential for integrating external knowledge into Large Language Model (LLM) outputs. While the literature on RAG is growing, it primarily focuses on systematic reviews and comparisons of new state-of-the-art (SoTA) techniques against their predecessors, with a gap in extensive experimental comparisons. This study begins to address this gap by assessing various RAG methods' impacts on retrieval precision and answer similarity. We found that Hypothetical Document Embedding (HyDE) and LLM reranking significantly enhance retrieval precision. However, Maximal Marginal Relevance (MMR) and Cohere rerank did not exhibit notable advantages over a baseline Naive RAG system, and Multi-query approaches underperformed. Sentence Window Retrieval emerged as the most effective for retrieval precision, despite its variable performance on answer similarity. The study confirms the potential of the Document Summary Index as a competent retrieval approach. All resources related to this research are publicly accessible for further investigation through our GitHub repository ARAGOG (this https URL). We welcome the community to further this exploratory study in RAG systems.

GitHub repo

Permalink: /feed/aragog-advanced-rag-output-grading/

Tags: #rag #ai #research #llm #knowledge #retrieval #retrievalaugmentedgeneration

lqdev👽04/07/2024

https://faroutguides.com/

Outdoor nagivation app for long-distance trails

Permalink: /feed/far-out-guides-hiking-app/

Tags: #outdoors #app #hiking #technology

lqdev👽04/07/2024

https://www.mollywhite.net/blogroll/

Bookmarking for reference.

I'm already subscribed to many of these websites and publications. However, there's several new ones I found that I think will eventually make their rotation into my blogroll.

Permalink: /feed/blogroll-molly-white/

Tags: #blogroll #indieweb #rss #blogging #web #internet #smallweb #community

lqdev👽04/04/2024

https://openai.com/blog/introducing-improvements-to-the-fine-tuning-api-and-expanding-our-custom-models-program

New fine-tuning API features

Today, we’re introducing new features to give developers even more control over their fine-tuning jobs, including:

Epoch-based Checkpoint Creation: Automatically produce one full fine-tuned model checkpoint during each training epoch, which reduces the need for subsequent retraining, especially in the cases of overfitting

Comparative Playground: A new side-by-side Playground UI for comparing model quality and performance, allowing human evaluation of the outputs of multiple models or fine-tune snapshots against a single prompt

Third-party Integration: Support for integrations with third-party platforms (starting with Weights and Biases this week) to let developers share detailed fine-tuning data to the rest of their stack

Comprehensive Validation Metrics: The ability to compute metrics like loss and accuracy over the entire validation dataset instead of a sampled batch, providing better insight on model quality

Hyperparameter Configuration: The ability to configure available hyperparameters from the Dashboard (rather than only through the API or SDK)

Fine-Tuning Dashboard Improvements: Including the ability to configure hyperparameters, view more detailed training metrics, and rerun jobs from previous configurations

Expanding our Custom Models Program

Assisted Fine-Tuning

Today, we are formally announcing our assisted fine-tuning offering as part of the Custom Model program. Assisted fine-tuning is a collaborative effort with our technical teams to leverage techniques beyond the fine-tuning API, such as additional hyperparameters and various parameter efficient fine-tuning (PEFT) methods at a larger scale. It’s particularly helpful for organizations that need support setting up efficient training data pipelines, evaluation systems, and bespoke parameters and methods to maximize model performance for their use case or task.

Custom-Trained Model

In some cases, organizations need to train a purpose-built model from scratch that understands their business, industry, or domain. Fully custom-trained models imbue new knowledge from a specific domain by modifying key steps of the model training process using novel mid-training and post-training techniques. Organizations that see success with a fully custom-trained model often have large quantities of proprietary data—millions of examples or billions of tokens—that they want to use to teach the model new knowledge or complex, unique behaviors for highly specific use cases.

Permalink: /feed/openai-introducing-fine-tuning-improvements/

Tags: #openai #finetuning #llm

lqdev👽04/04/2024

https://txt.cohere.com/command-r-plus-microsoft-azure/

Command R+ is a state-of-the-art RAG-optimized model designed to tackle enterprise-grade workloads, and is available first on Microsoft Azure

Command R+, like our recently launched Command R model, features a 128k-token context window and is designed to offer best-in-class:

Advanced Retrieval Augmented Generation (RAG) with citation to reduce hallucinations

Multilingual coverage in 10 key languages to support global business operations

Tool Use to automate sophisticated business processes

Permalink: /feed/introducing-cohere-comandr-plus/

Tags: #cohere #llm #comandr #azure #comandrplus

lqdev👽04/04/2024

https://github.com/ngruver/llmtime

By encoding time series as a string of numerical digits, we can frame time series forecasting as next-token prediction in text. Developing this approach, we find that large language models (LLMs) such as GPT-3 and LLaMA-2 can surprisingly zero-shot extrapolate time series at a level comparable to or exceeding the performance of purpose-built time series models trained on the downstream tasks. To facilitate this performance, we propose procedures for effectively tokenizing time series data and converting discrete distributions over tokens into highly flexible densities over continuous values. We argue the success of LLMs for time series stems from their ability to naturally represent multimodal distributions, in conjunction with biases for simplicity, and repetition, which align with the salient features in many time series, such as repeated seasonal trends. We also show how LLMs can naturally handle missing data without imputation through non-numerical text, accommodate textual side information, and answer questions to help explain predictions. While we find that increasing model size generally improves performance on time series, we show GPT-4 can perform worse than GPT-3 because of how it tokenizes numbers, and poor uncertainty calibration, which is likely the result of alignment interventions such as RLHF.

Github Repo

Permalink: /feed/llm-time-series-forecasting/

Tags: #forecasting #llm #ai #research

lqdev👽04/04/2024

https://github.com/intel-analytics/ipex-llm

IPEX-LLM is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency

Permalink: /feed/intel-ipex-llm/

Tags: #intel #llm #pytorch #cpu #gpu

lqdev👽04/04/2024

https://arxiv.org/abs/2403.20329

Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds. This context includes both previous turns and context that pertains to non-conversational entities, such as entities on the user's screen or those running in the background. While LLMs have been shown to be extremely powerful for a variety of tasks, their use in reference resolution, particularly for non-conversational entities, remains underutilized. This paper demonstrates how LLMs can be used to create an extremely effective system to resolve references of various types, by showing how reference resolution can be converted into a language modeling problem, despite involving forms of entities like those on screen that are not traditionally conducive to being reduced to a text-only modality. We demonstrate large improvements over an existing system with similar functionality across different types of references, with our smallest model obtaining absolute gains of over 5% for on-screen references. We also benchmark against GPT-3.5 and GPT-4, with our smallest model achieving performance comparable to that of GPT-4, and our larger models substantially outperforming it.

Permalink: /feed/apple-realm-research-paper/

Tags: #ai #research #apple #gpt

lqdev👽04/04/2024

https://stability.ai/news/stable-audio-2-0

Stable Audio 2.0 sets a new standard in AI-generated audio, producing high-quality, full tracks with coherent musical structure up to three minutes in length at 44.1kHz stereo.

The new model introduces audio-to-audio generation by allowing users to upload and transform samples using natural language prompts.

Stable Audio 2.0 was exclusively trained on a licensed dataset from the AudioSparx music library, honoring opt-out requests and ensuring fair compensation for creators.

Permalink: /feed/introducing-stable-audio-2-0/

Tags: #stabilityai #ai #audio #nlp

lqdev👽04/04/2024

https://www.theverge.com/24120122/the-matrix-sequel-drew-goddard

Deadline reports that The Martian writer Drew Goddard has been tapped to pen and direct another Matrix movie executive produced by Lana Wachowski. Currently, the new film has no title or projected premiere date, and there’s been no announcement as to whether franchise stars like Keanu Reeves, Carrie-Anne Moss, Laurence Fishburne, Yahya Abdul-Mateen II, or Jessica Henwick will return.

Not sure how to feel about this, but I'll end up watching anyway.

Permalink: /feed/matrix-five-announced/

Tags: #matrix #movie #scifi #neo #theone

lqdev👽04/02/2024

https://www.theverge.com/2024/4/2/24118873/google-podcasts-shutdown-graveyard

Good article. I felt this way when Google Reader and a few other services were shut down.

That being said, this is kind of a good thing.

Luckily, there are plenty of good podcast apps out there, like Pocket Casts, Overcast, Antennapod, and even Apple Podcasts.

This line basically says it all. Podcasts, like blogging, continue to be an open ecosystem and where the saying,"wherever you get your podcasts", is still going strong.

Permalink: /feed/google-podcasts-gone-youtube-cant-replace-it-verge/

Tags: #google #podcasts #youtube #audio #opensource #rss

lqdev👽04/01/2024

https://openai.com/blog/start-using-chatgpt-instantly

We’re making it easier for people to experience the benefits of AI without needing to sign up.

We may use what you provide to ChatGPT to improve our models for everyone. If you’d like, you can turn this off through your Settings - whether you create an account or not.

We’ve also introduced additional content safeguards for this experience, such as blocking prompts and generations in a wider range of categories.

Permalink: /feed/start-using-chatgpt-seamlessly-no-signup/

Tags: #openai #chatgpt #ai #llm

lqdev👽04/01/2024

https://www.theverge.com/24115039/danger-hiptop-t-mobile-sidekick-jump-button

Bring back the Sidekick! Ayaneo Slide is probably the closest to this today. Would love to see a smaller version of it running on Windows on ARM-based Snapdragon processors.

Before the iPhone, before Android, before webOS, a revolutionary soap bar of a phone made it incredibly easy to get shit done. The Danger Hiptop, better known as the T-Mobile Sidekick, made the internet portable and affordable like no phone before.

Permalink: /feed/tmobile-sidekick-jump-button-mobile-productivity/

Tags: #tmobile #nostalgia #sidekick #tmobile #phones #smartphones #productivity #mobile #cellphones #handhelds

lqdev👽03/31/2024

https://www.databricks.com/blog/announcing-dbrx-new-standard-efficient-open-source-customizable-llms

Today, we are excited to advance our mission by open sourcing DBRX, a general purpose large language model (LLM) built by our Mosaic Research team that outperforms all established open source models on standard benchmarks. We believe that pushing the boundary of open source models enables generative AI for all enterprises that is customizable and transparent.

We are excited about DBRX for three distinct reasons. First, it handily beats open source models, such as, LLaMA2-70B, Mixtral, and Grok-1 on language understanding, programming, math, and logic...

Second, DBRX beats GPT-3.5 on most benchmarks...

Third, DBRX is a Mixture-of-Experts (MoE) model built on the MegaBlocks research and open source project, making the model extremely fast in terms of tokens/second.

Permalink: /feed/announcing-dbrx-llm/

Tags: #databricks #ai #llm #opensource

lqdev👽03/26/2024

https://tianweiy.github.io/dmd/

Our one-step generator achieves comparable image quality with StableDiffusion v1.5 while being 30x faster.

Diffusion models are known to approximate the score function of the distribution they are trained on. In other words, an unrealistic synthetic image can be directed toward higher probability density region through the denoising process (see SDS). Our core idea is training two diffusion models to estimate not only the score function of the target real distribution, but also that of the fake distribution. We construct a gradient update to our generator as the difference between the two scores, essentially nudging the generated images toward higher realism as well as lower fakeness (see VSD). Our method is similar to GANs in that a critic is jointly trained with the generator to minimize a divergence between the real and fake distributions, but differs in that our training does not play an adversarial game that may cause training instability, and our critic can fully leverage the weights of a pretrained diffusion model. Combined with a simple regression loss to match the output of the multi-step diffusion model, our method outperforms all published few-step diffusion approaches, reaching 2.62 FID on ImageNet 64x64 and 11.49 FID on zero-shot COCO-30k, comparable to Stable Diffusion but orders of magnitude faster. Utilizing FP16 inference, our model generates images at 20 FPS on modern hardware.

Paper

Permalink: /feed/one-step-diffusion-distribution-matching-distillation/

Tags: #ai #genai #diffusion #dmd #stablediffusion #dalle

lqdev👽03/26/2024

https://www.404media.co/404-media-now-has-a-full-text-rss-feed/

We paid for the development of full text RSS feeds for Ghost-based publishers. Now we can offer them to our paid subscribers, and other Ghost sites can use the service too.

Our friends Anil Dash and Ernie Smith have recently written passionately and persuasively about the importance of RSS to the open web, and about how a technology that turns 25 years old this month remains both subversive and quite versatile. RSS-based distribution underpins a podcasting ecosystem that has allowed for shows to be distributed not just on Apple Podcasts but on Spotify, Google Podcasts, Pocket Casts, Overcast, and whatever other podcast player you might want to listen on. “Being able to say, ‘wherever you get your podcasts’ is a radical statement,” Dash wrote. “Because what it represents is the triumph of exactly the kind of technology that's supposed to be impossible: open, empowering tech that's not owned by any one company, that can't be controlled by any one company, and that allows people to have ownership over their work and their relationship with their audience.”

RSS has empowered podcasters, but that it needs a “creator economy rethink” for text.

Permalink: /feed/404-media-full-text-rss/

Tags: #rss #404media #ghost #404 #media #protocols #opensource #technology

lqdev👽03/23/2024

https://stability.ai/news/stabilityai-announcement

Earlier today, Emad Mostaque resigned from his role as CEO of Stability AI and from his position on the Board of Directors of the company to pursue decentralized AI.

Permalink: /feed/stability-ai-ceo-resigns/

Tags: #stability #ai #news

lqdev👽03/21/2024

https://arxiv.org/abs/2312.00752

Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module. Many subquadratic-time architectures such as linear attention, gated convolution and recurrent models, and structured state space models (SSMs) have been developed to address Transformers' computational inefficiency on long sequences, but they have not performed as well as attention on important modalities such as language. We identify that a key weakness of such models is their inability to perform content-based reasoning, and make several improvements. First, simply letting the SSM parameters be functions of the input addresses their weakness with discrete modalities, allowing the model to selectively propagate or forget information along the sequence length dimension depending on the current token. Second, even though this change prevents the use of efficient convolutions, we design a hardware-aware parallel algorithm in recurrent mode. We integrate these selective SSMs into a simplified end-to-end neural network architecture without attention or even MLP blocks (Mamba). Mamba enjoys fast inference (5× higher throughput than Transformers) and linear scaling in sequence length, and its performance improves on real data up to million-length sequences. As a general sequence model backbone, Mamba achieves state-of-the-art performance across several modalities such as language, audio, and genomics. On language modeling, our Mamba-3B model outperforms Transformers of the same size and matches Transformers twice its size, both in pretraining and downstream evaluation.

Permalink: /feed/mamba-linear-time-sequence-selective-state-spaces/

Tags: #ai #models #mamba #llm #compute

lqdev👽03/20/2024

https://huggingface.co/blog/Pclanglais/common-corpus

We announce today the release of Common Corpus on HuggingFace:

Common Corpus is the largest public domain dataset released for training LLMs.

Common Corpus includes 500 billion words from a wide diversity of cultural heritage initiatives.

Common Corpus is multilingual and the largest corpus to date in English, French, Dutch, Spanish, German and Italian.

Common Corpus shows it is possible to train fully open LLMs on sources without copyright concerns.

Permalink: /feed/common-corpus-llm-training-dataset/

Tags: #ai #data #llm #huggingface #nlp

lqdev👽03/19/2024

https://huggingface.co/learn/ml-games-course/unit0/introduction

Welcome to the course that will teach you the most fascinating topic in game development: how to use powerful AI tools and models to create unique game experiences.

New AI models are revolutionizing the Game Industry in two impactful ways:

On how we make games:

Generate textures using AI

Using AI voice actors for the voices.

How we create gameplay:

Crafting smart Non-Playable Characters (NPCs) using large language models.

This course will teach you:

How to integrate AI models for innovative gameplay, featuring intelligent NPCs.

How to use AI tools to help your game development pipeline.

Permalink: /feed/machine-learning-for-games-course-huggingface/

Tags: #machinelearning #games #huggingface #course #ai #ml

lqdev👽03/19/2024

https://doc.searls.com/2024/03/19/the-online-local-chronicle/

In the same way that every little place in America used to have a printed newspaper, every little place in America could have an online local chronicle.

Broadly speaking, an online local chronicle is a collection of facts organized mostly in chronological order. The “pages” of the chronicle can be thought of as subsets of a community’s universal timeline of events. These online local chronicles could become the backbone of local news operations.

Nice project. Unfortunately, it's rare you get local news. I like publications / websites like Hoboken Girl and Block Club Chicago. I wish there were more of them in more cities and towns. I know they're there in some forms like Facebook Groups. Even better, it'd be great to have the websites for these publications be the main source of truth that then syndicated their content to the various platforms out there.

Permalink: /feed/online-local-chronicle-searls/

Tags: #community #news #local #knowledgebase #information #web #opensource

lqdev👽03/19/2024

https://huggingface.co/blog/quanto-introduction

Quantization is a technique to reduce the computational and memory costs of evaluating Deep Learning Models by representing their weights and activations with low-precision data types like 8-bit integer (int8) instead of the usual 32-bit floating point (float32).

Today, we are excited to introduce quanto, a versatile pytorch quantization toolkit, that provides several unique features:

available in eager mode (works with non-traceable models)

quantized models can be placed on any device (including CUDA and MPS),

automatically inserts quantization and dequantization stubs,

automatically inserts quantized functional operations,

automatically inserts quantized modules (see below the list of supported modules),

provides a seamless workflow for a float model, going from a dynamic to a static quantized model,

supports quantized model serialization as a state_dict,

supports not only int8 weights, but also int2 and int4,

supports not only int8 activations, but also float8.

Permalink: /feed/quanto-pytorch-quantization-toolkit/

Tags: #huggingface #quantization #pytorch #tools #ai

lqdev👽03/19/2024

https://arxiv.org/abs/2310.04475

Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machine learning interpretability methods. This paper addresses the challenge of making such embeddings more interpretable and broadly useful, by employing Large Language Models (LLMs) to directly interact with embeddings -- transforming abstract vectors into understandable narratives. By injecting embeddings into LLMs, we enable querying and exploration of complex embedding data. We demonstrate our approach on a variety of diverse tasks, including: enhancing concept activation vectors (CAVs), communicating novel embedded entities, and decoding user preferences in recommender systems. Our work couples the immense information potential of embeddings with the interpretative power of LLMs.

Permalink: /feed/demistifying-embedding-spaces-llms/

Tags: #ai #llm #embedding #interpretability

lqdev👽03/18/2024

https://huggingface.co/spaces/Xenova/the-tokenizer-playground

Experiment with different tokenizers (running locally in your browser). I really love doing this thing

Permalink: /feed/tokenizer-playground/

Tags: #ai #tokenizer #huggingface #playground #aiplayground

lqdev👽03/18/2024

https://stability.ai/news/introducing-stable-video-3d

Today we are releasing Stable Video 3D (SV3D), a generative model based on Stable Video Diffusion, advancing the field of 3D technology and delivering greatly improved quality and view-consistency.

This release features two variants: SV3D_u and SV3D_p. SV3D_u generates orbital videos based on single image inputs without camera conditioning. SV3D_p extends the capability by accommodating both single images and orbital views, allowing for the creation of 3D video along specified camera paths.

Stable Video 3D can be used now for commercial purposes with a Stability AI Membership. For non-commercial use, you can download the model weights on Hugging Face and view our research paper here.

Permalink: /feed/introducing-stable-video-3d/

Tags: #stabilityai #ai #video #3d #computervision

lqdev👽03/18/2024

https://www.theverge.com/2024/3/18/24105157/nvidia-blackwell-gpu-b200-ai

Nvidia reveals Blackwell B200 GPU, the ‘world’s most powerful chip’ for AI

‘Built to democratize trillion-parameter AI.’

Nvidia says the new B200 GPU offers up to 20 petaflops of FP4 horsepower from its 208 billion transistors. Also, it says, a GB200 that combines two of those GPUs with a single Grace CPU can offer 30 times the performance for LLM inference workloads while also potentially being substantially more efficient. It “reduces cost and energy consumption by up to 25x” over an H100, says Nvidia.

Permalink: /feed/nvidia-blackwell-b200-gpu/

Tags: #ai #nvidia #gpu #hardware #pchardware #pc

lqdev👽03/18/2024

https://arxiv.org/abs/2403.09611

In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons. For example, we demonstrate that for large-scale multimodal pre-training using a careful mix of image-caption, interleaved image-text, and text-only data is crucial for achieving state-of-the-art (SOTA) few-shot results across multiple benchmarks, compared to other published pre-training results. Further, we show that the image encoder together with image resolution and the image token count has substantial impact, while the vision-language connector design is of comparatively negligible importance. By scaling up the presented recipe, we build MM1, a family of multimodal models up to 30B parameters, consisting of both dense models and mixture-of-experts (MoE) variants, that are SOTA in pre-training metrics and achieve competitive performance after supervised fine-tuning on a range of established multimodal benchmarks. Thanks to large-scale pre-training, MM1 enjoys appealing properties such as enhanced in-context learning, and multi-image reasoning, enabling few-shot chain-of-thought prompting.

Permalink: /feed/mm1-multimodal-llm-pretraining/

Tags: #ai #mm1 #llm #multimodal #apple

lqdev👽03/17/2024

https://blog.langchain.dev/enhancing-rag-based-applications-accuracy-by-constructing-and-leveraging-knowledge-graphs/

A practical guide to constructing and retrieving information from knowledge graphs in RAG applications with Neo4j and LangChain

Graph retrieval augmented generation (Graph RAG) is gaining momentum and emerging as a powerful addition to traditional vector search retrieval methods. This approach leverages the structured nature of graph databases, which organize data as nodes and relationships, to enhance the depth and contextuality of retrieved information.

Permalink: /feed/enhace-rag-application-accuracy-knowledge-graphs/

Tags: #knowledgegraph #rag #genai #langchain #generativeai #retrievalaugmentedgeneration #patterns #applicationpatterns

lqdev👽03/17/2024

https://github.com/lavague-ai/LaVague

Redefining internet surfing by transforming natural language instructions into seamless browser interactions.

Permalink: /feed/lavague-large-action-model/

Tags: #ai #genai #lavague #selenium #web #lam #largeactionmodel

lqdev👽03/17/2024

https://en.algorithmica.org/hpc/

This is an upcoming high performance computing book titled “Algorithms for Modern Hardware” by Sergey Slotin.

Its intended audience is everyone from performance engineers and practical algorithm researchers to undergraduate computer science students who have just finished an advanced algorithms course and want to learn more practical ways to speed up a program than by going from O(nlog⁡n)O(nlogn) to O(nlog⁡log⁡n)O(nloglogn).

Permalink: /feed/algorithms-modern-hardware-book/

Tags: #book #algorithms #hardware

lqdev👽03/17/2024

https://spreadsheets-are-all-you-need.ai/index.html

A low-code way to learn AI - Learn how AI works from a real LLM implemented entirely in Excel

Permalink: /feed/spreadsheets-are-all-you-need/

Tags: #excel #tutorial #ai #genai #tokenizers #transformers

lqdev👽03/17/2024

https://engineering.fb.com/2024/03/12/data-center-engineering/building-metas-genai-infrastructure/

Marking a major investment in Meta’s AI future, we are announcing two 24k GPU clusters. We are sharing details on the hardware, network, storage, design, performance, and software that help us extract high throughput and reliability for various AI workloads. We use this cluster design for Llama 3 training.

We are strongly committed to open compute and open source. We built these clusters on top of Grand Teton, OpenRack, and PyTorch and continue to push open innovation across the industry.

This announcement is one step in our ambitious infrastructure roadmap. By the end of 2024, we’re aiming to continue to grow our infrastructure build-out that will include 350,000 NVIDIA H100 GPUs as part of a portfolio that will feature compute power equivalent to nearly 600,000 H100s.

Permalink: /feed/building-meta-genai-infrastructure/

Tags: #meta #ai #infrastraucture #generativeai #genai #facebook

lqdev👽03/17/2024

https://github.com/openai/transformer-debugger

Transformer Debugger (TDB) is a tool developed by OpenAI's Superalignment team with the goal of supporting investigations into specific behaviors of small language models. The tool combines automated interpretability techniques with sparse autoencoders.

TDB enables rapid exploration before needing to write code, with the ability to intervene in the forward pass and see how it affects a particular behavior. It can be used to answer questions like, "Why does the model output token A instead of token B for this prompt?" or "Why does attention head H attend to token T for this prompt?" It does so by identifying specific components (neurons, attention heads, autoencoder latents) that contribute to the behavior, showing automatically generated explanations of what causes those components to activate most strongly, and tracing connections between components to help discover circuits.

Permalink: /feed/openai-transformer-debugger/

Tags: #openai #ai #transformer #debugger #tools

lqdev👽03/17/2024

https://www.tonyduan.com/diffusion/index.html

Here, we'll cover the derivations from scratch to provide a rigorous understanding of the core ideas behind diffusion. What assumptions are we making? What properties arise as a result?

A reference [codebase] is written from scratch, which provides minimalist re-production of the MNIST example below. It clocks in at under 500 lines of code.

Each page takes up to an hour to read thoroughly. Approximately a lecture each.

Permalink: /feed/diffusion-models-from-scratch-duan/

Tags: #diffusion #ai #concepts #models

lqdev👽03/17/2024

https://www.chenyang.co/diffusion.html

This tutorial aims to introduce diffusion models from an optimization perspective as introduced in our paper (joint work with Frank Permenter). It will go over both theory and code, using the theory to explain how to implement diffusion models from scratch. By the end of the tutorial, you will learn how to implement training and sampling code for a toy dataset, which will also work for larger datasets and models.

Permalink: /feed/diffusion-models-from-scratch-yuan/

Tags: #diffusion #ai #concepts #models

lqdev👽03/17/2024

https://arxiv.org/abs/2311.12224

We introduce a new algorithm called the Free-pipeline Fast Inner Product (FFIP) and its hardware architecture that improve an under-explored fast inner-product algorithm (FIP) proposed by Winograd in 1968. Unlike the unrelated Winograd minimal filtering algorithms for convolutional layers, FIP is applicable to all machine learning (ML) model layers that can mainly decompose to matrix multiplication, including fully-connected, convolutional, recurrent, and attention/transformer layers. We implement FIP for the first time in an ML accelerator then present our FFIP algorithm and generalized architecture which inherently improve FIP's clock frequency and, as a consequence, throughput for a similar hardware cost. Finally, we contribute ML-specific optimizations for the FIP and FFIP algorithms and architectures. We show that FFIP can be seamlessly incorporated into traditional fixed-point systolic array ML accelerators to achieve the same throughput with half the number of multiply-accumulate (MAC) units, or it can double the maximum systolic array size that can fit onto devices with a fixed hardware budget. Our FFIP implementation for non-sparse ML models with 8 to 16-bit fixed-point inputs achieves higher throughput and compute efficiency than the best-in-class prior solutions on the same type of compute platform.

GitHub

Permalink: /feed/fast-inner-product-dnn-accelerators/

Tags: #ai #math #acceleration #neuralnetworks #dnn

lqdev👽03/17/2024

https://github.com/xai-org/grok-1

This repository contains JAX example code for loading and running the Grok-1 open-weights model.

Permalink: /feed/grok-1-open-release/

Tags: #grok #ai #x #github #opensource #moe

lqdev👽03/17/2024

https://ollama.com/blog/amd-preview

Ollama now supports AMD graphics cards in preview on Windows and Linux. All the features of Ollama can now be accelerated by AMD graphics cards on Ollama for Linux and Windows.

Permalink: /feed/ollama-supports-amd-gpus/

Tags: #ollama #gpu #amd #ai #llm #opensource

lqdev👽03/15/2024

https://hugo.blog/2024/03/11/vision-pro/

Friends and colleagues have been asking me to share my perspective on the Apple Vision Pro as a product.

This started as blog post and became an essay before too long, so I’ve structured my writing in multiple sections each with a clear lead to make it a bit easier to digest — peppered with my own ‘takes’. I’ve tried to stick to original thoughts for the most part and link to what others have said where applicable.

Some of the topics I touch on:

Why I believe Vision Pro may be an over-engineered “devkit”

The genius & audacity behind some of Apple’s hardware decisions

Gaze & pinch is an incredible UI superpower and major industry ah-ha moment

Why the Vision Pro software/content story is so dull and unimaginative

Why most people won’t use Vision Pro for watching TV/movies

Apple’s bet in immersive video is a total game-changer for Live Sports

Why I returned my Vision Pro… and my Top 10 wishlist to reconsider

Apple’s VR debut is the best thing that ever happened to Oculus/Meta

My unsolicited product advice to Meta for Quest Pro 2 and beyond

Permalink: /feed/apple-vision-pro-barra/

Tags: #apple #visionpro #vr #ar

lqdev👽03/15/2024

https://bsky.social/about/blog/03-12-2024-stackable-moderation

Bluesky was created to put users and communities in control of their social spaces online. The first generation of social media platforms connected the world, but ended up consolidating power in the hands of a few corporations and their leaders. Our online experience doesn’t have to depend on billionaires unilaterally making decisions over what we see. On an open social network like Bluesky, you can shape your experience for yourself.

Today, we’re excited to announce that we’re open-sourcing Ozone, our collaborative moderation tool. With Ozone, individuals and teams can work together to review and label content across the network. Later this week, we’re opening up the ability for you to run your own independent moderation services, seamlessly integrated into the Bluesky app. This means that you'll be able to create and subscribe to additional moderation services on top of what Bluesky requires, giving you unprecedented control over your social media experience.

Permalink: /feed/bluesky-stackable-approach-moderation/

Tags: #moderation #community #bluesky #atprotocol #socialmedia #internet #web

lqdev👽03/15/2024

https://proton.me/blog/proton-mail-desktop-app

Today, we’re excited to broaden the horizons of secure communication by launching the Proton Mail desktop app. Anyone can now use the new Proton Mail desktop app for Windows and macOS, with a Linux version in beta.

With the new Proton Mail desktop apps, you get a dedicated email experience, allowing you to enjoy all the productivity innovations of our web app, allowing you to go through your emails and events faster without the potential distractions that pop up anytime you open your browser. And, of course, your privacy remains protected at all times.

Permalink: /feed/introducing-protonmail-desktop-app/

Tags: #email #proton #desktop #app #privacy #security

lqdev👽03/14/2024

https://huyenchip.com//2024/03/14/ai-oss.html

So many cool ideas are being developed by the community. Here are some of my favorites.

Batch inference optimization: FlexGen, llama.cpp

Faster decoder with techniques such as Medusa, LookaheadDecoding

Model merging: mergekit

Constrained sampling: outlines, guidance, SGLang

Seemingly niche tools that solve one problem really well, such as einops and safetensors.

Permalink: /feed/learned-from-looking-900-oss-ai-tools-huyen/

Tags: #ai #tools #analysis #openai #github

lqdev👽03/08/2024

https://www.answer.ai/posts/2024-03-06-fsdp-qlora.html

Today, we’re releasing Answer.AI’s first project: a fully open source system that, for the first time, can efficiently train a 70b large language model on a regular desktop computer with two or more standard gaming GPUs (RTX 3090 or 4090). This system, which combines FSDP and QLoRA, is the result of a collaboration between Answer.AI, Tim Dettmers (U Washington), and Hugging Face’s Titus von Koeller and Sourab Mangrulkar.

Permalink: /feed/answer-ai-train-70b-llm-at-home/

Tags: #ai #llm #qlora #gpu #nvidia #finetuning

lqdev👽03/07/2024

https://jxnl.github.io/blog/writing/2024/02/28/levels-of-complexity-rag-applications/

This post comprehensive guide to understanding and implementing RAG applications across different levels of complexity. Whether you're a beginner eager to learn the basics or an experienced developer looking to deepen your expertise, you'll find valuable insights and practical knowledge to help you on your journey. Let's embark on this exciting exploration together and unlock the full potential of RAG applications.

Permalink: /feed/levels-of-complexity-rag-applications/

Tags: #rag #ai #llm #architecture #app

lqdev👽03/07/2024

https://www.zdnet.com/article/5-reasons-why-desktop-linux-is-finally-growing-in-popularity/

StatCounter reported that desktop Linux reached over 4% market share for the first time.

Why is Linux finally growing?

While Windows is the king of the hill with 72.13% and MacOS comes in a distant second at 15.46%, it's clear that Linux is making progress.

Microsoft isn't that interested in Windows

Linux gaming, thanks to Steam, is also growing

Users are finally figuring out that some Linux distros are easy to use

Finding and installing Linux desktop software is easier than ever

The Linux desktop is growing in popularity in India

Permalink: /feed/year-of-linux-desktop-4-percent/

Tags: #linux #pc #desktop #desktop

lqdev👽03/07/2024

https://inflection.ai/inflection-2-5

At Inflection, our mission is to create a personal AI for everyone. Last May, we released Pi—a personal AI, designed to be empathetic, helpful, and safe. In November we announced a new major foundation model, Inflection-2, the second best LLM in the world at the time.

Now we are adding IQ to Pi’s exceptional EQ.

We are launching Inflection-2.5, our upgraded in-house model that is competitive with all the world's leading LLMs like GPT-4 and Gemini. It couples raw capability with our signature personality and unique empathetic fine-tuning. Inflection-2.5 is available to all Pi's users today, at pi.ai, on iOS, on Android, or our new desktop app.

We achieved this milestone with incredible efficiency: Inflection-2.5 approaches GPT-4’s performance, but used only 40% of the amount of compute for training.

Permalink: /feed/inflection-2-5/

Tags: #ai #llm #inflection

lqdev👽03/07/2024

https://slack.com/blog/news/the-surprising-connection-between-after-hours-work-and-decreased-productivity

Quick take: How do you spend your time at work and what is it costing you? Slack’s Workforce Index, based on survey responses from more than 10,000 desk workers around the globe, uncovers new findings on how to structure the workday to maximize productivity and strengthen employee well-being and satisfaction.

Key learnings include:

Employees who log off at the end of the workday register 20% higher productivity scores than those who feel obligated to work after hours.

Making time for breaks during the workday improves employee productivity and well-being, and yet half of all desk workers say they rarely or never take breaks.

On average, desk workers say that the ideal amount of focus time is around four hours a day, and more than two hours a day in meetings is the tipping point at which a majority of workers feel overburdened by meetings.

Three out of every four desk workers report working in the 3 to 6pm timeframe, but of those, only one in four consider these hours highly productive.

Permalink: /feed/surprising-connection-after-hours-work-productivity-slack/

Tags: #slack #productivity #work

lqdev👽03/07/2024

https://www.yitay.net/blog/training-great-llms-entirely-from-ground-zero-in-the-wilderness

Given that we’ve successfully trained pretty strong multimodal language models at Reka, many people have been particularly curious about the experiences of building infrastructure and training large language & multimodal models from scratch from a completely clean slate.

I complain a lot about external (outside Google) infrastructure and code on my social media, leading people to really be curious about what are the things I miss and what I hate/love in the wilderness. So here’s a post (finally). This blogpost sheds light on the challenges and lessons learned.

Figuring out things in the wilderness was an interesting experience. It was unfortunately not painless. Compute scarcity and also unreliable compute providers made things significantly harder than expected but we’re glad we pulled through with brute technical strength.

All in all, this is only a small part of the story of how we started a company, raised some money, bought some chips and matched Gemini pro/GPT 3.5 and outperformed many others in less than a year having to build everything from scratch.

Permalink: /feed/training-llms-ground-up-wilderness-startup/

Tags: #llm #ai #startup #compute #gpu #training

lqdev👽03/06/2024

https://tylerhou.com/posts/datalog-go-brrr/

The datatype for a graph is a relation, and graph algorithms are queries on the relation. But modern languages need better support for the relational model.

This post is a response to/inspired by The Hunt for the Missing Data Type (HN) by Hillel Wayne. I suggest reading his article first.

I claim the reason why it is so difficult to support graphs in languages nowadays is because the imperative/structured programming model of modern programming languages is ill-suited for graph algorithms. As Wayne correctly points out, the core problem is that when you write a graph algorithm in an imperative language like Python or Rust, you have to choose some explicit representation for the graph. Then, your traversal algorithm is dependent on the representation you chose. If you find out later that your representation is no longer efficient, it is a lot of work to adapt your algorithms for a new representation.

So what if we just, like, didn’t do this?

We already have a declarative programming language where expressing graph algorithms is extremely natural—Datalog, whose semantics are based on* the relational algebra, which was developed in the 1970s.

Wonderful! Except for the “writing Datalog” part.

If Datalog is so great, why hasn’t it seen more adoption?

The short answer is that Datalog is relatively esoteric outside of academia and some industry applications and, as a result, is not a great language from a “software engineering” perspective. It is hard for programmers accustomed to imperative code to write Datalog programs, and large Datalog programs can be hard to write and understand.

Permalink: /feed/datalog-the-missing-graph-data-type-already-exists/

Tags: #datalog #programming #graphs #datatypes #programminglanguages #software #database #algorithms

lqdev👽03/05/2024

https://www.theverge.com/2024/3/5/24091555/apple-podcasts-transcripts-ios-17-4-update

Apple Podcasts will auto-generate transcripts for podcasts beginning today, thanks to the 17.4 update for iPhones and iPads. Transcripts will automatically appear for new podcast episodes shortly after their publication, while Apple will transcribe podcast back catalogs over time.
The podcast transcripts are searchable, allowing users to type in a specific word or phrase and skip to that part of an episode. Users can find transcripts for individual podcast episodes on the bottom-left corner of the “Now Playing” screen.
Podcasters who don’t want to use Apple’s automated transcription can opt to upload their own transcripts via RSS tags or in Apple Podcasts Connect for premium episodes, or they can download and edit Apple’s transcript before reuploading.

This is cool and great for accessibility.

I recently chose to read the latest episode of Decoder instead of listening to it. One of the advantages this also provided was that I could reference direct quotes in my post from the episode.

I could see Apple taking this further by making it easier to generate show notes / descriptions based on the episode using AI.

Permalink: /feed/apple-podcasts-transcript-17-4/

Tags: #iphone #apple #podcasts #ios #ipados

lqdev👽03/05/2024

https://www.theverge.com/2024/3/5/24091370/microsoft-windows-11-android-apps-end-of-support

Microsoft is ending support for its Android subsystem in Windows 11 next year. The software giant first announced it was bringing Android apps to Windows 11 with Amazon’s Appstore nearly three years ago, but this Windows Subsystem for Android will now be deprecated starting March 5th, 2025.

That's unfortunate considering the new lineup of ARM-based PCs expected later this year. It would've been nice to have a mobile PC with 5G support that could run mobile apps for scenarios where there are no web / native PC apps.

Permalink: /feed/windows-subsystem-for-android-end-of-support-2025/

Tags: #windows #android #wsa #pc

lqdev👽03/05/2024

https://stability.ai/news/stable-diffusion-3-research-paper

Key Takeaways:

Today, we’re publishing our research paper that dives into the underlying technology powering Stable Diffusion 3.

Stable Diffusion 3 outperforms state-of-the-art text-to-image generation systems such as DALL·E 3, Midjourney v6, and Ideogram v1 in typography and prompt adherence, based on human preference evaluations.

Our new Multimodal Diffusion Transformer (MMDiT) architecture uses separate sets of weights for image and language representations, which improves text understanding and spelling capabilities compared to previous versions of SD3.

Permalink: /feed/stable-diffusion-3-research-paper/

Tags: #stabilityai #stablediffusion #ai #generativeai #genai #images

lqdev👽03/04/2024

https://www.theverge.com/2024/3/4/24090095/wix-ai-website-generator-chatbot

You can now build a website, images and all, using only prompts in Wix’s new AI website builder. Creating a website is free, but you’ll have to upgrade to one of Wix’s premium plans if you want to do things like accept payments or don’t want to be limited to using a Wix domain name.

You’d probably need to delve into Wix’s advanced editing features and know things about actual web development for that. But it was very easy to use the basic AI generator to create something that looks close to a legitimate site to start with, making it much easier to get to a basic starting point.

Permalink: /feed/wix-website-builder-ai-chatbot/

Tags: #wix #ai #websites #internet #web

lqdev👽03/04/2024

https://www.theverge.com/24080426/smart-home-tech-matter-pets-kitchen-hubs-how-to

The Verge team and others share their experiences of how smart technologies affect their lives — how it can often help and sometimes frustrate.

In these articles, we’ve concentrated on how our own experiences, and the experiences of others, have affected how we regard smart home tech. We’ve got personal accounts by one reporter who decided to put together a brand-new smart home and another whose brother moved into a home haunted by the ghosts of someone else’s smart tech. Several of our staffers wax enthusiastically about their favorite devices and automations. A writer describes how smart tech makes his home more accessible. Our smart home reviewer tells how she uses technology to keep her varied pets (and she has a lot of them) happy and healthy. We talk to people who use smart devices to help them care for their parents — and more.

Permalink: /feed/todays-smart-homes-hopes-realities-verge/

Tags: #iot #verge #smarthome #technology

lqdev👽03/04/2024

https://www.theverge.com/24087834/hank-green-decoder-podcast-google-youtube-web-media-platforms-distribution-future

...the last platform on the web of any scale or influence is Google Search. And so, over time, webpages have become dramatically optimized for Google Search. And that means the kinds of things people write about, the containers that we write in, are mostly designed to be optimized for Google Search. They’re not designed for, “I need to just quickly tell you about this and move on.” Our little insight was, “Well, what if we just don’t do that? What if we only write for the people who come directly to our website instead of the people who find our articles through Search or Google Discover or whatever other Google platforms are in the world?” And so we just made these little blog posts, and the idea was, if you just come to our website one more time a day because there’s one more thing to look at that you’ll like, we will be fine.

more and more people are starting to realize, “Oh, we should just make the websites more valuable.

...if you start writing for other people, which is the heart of what a blog post really is: it’s you trying to entertain yourself and trying to entertain just a handful of other people, you’re going to go really much farther than trying to satisfy the robot.

Why am I writing in the text box that pays money to Elon and Mark [Zuckerberg] and not my text box?

Why do we all work for free? Look, we want to talk about the platform era and media. Why do we all work for free?

...It’s very confusing, and there are a lot of reasons. If you just sit back and think about why, there are a million reasons why.

One, the software is nicer to use than most CMSes. You just pick one. Name a company that makes a CMS. They’re like, “Is this as fun to use as Twitter?” And the answer is no. Flatly no. Even the one we have now for quick posts is not as fun to use as Twitter was in its heyday. Will this immediately bring me the dopamine hit of immediate feedback? No.

[When redesigning the website]...the first instinct was, “Let’s at least make it easier to publish. Let’s at least remove the barriers to entry to getting on the website, and then we can do comments, and then we can think about how we can distribute in different ways.” So that is working. My team is happier. We did not know that the Twitter thing would happen, but the Twitter thing happened, and our desire to publish in the boxes we controlled went up as a group. And then, on top of it, our audience saw that we were having fun. And once you are having fun anywhere on the internet, people sort of gravitate to you. So traffic has gone up.

The distribution actually just creates the work or creates the pressures that force all the work to be the same. And I think over time that’s what drives the audiences away. So there’s a real change in how these platforms work, where, over time, they just become more and more of the same thing and the creators become more and more the same. And that’s a little exhausting. And every place where you see open distribution, you see a huge variety of creators and content.

Podcasts have basically open distribution. Like podcast or distributor RSS feeds, that means people kind of own their distribution, there’s a vast array of podcast creators. There’s a vast array of podcast formats. They don’t all sound like the beginning of YouTube videos or whatever. And I hate to keep picking on YouTube; you can pick any algorithmic platform, and it’s the same. TikTokers are more the same than different. Podcasters are more different than the same. The web is distributed largely through websites and through RSS. There’s a huge variety of websites and the way websites look. But then you see the algorithmic search pressure push web design kind of all under the same box.

Newsletters distributed by email: open distribution. The newsletter economy is full of a huge variety of creators doing a huge variety of things. They’re more different than the same. So all I see with the fediverse is, “Oh, this is going to open social distribution up a little bit.” It’s going to allow us to control our distribution networks. It’s going to say, “I’m not on Twitter, but people on Twitter can follow my website, and I can go promote that follow anywhere I want in different ways and build an audience outside of the pressures of the algorithm.” To me, just that, that ability to try, is 1 percent better.
If you’re me and you run a big website and you are thinking, “How can I redistribute this website, how can I reach people more directly?” my brain is lit up. You should be able to follow me at TheVerge.com and see all my quick posts in your Threads account when Threads federates.

Permalink: /feed/decoder-nilay-patel-why-websites-are-the-future/

Tags: #websites #internet #podcast #decoder #rss #platforms #blogs #technology

lqdev👽03/04/2024

https://github.com/google/gemma_pytorch

Gemma is a family of lightweight, state-of-the art open models built from research and technology used to create Google Gemini models. They are text-to-text, decoder-only large language models, available in English, with open weights, pre-trained variants, and instruction-tuned variants.

This is the official PyTorch implementation of Gemma models. We provide model and inference implementations using both PyTorch and PyTorch/XLA, and support running inference on CPU, GPU and TPU.

Permalink: /feed/gemma-pytorch/

Tags: #google #gemma #pytorch #ai #llm

lqdev👽03/04/2024

https://www.anthropic.com/news/claude-3-family

Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in ascending order of capability: Claude 3 Haiku, Claude 3 Sonnet, and Claude 3 Opus. Each successive model offers increasingly powerful performance, allowing users to select the optimal balance of intelligence, speed, and cost for their specific application.

Claude 3 Opus is our most intelligent model, with best-in-market performance on highly complex tasks. It can navigate open-ended prompts and sight-unseen scenarios with remarkable fluency and human-like understanding. Opus shows us the outer limits of what’s possible with generative AI.

Claude 3 Sonnet strikes the ideal balance between intelligence and speed—particularly for enterprise workloads. It delivers strong performance at a lower cost compared to its peers, and is engineered for high endurance in large-scale AI deployments.

Claude 3 Haiku is our fastest, most compact model for near-instant responsiveness. It answers simple queries and requests with unmatched speed. Users will be able to build seamless AI experiences that mimic human interactions.

Permalink: /feed/announcing-claude-3-model-family/

Tags: #ai #anthropic #claude #llm

lqdev👽03/02/2024

https://ente.io/

Store, share, and rediscover your memories with absolute privacy

Permalink: /feed/ente-private-media-cloud-storage/

Tags: #e2ee #photos #cloud #storage #product

lqdev👽02/29/2024

https://vickiboykis.com/2024/02/28/gguf-the-long-way-around/

We’ve been on a whirlwind adventure to build up our intuition of how machine learning models work, what artifacts they produce, how the machine learning artifact storage story has changed over the past couple years, and finally ended up in GGUF’s documentation to better understand the log that is presented to us when we perform local inference on artifacts in GGUF.

Permalink: /feed/gguf-long-way-around/

Tags: #ai #genai #gguf #llm #opensource

lqdev👽02/29/2024

https://huyenchip.com//2024/02/28/predictive-human-preference.html

Human preference has emerged to be both the Northstar and a powerful tool for AI model development. Human preference guides post-training techniques including RLHF and DPO. Human preference is also used to rank AI models, as used by LMSYS’s Chatbot Arena.

Chatbot Arena aims to determine which model is generally preferred. I wanted to see if it’s possible to do predictive human preference: determine which model is preferred for each query.

This post first discusses the correctness of Chatbot Arena, which will then be used as a baseline to evaluate the correctness of preference predictions. It then discusses how to build a preference predictor and the initial results.

Permalink: /feed/predictive-human-preference-model-ranking-routing/

Tags: #ai #llm #evaluation #ml

lqdev👽02/28/2024

https://arxiv.org/abs/2402.17764

Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single parameter (or weight) of the LLM is ternary {-1, 0, 1}. It matches the full-precision (i.e., FP16 or BF16) Transformer LLM with the same model size and training tokens in terms of both perplexity and end-task performance, while being significantly more cost-effective in terms of latency, memory, throughput, and energy consumption. More profoundly, the 1.58-bit LLM defines a new scaling law and recipe for training new generations of LLMs that are both high-performance and cost-effective. Furthermore, it enables a new computation paradigm and opens the door for designing specific hardware optimized for 1-bit LLMs.

Permalink: /feed/era-1-bit-llms/

Tags: #llm #ai #bitnet #research

lqdev👽02/28/2024

https://www.theverge.com/2024/2/28/24085869/tubi-redesign-shows-movies-turple

Tubi says its new ‘turple’-forward brand identity is all about encouraging viewers to fall down rabbit holes to find exciting shows and movies to watch.

I've often found good content to watch on Tubi. Sure, they're not the latest blockbusters but there are decades worth of movies out there. Just like today, not all movies that come out are good but there's still tons of great content. Sometimes, there's even overlooked gems.

Permalink: /feed/tubi-redesign-down-the-rabbit-hole/

Tags: #tv #movies #tubi #streaming

lqdev👽02/28/2024

http://www.eastgate.com/garden/Enter.html

The time, care, and expense devoted to creating and promoting a hypertext are lost if readers arrive, glance around, and click elsewhere. How can the craft of hypertext invite readers to stay, to explore, and to reflect?

Permalink: /feed/hypertext-gardens-delightful-vistas/

Tags: #openweb #digitalgarden #indieweb

lqdev👽02/28/2024

https://www.theverge.com/24084772/celebrities-no-phone-bieber-sheeran-cruise-cera-ipad

...[phone-free celebs are] not trying to disconnect from everyone, but they are trying to get away from that feeling of being tapped constantly on the shoulder by all the calls, texts, and emails.

So many celebrities ditch their phone, disconnect from their social media, log off entirely.

A few years ago, Ed Sheeran shared a strategy...He hasn’t had a phone since 2015...Being phoneless hadn’t cut his contact to the world, Sheeran said, just reduced it — and that was the point. “I have friends email and people email, and every few days I’ll sit down and open up my laptop, and I’ll answer 10 emails at a time,” he said. “I’ll send them off, and I close my laptop, and then that’ll be it. And then I’ll go back to living my life, and I don’t feel overwhelmed by it.”

Read and watch enough celebrity interviews, and the lesson becomes obvious: that the most powerful and connected device in your life shouldn’t be within arm’s reach at all times. All that does is invite distraction and makes it too easy to disengage from your life every time you get bored or sad or curious even for a second.

It sounds a little like I’m advocating for the return of the ’90s, when the computer was a giant box that lived in a central room of your home and the only way to use it was to go to it. And to some extent, I am! I’m increasingly convinced that my primary computer should be a device I use on purpose — that I sit down at, operate, and then extract myself from until the next time. Whether it’s a laptop on a desk or an iPad on your nightstand, your computer should be a place as much as it is a device. And when you’re not in that place, you’re somewhere else. The computer doesn’t come along.

Over the last few weeks, as an experiment, I’ve moved as many apps as possible — the obviously distracting social media stuff but also anything I can live without on a minute-to-minute basis — off my phone and onto my tablet and my laptop...

So far, it’s been great. I’m realizing how much of a crutch my phone really has become: I would open up TikTok just to keep me company on the walk to the kitchen or scroll through Threads while I waited for the microwave to finish. Now, I’m not sure I’m doing any less of those things in aggregate, but at least I’m doing them on purpose. I’ve turned time-wasting into a deliberate activity — I sit in my scrolling chair and scroll away, then I get up, and the scrolling stays put. And best of all, when I leave the house, there’s nothing to scroll at all.

There has always been talk in tech about removing friction: the obsessive corporate desire to make everything easier, faster, fewer clicks, fewer chances for you to decide not to click that ad or buy that thing or like that post or upload that photo...It should be a little harder for someone to distract me while I’m eating dinner with my wife or hanging out with my kid.

It’s not about ditching technology, just about doing technology on purpose.

Permalink: /feed/celebs-how-to-use-phone/

Tags: #technology #communication #phones #smartphones #dumbphones #email #celebrities

lqdev👽02/27/2024

https://www.404media.co/tumblr-and-wordpress-to-sell-users-data-to-train-ai-tools/

Tumblr and WordPress.com are preparing to sell user data to Midjourney and OpenAI, according to a source with internal knowledge about the deals and internal documentation referring to the deals

The internal documentation details a messy and controversial process within Tumblr itself. One internal post made by Cyle Gage, a product manager at Tumblr, states that a query made to prepare data for OpenAI and Midjourney compiled a huge number of user posts that it wasn’t supposed to. It is not clear from Gage’s post whether this data has already been sent to OpenAI and Midjourney, or whether Gage was detailing a process for scrubbing the data before it was to be sent.

I generally enjoy what Automattic does for the web as a whole. However, if these claims are true, it's unfortunate. I believe there's a way to opt-out, but I'd love to learn more before jumping to conclusions.

That said, WordPress (.com not .org) and Tumblr are platforms just like Reddit, Twitter, and the Meta set of offerings. I'm sure somewhere in their Terms of Service, there's some clauses around their ownserhip of the data you published on their platforms and just like it's sold to data brokers and advertisers, they can also sell it to companies training AI models.

To counter these types of moves from platforms, I wish it were as easy as saying "build your own platform". Doing so can be as "simple" as setting up a website using your own domain. Unfortunately, today, it's still not as easy to do and one of the products / companies that help you do that is WordPress. I think it's important though to note the distinction there between WordPress the company and WordPress the technology. Another piece that complicates building your own site is, there's still other ways for companies training AI models to use data that's publicly available on the internet. These are the arguments that are currently being litigated in several legal cases. Maybe there are opportunities to explore a robots.txt for AI.

AI models need high quality data that's representative and as close as possible to the real world in order to improve. There is a role here for synthetic data. High quality synthetic data is behind groundbreaking models like Microsoft's Phi. Instincts tell me that synthetic data can only go so far though and real data is still needed. In that case, as an AI consumer who makes use of these AI models, but don't want to contribute my data, do I have a responsibility to contribute my data to improve the systems that I use? Piracy aside, in some ways it reminds me of torrenting. You usually run into scenarios where there are multiple people downloading a file. However, there's only a handful of seeders who once obtained, make the file available for others to download. There are also additional considerations such as how people are compensated for contributing their data to these systems. It's important to note that this is not a new problem and people had been thinking about this though in different contexts. Maybe it's time to reconsider ideas like inverse privacy and data dignity.

There are no clear answers here and there are a lot of things to consider. However, it's comforting that as a society we're having these conversations.

Permalink: /feed/automattic-ai-data-tumblr-wordpress/

Tags: #tumblr #automattic #wordpress #ai

lqdev👽02/27/2024

https://www.windowscentral.com/software-apps/the-latest-microsoft-copilot-update-on-android-makes-me-mourn-the-death-of-cortana

Microsoft Copilot will soon be able to be your default assistant app on Android.

It's a shame that Cortana never worked out for Microsoft. If things had lined up differently, we may have seen Copilot gain access to smart devices and commands like Gemini has with Google Assistant (though that setup isn't perfect). While Copilot has a place on a computer, I think an assistant on your smartphone needs to be able to do more day-to-day tasks.

So many things were ahead of their time. I just want a Windows Mobile PC (Windows Phone?) with an LLM-backed Cortana.

Permalink: /feed/copilot-android-update-mourn-cortana-death/

Tags: #cortana #copilot #ai #microsoft #virtualassistant #assistant #chatbot

lqdev👽02/27/2024

https://www.theverge.com/entertainment/24054458/physical-media-preservation-discs-cartridges-digital

The bright promise of streaming and digital stores has given way to a darker reality: we rarely have ownership over the art we love, and much is getting lost in the process. Only a fraction of movies released over the last century are available on streaming services, while a staggering 90 percent of classic video games are considered “critically endangered” by archivists. As these platforms continue to dominate the media landscape, a whole lot of cultural history is being abandoned.

In this special issue, The Verge will explore how physical media factors into this and its importance in keeping art alive and accessible. That could mean boutique publishers releasing beautiful special editions of games and movies, foundations dedicated to preserving the physical history of video games, or musicians releasing their latest albums on floppy discs. We’ll also be looking at some cautionary tales in the shift to subscription services and offering tips on building bookshelf-worthy collections.

Cartridges and discs have been hurtling toward obsolescence — but it turns out, they may be more important than ever.

Permalink: /feed/verge-physical-media-week/

Tags: #media #verge #disc #dvd #cd #floppy #tech #technology

lqdev👽02/26/2024

https://joeroganexp.libsyn.com/rss

Since Joe Rogan went exclusive with Spotify, I maybe listened to a handful of episodes. The main reason for it is, I don't use Spotify to listen to podcasts. Periodically, I'd scroll through the feed to see if he had any intereting guests on. As part of his new contract, the podcast is now available on other platforms. That means you can listen wherever you get your podcasts.

Permalink: /feed/jre-wherever-you-get-your-podcasts/

Tags: #rss #podcast #protocols #jre #joerogan #audio #spotify

lqdev👽02/26/2024

https://www.theverge.com/24044151/streaming-subscription-prices-dvd-collection

After spending years reassuring myself that I don’t need physical copies of movies because of streaming, DVDs have officially reentered my life.

Walmart...Thrift stores, flea markets, the library, and even my local mall’s FYE have also become places I frequent to get my hands on oft-ignored discs.

It makes sense to subscribe to all these services if you’re into the exclusive content on each one and have the patience to sift through their massive libraries. However, all I’ve been watching lately is the junk on Discovery Plus, simply because I’m too tired to find anything else — especially when the extremely specific shows and movies I want to watch keep switching services or just aren’t available. One of the most devastating examples of this was when both The Office and Parks and Recreation moved from Netflix to Peacock, disrupting the casual binge-watching sessions that I would default to when I was done with work.

Within the past year, nearly every streaming service has raised its prices, including Netflix, Disney Plus, Hulu, Paramount Plus, Discovery Plus, and Apple TV Plus.

I’m not saying DVDs are flawless: there’s a reason no one wants them anymore!

Despite this, it’s still nice to have something that you physically own and don’t even need an internet connection to use. So when Best Buy confirmed it would stop selling DVDs this year and rumors emerged that Walmart would do the same, I was pretty disappointed. I can’t imagine Walmart without its bin of DVDs, nor can I even see Best Buy without its already-shrunken selection of movies.

It’s 2024, and I’m not ready to say goodbye to DVDs — in fact, I’m just getting started.

Great article from Emma.

Personally, I've been doing the same. Just a few weekends ago, I got something like 8-10 DVDs for ~$25 at my local thrift store. That haul included 3 seasons of The Sopranos.

With streaming services taking back control of their content and putting it on their own platforms, I don't want to have to keep signing up for a new service just to watch the shows and movies I enjoy. Also, that's assuming you can find the content to begin with (i.e. Westworld).

Parks and Recreation, The Office, and Breaking Bad were some of the first shows I started collecting and have slowly been building up my collection. To save on space, I've ditched the cases and have the DVDs organized in a CD case. I've not only limited myself to DVDs but have also started collecting CDs as well.

Whenever I want variety, I just use one of the free streaming services like:

Freevee
Pluto TV
Tubi
YouTube

Yes, there are ads but at least I'm not paying for it and I know that's part of the deal. There are a ton of good older (and sometimes original) TV shows and movies on those platforms to keep me entertained. The most recent ones being Stargate and Vampire's Kiss.

Permalink: /feed/streaming-fatigue-collecting-dvds/

Tags: #streaming #media #dvds #movies #tv #physicalmedia

lqdev👽02/26/2024

https://mistral.ai/news/mistral-large/

Mistral Large is our flagship model, with top-tier reasoning capacities. It is also available on Azure.

Mistral Large comes with new capabilities and strengths:

It is natively fluent in English, French, Spanish, German, and Italian, with a nuanced understanding of grammar and cultural context.

Its 32K tokens context window allows precise information recall from large documents.

Its precise instruction-following enables developers to design their moderation policies – we used it to set up the system-level moderation of le Chat.

It is natively capable of function calling. This, along with constrained output mode, implemented on la Plateforme, enables application development and tech stack modernisation at scale.

At Mistral, our mission is to make frontier AI ubiquitous. This is why we’re announcing today that we’re bringing our open and commercial models to Azure.

Permalink: /feed/announcing-mistral-large/

Tags: #ai #mistral #llm #opensource #azure

lqdev👽02/24/2024

https://www.theverge.com/24078662/twin-peaks-zelda-links-awakening-influence

In a 2010 interview, Link’s Awakening director Takashi Tezuka revealed the inspiration for this memorably bizarre cast of characters. “At the time, Twin Peaks was rather popular. The drama was all about a small number of characters in a small town,” Tezuka said. “So I wanted to make something like that, while it would be small enough in scope to easily understand, it would have deep and distinctive characteristics.”

... [Mark] Frost reveals in an interview with The Verge, he actually spoke with Nintendo about the Zelda franchise. “I don’t want to overstate it. It was a single conversation. But it was fun,” he tells me.

“They were talking to me about a Twin Peaks game, and they mentioned Zelda at the time,” says Frost. “They said, ‘One of the things we love about your show is how there’s all sorts of sideways associations that can drive the story forward.’ They asked me about that as they were thinking about expanding the Zelda universe.”

Though he’d never played a Zelda game, Frost had enough experience with fantasy storytelling that he had some suggestions. “I’d played lots of Dungeons & Dragons when I was young, so I was familiar with the kind of story they were thinking about,” he says. “I think I said, ‘Don’t be afraid to use dreamlike, Jungian symbolism. Things can connect thematically without having to connect concretely.’ It was things like that that I was urging them [to consider].”

Permalink: /feed/direct-influence-twin-peaks-zelda/

Tags: #twinpeaks #nintendo #zelda #tv #culture #gaming

lqdev👽02/24/2024

https://explainextended.com/2023/12/31/happy-new-year-15/

GitHub Repo

Permalink: /feed/gpt-500-lines-sql/

Tags: #sql #gpt #ai #llm

lqdev👽02/24/2024

https://www.youtube.com/watch?v=eIm2eK5uuVA

Such a good fight. Frazier had no defense but just kept coming forward.

Permalink: /feed/ali-frazier-I/

Tags: #boxing #ali #frazier

lqdev👽02/24/2024

https://www.cnbc.com/2024/02/23/jim-cramer-mcdonalds-use-of-ai-at-drive-thrus-is-good-news-for-nvidia.html

I wish they'd use AI to keep the ice cream machines from breaking instead.

Permalink: /feed/mcdonalds-ai-drive-thru-nvidia/

Tags: #mcdonalds #ai #nvidia

lqdev👽02/22/2024

https://www.youtube.com/watch?v=VrnEQ3TqZGE

Great performance. I remember the first time I listened to Salami was when she opened for Flying Lotus and she was amazing.

Permalink: /feed/salami-rose-joe-louis-live-bandcamp-2023/

Tags: #brainfeeder #salamirosejoelouis #music #bandcamp

lqdev👽02/22/2024

https://bsky.social/about/blog/02-22-2024-open-social-web

Today, we’re excited to announce that the Bluesky network is federating and opening up in a way that allows you to host your own data. What does this mean?

Your data, such as your posts, likes, and follows, needs to be stored somewhere. With traditional social media, your data is stored by the social media company whose services you've signed up for. If you ever want to stop using that company's services, you can do that—but you would have to leave that social network and lose your existing connections.

It doesn't have to be this way! An alternative model is how the internet itself works. Anyone can put up a website on the internet

We think social media should work the same way. When you register on Bluesky, by default we'll suggest that Bluesky will store your data. But if you'd like to let another company store it, or even store it yourself, you can do that. You'll also be able to change your mind at any point, moving your data to another provider without losing any of your existing posts, likes, or follows. From your followers' perspective, your profile is always available at your handle—no matter where your information is actually stored, or how many times it has been moved.

I don't spend a lot of time on Bluesky, but I love what they're doing.

They now:

Now you can self-host your data. I'm excited.

The other piece of this that's interesting is their feature which enables you to use your domain as a custom handle. Not only is your identity portable, but also your data. I'd be interested to see how this works in practice given you can already do some of this on the Fediverse on platforms like Mastodon. Again, that portable identity component to me is crucial. That's one of the challenges with Mastodon today. While you can move instances, your identity changes and while most of your data comes with you, there's tihngs that still dont transfer over. The other part that I'd be interested in seeing is whether or not they can be efficient in the storage of federated data. One of the challenges with Mastodon is, your server quickly fills up because of data from other instances (when you federate). It's gotten better, but this is an area I spend most of my time when maintaining my own self-hosted instance.

I'm excited to tinker and self-host my own data. Maybe I'll also syndicate to Bluesky just like I do with Mastodon today.

In the meantime, you can find me on Bluesky @lqdev.me.

Permalink: /feed/bluesky-open-social-web/

Tags: #bluesky #socialmedia #web #internet #community

lqdev👽02/22/2024

https://stability.ai/news/stable-diffusion-3

Announcing Stable Diffusion 3 in early preview, our most capable text-to-image model with greatly improved performance in multi-subject prompts, image quality, and spelling abilities.

The Stable Diffusion 3 suite of models currently range from 800M to 8B parameters.

Stable Diffusion 3 combines a diffusion transformer architecture and flow matching.

Permalink: /feed/stable-diffusion-3-early-preview/

Tags: #stabilityai #ai #stablediffusion #imagegeneration #ml #image #genai #generativeai

lqdev👽02/21/2024

https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/the-ai-study-guide-azure-machine-learning-edition/ba-p/4063656

The AI Study Guide: Discover Machine Learning with these free Azure resources

Welcome to the February edition of the Azure AI Study Guide. Every month I’ll be spilling the tea on the best and newest tools for skilling up on Azure AI. This month we’re putting on our thinking caps to investigate Azure Machine Learning (ML). I’ll give you a quick breakdown of what it is, then we’ll explore a four-week roadmap of our top FREE resources for you to continue your AI learning journey! And as a bonus, stay tuned to the end to see what makes machine learning and generative AI a dynamic duo.

Permalink: /feed/ai-study-guide-azure-ml-edition/

Tags: #azure #tutorial #machinelearning #ml #ai #azureml

lqdev👽02/21/2024

https://doc.searls.com/2024/02/21/on-blogs/

Thoughts I jotted down on Mastodon:

Blogs are newsletters that don’t require subscriptions.

Blogrolls are lists of blogs.

Both require the lowest possible cognitive and economic overhead.

That’s why they are coming back.

I know, they never left. But you get my point.

Permalink: /feed/on-blogs-searls/

Tags: #blogs #writing #newsletters #blogroll

lqdev👽02/21/2024

https://www.windowscentral.com/phones/windows-phone/what-would-microsofts-windows-phone-look-like-in-2024-its-like-a-micro-pc-running-windows-12-in-your-pocket

Cool concept.

The likelihood of it happening is low, but there's a lot of really great opportunities here especially with the new wave or ARM PCs coming. I don't know what the device form factor looks like, but I wouldn't mind carrying around a pocket PC - a true mobile computer. Already with the Windows Store, you have access to tons of apps. For the apps that aren't in the Store, there's the browser. That seemed to be good enough for Apple's Vision Pro. Taking it a step further would the app gap matter as much if you have Copilot as your concierge orchestrating tasks for you using the various services? Better yet, what if these services had their own assistants / GPTs Copilot could talk to and coordinate on your behalf?

At some point, I might just use OpenAI's Sora model to live vicariously through an AI-generated video depicting this alternate reality where Windows Phone exists...

Permalink: /feed/windows-12-mobile-concept/

Tags: #windows #mobile #windowsphone #pc

lqdev👽02/21/2024

https://simonwillison.net/2024/Feb/21/gemini-pro-video/

I’ve been playing with Gemini Pro 1.5 for a few days, and I think the most exciting feature isn’t so much the token count... it’s the ability to use video as an input.

The ability to extract structured content from text is already one of the most exciting use-cases for LLMs. GPT-4 Video and LLaVA expanded that to images. And now Gemini Pro 1.5 expands that to video.

The ability to analyze video like this feels SO powerful. Being able to take a 20 second video of a bookshelf and get back a JSON array of those books is just the first thing I thought to try.

...as always with modern AI, there are still plenty of challenges to overcome...But this really does feel like another one of those glimpses of a future that’s suddenly far closer then I expected it to be.

Permalink: /feed/gemini-pro-video-willison/

Tags: #google #ai #gemini

lqdev👽02/21/2024

https://huggingface.co/chat/assistants

The goal of this app is to showcase that it is now possible to build an open source alternative to ChatGPT.

Permalink: /feed/huggingchat/

Tags: #huggingface #assistants #ai #chat #opensource

lqdev👽02/21/2024

https://blog.google/technology/developers/gemma-open-models/

Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Developed by Google DeepMind and other teams across Google, Gemma is inspired by Gemini, and the name reflects the Latin gemma, meaning “precious stone.” Accompanying our model weights, we’re also releasing tools to support developer innovation, foster collaboration, and guide responsible use of Gemma models.

HuggingFace Gemma Release

Permalink: /feed/gemma-google-ai-models/

Tags: #ai #opensource #google #model #llm

lqdev👽02/20/2024

https://www.localfirst.fm/

A podcast about local-first software development

Permalink: /feed/localfirstfm-podcast/

Tags: #development #software #podcast #tech #podcast #localfirst #local

lqdev👽02/20/2024

https://signal.org/blog/phone-number-privacy-usernames/

Signal’s mission and sole focus is private communication. For years, Signal has kept your messages private, your profile information (like your name and profile photo) private, your contacts private, and your groups private – among much else. Now we’re taking that one step further, by making your phone number on Signal more private.

Here’s how:

New default: Your phone number will no longer be visible to everyone in Signal...

Connect without sharing your phone number...

Control who can find you on Signal by phone number...

Right now, these options are in beta, and will be rolling out to everyone in the coming weeks.

Permalink: /feed/signal-usernames/

Tags: #signal #privacy #security #osint #messaging

lqdev👽02/20/2024

https://huggingface.co/datasets/HuggingFaceTB/cosmopedia

Cosmopedia is a dataset of synthetic textbooks, blogposts, stories, posts and WikiHow articles generated by Mixtral-8x7B-Instruct-v0.1.The dataset contains over 30 million files and 25 billion tokens, making it the largest open synthetic dataset to date.

It covers a variety of topics; we tried to map world knowledge present in Web datasets like RefinedWeb and RedPajama, and generate synthetic content that covers them. This is the v0.1 of Cosmopedia, with ample room for improvement and topics to be more comprehensively covered. We hope this dataset will help the community's research efforts in the increasingly intriguing domain of synthetic data.

This work is inspired by the great work of Phi1.5.

Permalink: /feed/cosmopedia-ai-synthetic-dataset/

Tags: #cosmopedia #dataset #huggingface #mixtral #ai #genai #llm

lqdev👽02/20/2024

https://www.swift.org/blog/mlx-swift/

The Swift programming language has a lot of potential to be used for machine learning research because it combines the ease of use and high-level syntax of a language like Python with the speed of a compiled language like C++.

MLX is an array framework for machine learning research on Apple silicon. MLX is intended for research and not for production deployment of models in apps.

MLX Swift expands MLX to the Swift language, making experimentation on Apple silicon easier for ML researchers.

Permalink: /feed/swift-mlx-ml-research/

Tags: #apple #mlx #swift #opensource #ml #ai

lqdev👽02/19/2024

https://ollama.com/blog/windows-preview

Ollama is now available on Windows in preview, making it possible to pull, run and create large language models in a new native Windows experience. Ollama on Windows includes built-in GPU acceleration, access to the full model library, and the Ollama API including OpenAI compatibility.

Download (https://ollama.com/download/windows)

Permalink: /feed/ollama-windows-preview/

Tags: #ollama #windows #llm #opensource #localmodels #ml #ai

lqdev👽02/17/2024

https://www.jayeless.net/2024/02/staticrypt.html

...I was longing for a way to do friends-only blog posts on the open web, today I came across StatiCrypt , an open-source utility that lets you encrypt static HTML pages behind a password.

Permalink: /feed/staticrypt-protected-posts-static-site-jayless/

Tags: #staticsite #web #blogs #community #openweb #security #privacy

lqdev👽02/16/2024

https://huggingface.co/learn/cookbook/index

The Open-Source AI Cookbook is a collection of notebooks illustrating practical aspects of building AI applications and solving various machine learning tasks using open-source tools and models.

Permalink: /feed/open-source-ai-cookbook-hf/

Tags: #ai #tutorial #huggingface #opensource #ml

lqdev👽02/15/2024

https://observablehq.com/blog/observable-2-0

Today we’re launching Observable 2.0 with a bold new vision: an open-source static site generator for building fast, beautiful data apps, dashboards, and reports.

Our mission is to help teams communicate more effectively with data. Effective presentation of data is critical for deep insight, nuanced understanding, and informed decisions. Observable notebooks are great for ephemeral, ad hoc data exploration. But notebooks aren’t well-suited for polished dashboards and apps.

Permalink: /feed/observable-2-0/

Tags: #data #analytics #observable

lqdev👽02/15/2024

https://ai.meta.com/blog/v-jepa-yann-lecun-ai-model-video-joint-embedding-predictive-architecture/

Today, we’re publicly releasing the Video Joint Embedding Predictive Architecture (V-JEPA) model, a crucial step in advancing machine intelligence with a more grounded understanding of the world.

This early example of a physical world model excels at detecting and understanding highly detailed interactions between objects.

In the spirit of responsible open science, we’re releasing this model under a Creative Commons NonCommercial license for researchers to further explore.

Permalink: /feed/v-jepa-meta/

Tags: #meta #ai #ami #ml #opensource #multimodal

lqdev👽02/15/2024

https://magic.dev/

Magic is working on frontier-scale code models to build a coworker, not just a copilot.

Permalink: /feed/magic-dev/

Tags: #ai #copilot #magic #code #agi

lqdev👽02/15/2024

https://openai.com/sora

Sora is an AI model that can create realistic and imaginative scenes from text instructions.

Sora can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt.

Permalink: /feed/openai-sora/

Tags: #openai #multimodal #ai #texttovideo

lqdev👽02/15/2024

https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/?ref=ellipsismx.com

Today, we’re announcing our next-generation model: Gemini 1.5.

Gemini 1.5 delivers dramatically enhanced performance. It represents a step change in our approach, building upon research and engineering innovations across nearly every part of our foundation model development and infrastructure. This includes making Gemini 1.5 more efficient to train and serve, with a new Mixture-of-Experts (MoE) architecture.

Gemini 1.5 Pro comes with a standard 128,000 token context window. But starting today, a limited group of developers and enterprise customers can try it with a context window of up to 1 million tokens via AI Studio and Vertex AI in private preview.

Permalink: /feed/gemini-1-5-google/

Tags: #ai #gemini #lm #google

lqdev👽02/14/2024

https://twit.tv/posts/inside-twit/club-shows-now-open-everyone

We are thrilled to announce that our Club TWiT shows are now available to everyone in audio form. That's right, you can now listen to your favorite shows anytime, anywhere, and it's all starting as early as the end of this week.

Permalink: /feed/club-twit-shows-available-to-all/

Tags: #twit #podcast #tech #community

lqdev👽02/14/2024

https://twit.tv/posts/inside-twit/twits-lesser-known-rss-feeds

Subscribed!

Many people are unaware that TWiT also has RSS feeds designed for news aggregators like Feedly, NetNewsWire, Mozilla Thunderbird, and Akregator. These feeds are not meant for podcast apps but are specifically designed for news aggregators. You can copy any of the RSS feed links below into your RSS feed reader of choice and get updates on the latest TWiT blog posts, articles, or podcasts as soon as they are published.

Permalink: /feed/twit-lesser-known-rss-feeds/

Tags: #twit #rss #tech #news #community

lqdev👽02/14/2024

https://matthiasott.com/notes/we-love-rss

What makes RSS so powerful is that it is an open format. RSS is one of the reasons the blogosphere grew so rapidly and it is the reason why podcasting exploded: because this open format allowed everyone to participate by simply publishing a feed anywhere on the web, without being restricted by platform requirements, closed APIs, and paywalls. And this superpower is also why RSS is having a renaissance today: it allows everyone to subscribe to, share, syndicate, and cross-post content on the open web. And it also enables creative automations using tools like Zapier, IFTTT, Huggin, or n8n.

There is no denying that RSS is having a moment again. Not only because it allows us all to improve the discoverability of our work and explore online content in a personalized and deliberate way, but also because it remains one of the most powerful and influential technologies of the open web. RSS already is the cornerstone of many open technology systems like podcasting, which can’t be owned and controlled by any one company. As Anil Dash notes, this alone is radical, because it is the triumph of exactly the kind of technology that's supposed to be impossible: open and empowering tech that allows people to have ownership over their work and their relationship with their audience.

Permalink: /feed/we-love-rss-ott/

Tags: #rss #web #internet #oss #openweb #indieweb

lqdev👽02/14/2024

https://www.theverge.com/24067997/robots-txt-ai-text-file-web-crawlers-spiders

For three decades, a tiny text file has kept the internet from chaos. This text file has no particular legal or technical authority, and it’s not even particularly complicated. It represents a handshake deal between some of the earliest pioneers of the internet to respect each other’s wishes and build the internet in a way that benefitted everybody. It’s a mini constitution for the internet, written in code.

It’s called robots.txt and is usually located at yourwebsite.com/robots.txt. That file allows anyone who runs a website — big or small, cooking blog or multinational corporation — to tell the web who’s allowed in and who isn’t. Which search engines can index your site? What archival projects can grab a version of your page and save it? Can competitors keep tabs on your pages for their own files? You get to decide and declare that to the web.

It’s not a perfect system, but it works. Used to, anyway. For decades, the main focus of robots.txt was on search engines; you’d let them scrape your site and in exchange they’d promise to send people back to you. Now AI has changed the equation: companies around the web are using your site and its data to build massive sets of training data, in order to build models and products that may not acknowledge your existence at all.

The robots.txt file governs a give and take; AI feels to many like all take and no give. But there’s now so much money in AI, and the technological state of the art is changing so fast that many site owners can’t keep up. And the fundamental agreement behind robots.txt, and the web as a whole — which for so long amounted to “everybody just be cool” — may not be able to keep up either.

Permalink: /feed/verge-ai-robots-txt/

Tags: #ai #internet #web

lqdev👽02/13/2024

https://devblogs.microsoft.com/commandline/introducing-sudo-for-windows/

Sudo for Windows is a new way for users to run elevated commands directly from an unelevated console session. It is an ergonomic and familiar solution for users who want to elevate a command without having to first open a new elevated console.

We are also excited to announce that we are open-sourcing this project here on GitHub!

GitHub Repo

Permalink: /feed/introducing-sudo-windows/

Tags: #windows #sudo #system

lqdev👽02/13/2024

https://www.microsoft.com/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/

Perhaps the greatest challenge – and opportunity – of LLMs is extending their powerful capabilities to solve problems beyond the data on which they have been trained, and to achieve comparable results with data the LLM has never seen. This opens new possibilities in data investigation, such as identifying themes and semantic concepts with context and grounding on datasets. In this post, we introduce GraphRAG, created by Microsoft Research, as a significant advance in enhancing the capability of LLMs.

Permalink: /feed/ms-research-graphrag/

Tags: #research #ai #rag #llm #data #microsoft

lqdev👽02/13/2024

https://everynoise.com/engenremap.html

Every Noise at Once is an ongoing attempt at an algorithmically-generated, readability-adjusted scatter-plot of the musical genre-space, based on data tracked and analyzed for 6,291 genre-shaped distinctions by Spotify as of 2023-11-19. The calibration is fuzzy, but in general down is more organic, up is more mechanical and electric; left is denser and more atmospheric, right is spikier and bouncier.

Permalink: /feed/every-noise-at-once/

Tags: #music #genres #discovery #spotify

lqdev👽02/13/2024

https://www.nvidia.com/ai-on-rtx/chat-with-rtx-generative-ai/

Chat With RTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content—docs, notes, videos, or other data. Leveraging retrieval-augmented generation (RAG), TensorRT-LLM, and RTX acceleration, you can query a custom chatbot to quickly get contextually relevant answers. And because it all runs locally on your Windows RTX PC or workstation, you’ll get fast and secure results.

Permalink: /feed/nvidia-chat-rtx/

Tags: #nvidia #chat #ai #rag #chatbot #gpu #llm

lqdev👽02/13/2024

https://github.com/Stability-AI/StableCascade

Stable Cascade consists of three models: Stage A, Stage B and Stage C, representing a cascade for generating images, hence the name "Stable Cascade". Stage A & B are used to compress images, similarly to what the job of the VAE is in Stable Diffusion. However, as mentioned before, with this setup a much higher compression of images can be achieved. Furthermore, Stage C is responsible for generating the small 24 x 24 latents given a text prompt. The following picture shows this visually. Note that Stage A is a VAE and both Stage B & C are diffusion models.

For this release, we are providing two checkpoints for Stage C, two for Stage B and one for Stage A. Stage C comes with a 1 billion and 3.6 billion parameter version, but we highly recommend using the 3.6 billion version, as most work was put into its finetuning. The two versions for Stage B amount to 700 million and 1.5 billion parameters. Both achieve great results, however the 1.5 billion excels at reconstructing small and fine details. Therefore, you will achieve the best results if you use the larger variant of each. Lastly, Stage A contains 20 million parameters and is fixed due to its small size.

Permalink: /feed/stability-ai-stable-cascade/

Tags: #stabilityai #ai #stablediffusion

lqdev👽02/13/2024

https://openai.com/blog/memory-and-new-controls-for-chatgpt

We’re testing memory with ChatGPT. Remembering things you discuss across all chats saves you from having to repeat information and makes future conversations more helpful.

You're in control of ChatGPT's memory. You can explicitly tell it to remember something, ask it what it remembers, and tell it to forget conversationally or through settings. You can also turn it off entirely.

We are rolling out to a small portion of ChatGPT free and Plus users this week to learn how useful it is. We will share plans for broader roll out soon.

Permalink: /feed/openai-chatgpt-memory/

Tags: #openai #chatgpt #memory #ai #llm #gpt

lqdev👽02/06/2024

https://gvwilson.github.io/sql-tutorial/

notes and working examples that instructors can use to perform a lesson

Permalink: /feed/sql-for-data-scientists-100-queries/

Tags: #sql #datascience #tutorial

lqdev👽02/06/2024

https://www.theverge.com/2024/2/6/24063705/whatsapp-interoperability-plans-eu-dma

WhatsApp, like many other major tech platforms, will have to make some significant changes to comply with the European Union’s Digital Markets Act (DMA). One of those changes is interoperability with other messaging platforms...

The shift toward interoperability will first include text messages, images, voice messages, videos, and files sent from one person to another. In theory, this would allow users to chat with people on WhatsApp through third-party apps, like iMessage, Telegram, Google Messages, and Signal, and vice versa.

As noted by Wired, WhatsApp wants the messaging services it connects with to use the same Signal Protocol to encrypt messages. Meta is also open to apps using alternate encryption protocols so long as companies can prove “they reach the security standards that WhatsApp outlines in its guidance.” The third-party services will also have to sign a contract with Meta before they plug into WhatsApp, with more details about the agreement coming in March.

Permalink: /feed/how-whatsapp-plans-interoperate-other-messaging-services/

Tags: #whatsapp #messaging #interoperability #signal

lqdev👽02/03/2024

https://blog.nomic.ai/posts/nomic-embed-text-v1

We're excited to announce the release of Nomic Embed, the first

Open source

Open data

Open training code

Fully reproducible and auditable

Permalink: /feed/nomic-embed-open-embedding-model/

Tags: #embedding #ai #opensource #ml #model

lqdev👽01/31/2024

https://blog.langchain.dev/opengpts/

A little over two months ago, on the heels of OpenAI dev day, we launched OpenGPTs: a take on what an open-source GPT store may look like. It was powered by an early version of LangGraph - an extension of LangChain aimed at building agents as graphs. At the time, we did not highlight this new package much, as we had not publicly launched it and were still figuring out the interface. We finally got around to launching LangGraph two weeks ago, and over the past weekend we updated OpenGPTs to fully use LangGraph (as well as added some new features). We figure now is as good of time as any to do a technical deep-dive on OpenGPTs and what powers it.

In this blog, we will talk about:

MessageGraph: A particular type of graph that OpenGPTs runs on

Cognitive architectures: What the 3 different types of cognitive architectures OpenGPTs supports are, and how they differ

Persistence: How persistence is baked in OpenGPTs via LangGraph checkpoints.

Configuration: How we use LangChain primitives to configure all these different bots.

New models: what new models we support

New tools: what new tools we support

astream_events: How we are using this new method to stream tokens and intermediate steps

Permalink: /feed/langchain-opengpts/

Tags: #langchain #gpt #ai #opengpt #langgraph #messagegraph

lqdev👽01/31/2024

https://inteltechniques.com/blog/2024/01/27/unredacted-magazine-issue-006/

Permalink: /feed/unredacted-magazine-issue-006/

Tags: #osint #magazine #unredacted #privacy #security

lqdev👽01/31/2024

https://www.npr.org/2024/01/26/1226810515/tiny-desk-concert-thee-sacred-souls

San Diego-based trio Thee Sacred Souls made its mark at the Tiny Desk with satin vocals and vintage melodies. Paying homage to southern California Latino culture meeting American soul roots, the group's sweet fusion melodies brought history and love into the space.

Permalink: /feed/thee-sacred-souls-npr-tiny-desk-concerts/

Tags: #theesacredsouls #music #concert #npr #tinydeskconcert

lqdev👽01/29/2024

https://blog.rwkv.com/p/eagle-7b-soaring-past-transformers

Eagle 7B is a 7.52B parameter model that:

Built on the RWKV-v5 architecture (a linear transformer with 10-100x+ lower inference cost)

Ranks as the world’s greenest 7B model (per token)

Trained on 1.1 Trillion Tokens across 100+ languages

Outperforms all 7B class models in multi-lingual benchmarks

Approaches Falcon (1.5T), LLaMA2 (2T), Mistral (>2T?) level of performance in English evals

Trade blows with MPT-7B (1T) in English evals

All while being an “Attention-Free Transformer”

Is a foundation model, with a very small instruct tune - further fine-tuning is required for various use cases!

We are releasing RWKV-v5 Eagle 7B, licensed as Apache 2.0 license, under the Linux Foundation, and can be used personally or commercially without restrictions

Download from HuggingFace

Permalink: /feed/eagle-7b-rkwv/

Tags: #ai #llm #rwkv #deeplearning #neuralnetwork

lqdev👽01/29/2024

https://www.youtube.com/watch?v=nOxKexn3iBo

In this comprehensive video tutorial, Jeremy Howard from answer.ai demystifies the process of programming NVIDIA GPUs using CUDA, and simplifies the perceived complexities of CUDA programming. Jeremy emphasizes the accessibility of CUDA, especially when combined with PyTorch's capabilities, allowing for programming directly in notebooks rather than traditional compilers and terminals. To make CUDA more approachable to Python programmers, Jeremy shows step by step how to start with Python implementations, and then convert them largely automatically to CUDA. This approach, he argues, simplifies debugging and development.

The tutorial is structured in a hands-on manner, encouraging viewers to follow along in a Colab notebook. Jeremy uses practical examples, starting with converting an RGB image to grayscale using CUDA, demonstrating the process step-by-step. He further explains the memory layout in GPUs, emphasizing the differences from CPU memory structures, and introduces key CUDA concepts like streaming multi-processors and CUDA cores.

Jeremy then delves into more advanced topics, such as matrix multiplication, a critical operation in deep learning. He demonstrates how to implement matrix multiplication in Python first and then translates it to CUDA, highlighting the significant performance gains achievable with GPU programming. The tutorial also covers CUDA's intricacies, such as shared memory, thread blocks, and optimizing CUDA kernels.

The tutorial also includes a section on setting up the CUDA environment on various systems using Conda, making it accessible for a wide range of users.

This is lecture 3 of the "CUDA Mode" series (but you don't need to watch the others first). The notebook is available in the lecture3 folder here: https://github.com/cuda-mode/lecture2...

Permalink: /feed/get-started-cuda-python-programmers/

Tags: #gpu #cuda #python #programming

lqdev👽01/25/2024

https://ollama.ai/blog/python-javascript-libraries

The initial versions of the Ollama Python and JavaScript libraries are now available:

Ollama Python Library
Ollama JavaScript Library

Both libraries make it possible to integrate new and existing apps with Ollama in a few lines of code, and share the features and feel of the Ollama REST API.

Permalink: /feed/python-js-ollama-libraries/

Tags: #ollama #ai #llama #python #javascript #opensource

lqdev👽01/25/2024

https://www.theverge.com/2024/1/25/24050445/google-cloud-hugging-face-ai-developer-access

Google Cloud’s new partnership with AI model repository Hugging Face is letting developers build, train, and deploy AI models without needing to pay for a Google Cloud subscription. Now, outside developers using Hugging Face’s platform will have “cost-effective” access to Google’s tensor processing units (TPU) and GPU supercomputers, which will include thousands of Nvidia’s in-demand and export-restricted H100s.

Google said that Hugging Face users can begin using the AI app-building platform Vertex AI and the Kubernetes engine that helps train and fine-tune models “in the first half of 2024.”

Press Release

Permalink: /feed/google-cloud-huggingface-deal/

Tags: #google #huggingface #ai #cloud #llm #ml #developers

lqdev👽01/25/2024

https://openai.com/blog/new-embedding-models-and-api-updates

We are launching a new generation of embedding models, new GPT-4 Turbo and moderation models, new API usage management tools, and soon, lower pricing on GPT-3.5 Turbo.

Permalink: /feed/new-openai-text-embedding-models-3/

Tags: #openai #llm #embedding #openai #gpt

lqdev👽01/23/2024

https://fosdem.org/2024/schedule/events/

Lots of great sessions. I'm looking forward to the sessions on the following topics:

Matrix
AI
Nix / NixOS
Software Defined Radio (SDR) & Amateur Radio
Modern Email
Collaboration & Content Management

Permalink: /feed/fosdem-2024-schedule/

Tags: #fosdem #opensource #foss #floss #ai #matrix #nixos

lqdev👽01/23/2024

https://openai.com/research/microscope

We’re introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision “model organisms” which are often studied in interpretability. Microscope makes it easier to analyze the features that form inside these neural networks, and we hope it will help the research community as we move towards understanding these complicated systems.

Permalink: /feed/openai-microscope/

Tags: #openai #interpretability #ai #generativeai #visualization

lqdev👽01/23/2024

https://inteltechniques.com/blog/2024/01/05/unredacted-magazine-status/

...the magazine is not 'dead'. Much like the podcast, it is simply on a hiatus. Many people falsely report online that the podcast and magazine are officially never coming back, which is contradictory to my previous post. The reason there have been no issues of the magazine is simply a lack of submissions.

The magazine is a community-driven product. Without the community driving it, it will go nowhere. If you would like to submit an article, please email it to staff@unredactedmagazine.com.

Sponsors are lined up to pay the costs and keep the content free, but there lies other problems. We received constant complaints about having sponsors. Most readers demanded free content without ads, which is unrealistic.

We have considered a small fee per issue, but the credit card fraud which comes with that is an even bigger issue. What is the solution? I do not know yet. If the articles pour in, I will figure it out.

Permalink: /feed/unredacted-magazine-status-jan-2024/

Tags: #magazine #unredacted #osint

lqdev👽01/23/2024

https://paulgraham.com/microsoft.html

GIF of WWE Undertaker sitting up

Interesting points.

Permalink: /feed/graham-microsoft-dead/

Tags: #google #microsoft #web

lqdev👽01/23/2024

https://nightshade.cs.uchicago.edu/

Nightshade, a tool that turns any image into a data sample that is unsuitable for model training. More precisely, Nightshade transforms images into "poison" samples, so that models training on them without consent will see their models learn unpredictable behaviors that deviate from expected norms, e.g. a prompt that asks for an image of a cow flying in space might instead get an image of a handbag floating in space.

What is NightShade?

Permalink: /feed/nightshade-ai-model-poisoning-tool/

Tags: #research #tools #ai #generativeai #llm #computervision #cv

lqdev👽01/23/2024

https://dayoneapp.com/blog/introducing-shared-journals/

Shared Journals are a private space for your closest friends and family to shared life updates and memories. Shared Journals introduce a new dimension to journaling, offering a unique way to share your personal stories and experiences with up to 30 selected individuals, while keeping your individual entries private and secure.

Awesome! When I was writing the post Private Social Media yesterday, I wasn't aware that these had already launched. I knew they were in beta but it's great to see they're now generally available. I'll have to give them a try.

Permalink: /feed/introducing-shared-journals-day-one/

Tags: #dayone #automattic #journal

lqdev👽01/23/2024

https://stability.ai/news/introducing-stable-lm-2

Stable LM 2 1.6B is a state-of-the-art 1.6 billion parameter small language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.

This model's compact size and speed lower hardware barriers, allowing more developers to participate in the generative AI ecosystem.

In addition to the pre-trained and instruction-tuned version, we release the last checkpoint before the pre-training cooldown. We include optimizer states to facilitate developers in fine-tuning and experimentation. Data details will be provided in the upcoming technical report.

Stable LM 2 1.6B can be used now both commercially and non-commercially with a Stability AI Membership & you can test the model on Hugging Face.

Permalink: /feed/introducing-stable-lm-2-1-6B/

Tags: #ai #stabilityai #slm #llm #smalllanguagemodel

lqdev👽01/22/2024

https://www.theverge.com/24036427/rss-feed-reader-best

RSS readers allow you to collect the articles of specific sources in one app, making it a lot easier to find the content you’re interested in without crawling through a lot of noise.

Whatever RSS feed reader you choose, it’s worth it to try at least one or two. This way, you can keep up with news from your favorite sources without depending on the chaos that is your email account or the random opinions from TikTok.

Great overview of the various RSS feed readers out there.

Permalink: /feed/top-five-rss-feed-readers-verge/

Tags: #rss #news #feeds #indieweb #protocols

lqdev👽01/20/2024

https://www.theverge.com/2024/1/20/24044343/apple-vision-pro-safari-killer-app

it’s increasingly clear that the early success of the Vision Pro, and much of the answer to the question of what this headset is actually for, will come from a single app: Safari.

That’s right, friends. Web browsers are back.

...at least at first, the open web is Apple’s best chance to make its headset a winner. Because at least so far, it seems developers are not exactly jumping to build new apps for Apple’s new platform.

Some of the high-profile companies that have announced they’re not yet building apps for the Vision Pro and its visionOS platform — Netflix, Spotify, YouTube, and others — are the very same ones that have loudly taken issue with how Apple runs the App Store.

But what if you don’t need the App Store to reach Apple users anymore? All this corporate infighting has the potential to completely change the way we use our devices, starting with the Vision Pro.

...we’ve all stopped opening websites and started tapping app icons, but the age of the URL might be coming back.

If you believe the open web is a good thing, and that developers should spend more time on their web apps and less on their native ones, this is a big win for the future of the internet.

The problem is, it’s happening after nearly two decades of mobile platforms systematically downgrading and ignoring their browsing experience...Mobile platforms treat browsers like webpage viewers, not app platforms, and it shows.

There are some reasons for hope, though...the company appears to be still invested in making Safari work.

Safari for visionOS will also come with some platform-specific features: you’ll be able to open multiple windows at the same time and move them all around in virtual space.

With a good browser and powerful PWAs, many users might mostly not notice the difference between opening the Spotify app and going to Spotify.com. That’s a win for the whole web.

here’s the real question for Apple: which is more important, getting the Vision Pro off to a good start or protecting the sanctity of its App Store control at all costs? As Apple tries to create a platform shift to face computers, I’m not sure it can have it both ways.

Great article by David Pierce. As part of my website stats, I should probably start also counting authors I reference since many of the articles from the Verge I've previously linked to are written by David.

As someone who accesses services - "apps" - primarily through the web browser on desktop, this is exciting to see. While native apps have their advantages, the types of cross-platform connected experiences that can be delivered through the browser can't be ignored. First-class support for browsers in various platforms can only make these experiences even better. With more folks building their own platforms on the web on top of open standards that have been around for decades, I'm excited for the future of the web.

Permalink: /feed/vision-pros-most-important-app-safari/

Tags: #apple #safari #visionpro #web #browser #vr #xr #mr #ar #indieweb #openweb

lqdev👽01/17/2024

https://blog.bytebytego.com/p/how-discord-serves-15-million-users

In early summer 2022, the Discord operations team noticed unusually high activity on their dashboards. They thought it was a bot attack, but it was legitimate traffic from MidJourney - a new, fast-growing community for generating AI images from text prompts.
To use MidJourney, you need a Discord account. Most MidJourney users join one main Discord server. This server grew so quickly that it soon hit Discord’s old limit of around 1 million users per server.
This is the story of how the Discord team creatively solved this challenge.

Discord’s real-time messaging backend is built with Elixir. Elixir runs on the BEAM virtual machine. BEAM was created for Erlang - a language optimized for large real-time systems requiring rock-solid reliability and uptime.
A key capability BEAM provides is extremely lightweight parallel processes. This enables a single server to efficiently run tens or hundreds of thousands of processes concurrently.
Elixir brings friendlier, Ruby-inspired syntax to the battle-tested foundation of BEAM. Combined they make it much easier to program massively scalable, fault-tolerant systems.
So by leveraging BEAM's lightweight processes, the Elixir code powering Discord can "fan out" messages to hundreds of thousands of users around the world concurrently. However, limits emerge as communities grow larger.

Permalink: /feed/how-discord-serves-15-million-users-using-one-server/

Tags: #beam #erlang #elixir #discord #scale #enterprise #chat #communities

lqdev👽01/17/2024

https://willowprotocol.org/

A protocol for peer-to-peer data stores. The best parts? Fine-grained permissions, a keen approach to privacy, destructive edits, and a dainty bandwidth and memory footprint.

Permalink: /feed/willow-protocol/

Tags: #willow #protocol #p2p #privacy #data

lqdev👽01/17/2024

https://stability.ai/news/stable-code-2024-llm-code-completion-release

Stable Code 3B is a 3 billion parameter Large Language Model (LLM), allowing accurate and responsive code completion at a level on par with models such as CodeLLaMA 7b that are 2.5x larger.

Operates offline even without a GPU on common laptops such as a MacBook Air.

Permalink: /feed/stable-code-3b-stability/

Tags: #ai #stabilityai" #code #software #softwaredevelopment #llm

lqdev👽01/17/2024

https://huyenchip.com//2024/01/16/sampling.html

ML models are probabilistic. Imagine that you want to know what’s the best cuisine in the world. If you ask someone this question twice, a minute apart, their answers both times should be the same. If you ask a model the same question twice, its answer can change.

This probabilistic nature makes AI great for creative tasks.

However, this probabilistic nature also causes inconsistency and hallucinations. It’s fatal for tasks that depend on factuality. Recently, I went over 3 months’ worth of customer support requests of an AI startup I advise and found that ⅕ of the questions are because users don’t understand or don’t know how to work with this probabilistic nature.

To understand why AI’s responses are probabilistic, we need to understand how models generate responses, a process known as sampling (or decoding). This post consists of 3 parts.
1. Sampling: sampling strategies and sampling variables including temperature, top-k, and top-p.
2. Test time sampling: sampling multiple outputs to help improve a model’s performance.
3. Structured outputs: how to get models to generate outputs in a certain format.

Permalink: /feed/sampling-text-generation-huyen/

Tags: #ai #generativeai #statistics #llm

lqdev👽01/17/2024

https://www.theverge.com/2024/1/9/24032155/youtube-podcast-rss-spotify-apple-audacy-bankruptcy

Today, YouTube at very long last debuts RSS integration.

This means more hosts saying "...or wherever you get your podcasts." 🙂

Support Post

Permalink: /feed/podcast-youtube-era/

Tags: #rss #podcasts #youtube

lqdev👽01/17/2024

https://themarkup.org/privacy/2024/01/17/each-facebook-user-is-monitored-by-thousands-of-companies-study-indicates

By now most internet users know their online activity is constantly tracked.

But what is the scale of this surveillance? Judging from data collected by Facebook and newly described in a unique study by non-profit consumer watchdog Consumer Reports, it’s massive, and examining the data may leave you with more questions than answers.

Using a panel of 709 volunteers who shared archives of their Facebook data, Consumer Reports found that a total of 186,892 companies sent data about them to the social network. On average, each participant in the study had their data sent to Facebook by 2,230 companies. That number varied significantly, with some panelists’ data listing over 7,000 companies providing their data.

What Exactly Does This Data Contain?

The data examined by Consumer Reports in this study comes from two types of collection: events and custom audiences. Both categories include information about what people do outside of Meta’s platforms.
Custom audiences allow advertisers to upload customer lists to Meta, often including identifiers like email addresses and mobile advertising IDs...
The other category of data collection, “events,” describes interactions that the user had with a brand, which can occur outside of Meta’s apps and in the real world. Events can include visiting a page on a company’s website, leveling up in a game, visiting a physical store, or purchasing a product...

How Can I See My Data?

Facebook users can browse through the list of companies that have sent their data to Facebook by going to: [https://accountscenter.facebook.com/info_and_permissions](https://accountscenter.facebook.com/info_and_permissions)

Permalink: /feed/facebook-users-monitored-thousands-companies-markup/

Tags: #socialmedia #privacy #facebook #meta #surveillance

lqdev👽01/17/2024

https://simonwillison.net/2024/Jan/17/oxide-and-friends/

I recorded an episode of the Oxide and Friends podcast on Monday, talking with Bryan Cantrill and Adam Leventhal about Open Source LLMs.

Too important for a small group to control...

This technology is clearly extremely important to the future of all sorts of things that we want to do.
I am totally on board with it. There are people who will tell you that it’s all hype and bluster. I’m over that. This stuff’s real. It’s really useful.
It is far too important for a small group of companies to completely control this technology. That would be genuinely disastrous. And I was very nervous that was going to happen, back when it was just OpenAI and Anthropic that had the only models that were any good, that was really nerve-wracking.
Today I’m not afraid of that at all, because there are dozens of organizations now that have managed to create one of these things...

On LLMs for learning...

One of the most exciting things for me about this technology is that it’s a teaching assistant that is always available to you.
You know that thing where you’re learning—especially in a classroom environment—and you miss one little detail and you start falling further and further behind everyone else because there was this one little thing you didn’t quite catch, and you don’t want to ask stupid questions?
You can ask stupid questions of ChatGPT anytime you like and it can help guide you through to the right answer.
That’s kind of a revelation.

Permalink: /feed/talking-open-source-llms-oxide-friends-willison/

Tags: #ai #podcast #opensource #llm

lqdev👽01/17/2024

https://arxiv.org/abs/2301.12662

We present SingSong, a system that generates instrumental music to accompany input vocals, potentially offering musicians and non-musicians alike an intuitive new way to create music featuring their own voice. To accomplish this, we build on recent developments in musical source separation and audio generation. Specifically, we apply a state-of-the-art source separation algorithm to a large corpus of music audio to produce aligned pairs of vocals and instrumental sources. Then, we adapt AudioLM (Borsos et al., 2022) -- a state-of-the-art approach for unconditional audio generation -- to be suitable for conditional "audio-to-audio" generation tasks, and train it on the source-separated (vocal, instrumental) pairs. In a pairwise comparison with the same vocal inputs, listeners expressed a significant preference for instrumentals generated by SingSong compared to those from a strong retrieval baseline. Sound examples at this https URL

AI can now help you create a backing track to all the songs you make up about your pets.

Permalink: /feed/singsong-ai-music-accompaniment/

Tags: #ai #generativeai #music

lqdev👽01/17/2024

https://www.theverge.com/2024/1/17/24041330/notion-calendar-app

After acquiring Cron in 2022, Notion is bringing the calendar app fully into its all-in-one workspace.

The big new feature coming with the rebranding is Notion integration. If you or your company uses Notion, you’ll be able to create or link Notion documents inside a calendar invite. If you have a database filled with due dates, you can add that as a calendar to Notion Calendar. It sounds like a much better way to handle agendas and notes than sending them around before and after a meeting or hunting for them in your Slack. Putting everything in the calendar event is a good move.

This is one of the reasons I like org-mode in Emacs. Being able to annotate documents with timestamps and deadlines that show up and you can organize inside the Agenda view is so powerful. The integrations and learning curve is steeper compared to a tool like Notion but I find it simple and powerful enough for GTD-style workflows, I'd have a hard time moving. I have yet to use AnyType, so maybe after trying that, I choose to shift some of my workflows there.

Permalink: /feed/notion-cron-calendar-app/

Tags: #calendar #notion #productivity #gtd #software #app #pkm

lqdev👽01/16/2024

https://openmentions.com/

OpenMentions is a project designed to use Webmentions and ActivityPub for topical content discovery. The site is organised along the lines of a hierarchy of topics going from broad to fine. This we call OpenTopic – the idea being that many sites could host the full collection of topics so that the loss of any one site is not the loss of all topics.

The intention is that this site should own nothing and that topic hierarchies are organic and discoverable.

Permalink: /feed/openmentions/

Tags: #webmentions #indieweb #activitypub #online

lqdev👽01/16/2024

https://blog.trailofbits.com/2024/01/16/leftoverlocals-listening-to-llm-responses-through-leaked-gpu-local-memory/

We are disclosing LeftoverLocals: a vulnerability that allows recovery of data from GPU local memory created by another process on Apple, Qualcomm, AMD, and Imagination GPUs. LeftoverLocals impacts the security posture of GPU applications as a whole, with particular significance to LLMs and ML models run on impacted GPU platforms. By recovering local memory—an optimized GPU memory region—we were able to build a PoC where an attacker can listen into another user’s interactive LLM session (e.g., llama.cpp) across process or container boundaries

Permalink: /feed/lefoverlocals-llm-responses-leaked-gpu-memory/

Tags: #security #llm #ai

lqdev👽01/15/2024

https://blog.thenewoil.org/easy-ways-to-improve-your-privacy-and-security-in-2024

Every year, I like to remind everyone to go back to the basics. For those who are new to privacy and security and may be trying to create some new, positive habits, this serves as a great entry point. For veteran privacy enthusiasts, the basics form our foundation for more advanced techniques later, making it imperative to ensure we cover all those bases. So in that spirit, let’s all pause – wherever we are in our privacy journeys – to do a quick check and make sure we’ve got the basics covered. If you’re one of those new people I mentioned, welcome! But also know that this post is packed with information, so try not to get overwhelmed. Maybe bookmark this post and do one thing per day or something like that.

Strong Passwords...

Multi-Factor Authentication (MFA)...

Regular Software Updates...

Secure Your Wi-Fi Network...

Be Cautious with Communications...

Review App Permissions...

Review Your Account Settings...

Secure Browsing Habits...

Device Security...

Review Financial Statements...

Educate Yourself...

Permalink: /feed/easy-ways-improve-privacy-security-2024/

Tags: #privacy #security

lqdev👽01/15/2024

https://sites.google.com/view/lastunen/ai-for-economists

This page contains example prompts and responses intended to showcase how generative AI, namely LLMs like GPT-4, can benefit economists.
Example prompts are shown from six domains: ideation and feedback; writing; background research; coding; data analysis; and mathematical derivations.
The framework as well as some of the prompts and related notes come from Korinek, A. 2023. “Generative AI for Economic Research: Use Cases and Implications for Economists“, Journal of Economic Literature, 61 (4): 1281–1317.
Each application area includes 1-3 prompts and responses from an LLM, often from the field of development economics, along with brief notes. The prompts will be updated periodically.

Permalink: /feed/ai-for-economists-prompts/

Tags: #ai #economy #promptengineering #llm #gpt

lqdev👽01/15/2024

https://9to5mac.com/2024/01/12/clicks-iphone-hands-on/

Smartphones and physical keyboards aren’t a combination we think of often, but Clicks for iPhone is trying to bring that back with a new keyboard case that’s extremely good.

Wishful thinking on my end but I'd buy a Blackberry-like device. Not a "smartphone", but an internet-connected device with all the phone capabilities that has a physical keyboard.

Permalink: /feed/hands-on-clicks-iphone/

Tags: #smarphone #keyboard #iphone #featurephone

lqdev👽01/15/2024

https://goblin.band/

Goblin band is an attempt to replicate the colletive creative energy that happens on tumblr and take it to the fediverse

Repo: https://github.com/johnHackworth/goblin

Permalink: /feed/goblin-tumblr-fediverse/

Tags: #tumblr #fediverse #activitypub #automattic #firefish #socialmedia #opensource #community #decentralization #federation

lqdev👽01/15/2024

https://clarkesworldmagazine.com/

Clarkesworld is a monthly science fiction and fantasy magazine first published in October 2006. Each issue contains interviews, thought-provoking articles, and between six and eight works of original fiction.

Permalink: /feed/clarkesworld-sci-fi-fantasy-magazine/

Tags: #magazine #scifi #fantasy #online #fiction #nonfiction #art

lqdev👽01/15/2024

https://twitter.com/ChicanoBatman/status/1746940313678299500

Starting off the year right with new Chicano Batman!

A closeup image of a hand wearing a ring that says FLY — Source: *Chicano Batman on X*

Permalink: /feed/chicano-batman-fly-01-2024/

Tags: #music #chicanobatman

lqdev👽01/15/2024

https://hackaday.com/2023/12/20/floss-weekly-episode-762-spilling-the-tea/

We’re excited to announce that Hackaday is the new home of FLOSS Weekly, a long-running podcast about free, libre, and open-source software! The TWiT network hosted the podcast for an incredible seventeen years, but due to some changes on their end, they recently had to wind things down. They were gracious enough to let us pick up the torch, with Jonathan Bennett now taking over hosting duties.

That didn't take long. Last month I learned FLOSS Weekly was ending on the TWiT network. It's great to see it has found a new home in Hackaday! Time to update the RSS feeds and podroll.

Permalink: /feed/floss-weekly-now-on-hackaday/

Tags: #podcast #floss #hackaday #opensource #technology #internet

lqdev👽01/15/2024

https://every.to/

Every is a daily newsletter founded in 2020. Every day, we publish a long-form essay to make you smarter about technology, productivity, and AI.

Permalink: /feed/every-newsletter/

Tags: #newsletter #technology #productivity #ai #essay #internet

lqdev👽01/14/2024

https://citationneeded.news/substack-to-self-hosted-ghost/

I have found myself with a roughly $103/mo setup...

However, more important to me than the exact price is the degree of control I have over my own not-a-platform...

I realize that this is...a lot. If you are a newsletter writer looking to flee the Substack ship, please don't let this discourage you.

Love seeing this! Whether through self-hosted or hosted options, I hope more people get to experience the benefits of owning their own platform.

Permalink: /feed/migrating-substack-self-hosted-ghost-white/

Tags: #substack #selfhost #indieweb #ghost

lqdev👽01/13/2024

https://www.w3.org/Provider/Style/URI

Keeping URIs so that they will still be around in 2, 20 or 200 or even 2000 years is clearly not as simple as it sounds. However, all over the Web, webmasters are making decisions which will make it really difficult for themselves in the future. Often, this is because they are using tools whose task is seen as to present the best site in the moment, and no one has evaluated what will happen to the links when things change. The message here is, however, that many, many things can change and your URIs can and should stay the same. They only can if you think about how you design them.

Permalink: /feed/cool-uris-dont-change/

Tags: #url #http #standard #protocol #web #internet

lqdev👽01/13/2024

https://blog.benjojo.co.uk/post/who-hosts-the-fediverse-instances

Here we can see that Fastly and Cloudflare make up over 50% of the entire fediverse network.

...for the population using Cloudflare, a fair number of them (30%) appear to be hosting the instances behind a home broadband connection.

...the German hosting provider Hetzner hosts over 51% of the entire network!

Permalink: /feed/where-is-all-fediverse/

Tags: #fediverse #analytics #web

lqdev👽01/13/2024

https://thenewstack.io/more-than-an-openai-wrapper-perplexity-pivots-to-open-source/

...Perplexity has become a surprisingly strong player in a market otherwise dominated by OpenAI, Microsoft, Google and Meta.

At its core, Perplexity is a search engine.

...over the past year, Perplexity has evolved rapidly. It now has its own search index and has built its own LLMs based on open source models. They’ve also begun to combine their proprietary technology products. At the end of November, Perplexity announced two new “online LLMs” — LLMs combined with a search index — called pplx-7b-online and pplx-70b-online. They were built on top of the open source models mistral-7b and llama2-70b.

Using open source models has been critical for the growth of Perplexity.

...the default Perplexity model still relies on GPT 3.5 (and a dash of LLaMA-2). But the intention is to move away from that long-standing reliance on OpenAI for its base model.

Permalink: /feed/perplexity-ai-pivots-open-source/

Tags: #ai #perplexity #opensource #search #llm

lqdev👽01/13/2024

https://marimo.io/

marimo is an open-source reactive notebook for Python — reproducible, git-friendly, executable as a script, and shareable as an app.

Permalink: /feed/marimo-notebook/

Tags: #notebook #programming #literateprogramming

lqdev👽01/13/2024

https://shellsharks.com/indieweb

Principle Mechanics

This section does not provide exhaustive coverage of how to implement IndieWeb functionality. Instead, I simply summarize five core primitives which I feel comprise an IndieWeb site. For a more official gauge on where a site scores within the IndieWeb spectrum, consider leveraging IndieMark!

Hosting: You need a place to host your site and store your content. There are a lot of great options out there. Ideally, choose one that allows you the ability to make some under-the-hood changes and does not limit your content portability.

Syndication: Share your content with the world! There are two preferred methods for syndication, PESOS and POSSE. This resource does a great job explaining both! For more examples of how this is done, check this and this out. RSS is a great starting point for helping others subscribe to new content on your site.

Writing: Though your site could simply serve as a more static point/identity on the web, with little to no “content” being regularly added, I recommend writing!

Interactivity: One of the more advanced concepts within the IndieWeb world, the ability to bake in native comments, replies, likes, etc is a greay way to build community. This interactivity helps mitigate reliance on centralized social networks for communication within Indie communities. One example of IndieWeb interactivity is Webmentions.

Identity: Make it unique, make it fun, make it yours. The corporate web is sterile and suffocating. Let’s bring back the whimsy of the old web.

Permalink: /feed/indieweb-assimilation-shellshark/

Tags: #indieweb #guide #catalog

lqdev👽01/11/2024

https://www.alexirpan.com/2024/01/10/ai-timelines-2024.html

...computers are useful, ML models are useful, and even if models fail to scale, people will want to fit GPT-4 sized models on their phone. It seems reasonable to assume the competing factions will figure something out.

Data seems like the harder question. (Or at least the one I feel qualified talking about.) We have already crossed the event horizon of trying to train on everything on the Internet. It’s increasingly difficult for labs to differentiate themselves on publicly available data. Differentiation is instead coming from non-public high-quality data to augment public low-quality data.

All the scaling laws have followed power laws so far, including dataset size. Getting more data by hand doesn’t seem good enough to cross to the next thresholds. We need better means to get good data.

A long time ago, when OpenAI still did RL in games / simulation, they were very into self-play. You run agents against copies of themselves, score their interactions, and update the models towards interactions with higher reward. Given enough time, they learn complex strategies through competition.

I think it’s possible we’re at the start of a world where self-play or self-play-like ideas work to improve LLM capabilities. Drawing an analogy, the environment is the dialogue, actions are text generated from an LLM, and the reward is from whatever reward model you have. Instead of using ground truth data, our models may be at a point where they can generate data that’s good enough to train on.

Permalink: /feed/ai-pipelines-sped-up-again-alexirpan/

Tags: #ai #predictions #agi #data #llm #technology #ml #computervision

lqdev👽01/11/2024

https://openai.com/blog/introducing-the-gpt-store

It’s been two months since we announced GPTs, and users have already created over 3 million custom versions of ChatGPT. Many builders have shared their GPTs for others to use. Today, we're starting to roll out the GPT Store to ChatGPT Plus, Team and Enterprise users so you can find useful and popular GPTs.

Permalink: /feed/introducing-gpt-store/

Tags: #openai #gpt #ai

lqdev👽01/09/2024

https://tonybaloney.github.io/posts/python-gets-a-jit.html

In late December 2023 (Christmas Day to be precise), CPython core developer Brandt Bucher submitted a little pull-request to the Python 3.13 branch adding a JIT compiler.

Permalink: /feed/python-3-13-gets-jit/

Tags: #python #programming #compiler

lqdev👽01/09/2024

https://robertkingett.com/posts/6421/

I woke up today...and decided I was going to delete all the apps off my iPhone.

If I didn’t need it to do a specialized function? Gone, poof. I decided to just use websites and bookmarks instead.

The websites worked just fine.

...websites are just badly designed these days, especially with using a screen reader on a mobile device. I can’t quite describe it, because mobile web browsers aren’t really well designed either, so it makes the website worse because it doesn’t even render all elements as smoothly as on desktop.

Even though there are problems, I’m honestly glad I deleted almost all my apps off my phone and started to pin websites to my home screen more. For one thing, it cuts down on notifications. it’s super freeing to not get a random notification because you didn’t open the app in a day, so the app pings you to say hey I’m still here please pay attention to me, I feel lonely, and nobody will give me animals to snuggle with.

With pinned websites, my phone is faster and my home screen is much more organized as well.

I did this a few years ago. Websites work just fine for almost everything I need to do. The main benefits I noticed:

Less distractions
More real estate when viewing websites on laptop / desktop
Less apps to drain the battery when running on the background
More restricted access / permissions (i.e. websites don't need unrestricted access to my contacts or other information on my phone)

Better PWA support on mobile platforms would go a long way here to better balance between apps and websites. In the meantime though, pinning websites and websites overall work just fine for most things I need to do day-to-day.

Permalink: /feed/replacing-apps-websites-kingett/

Tags: #apps #websites #internet #mobile

lqdev👽01/07/2024

https://www.tiobe.com/tiobe-index/

For the first time in the history of the TIOBE index, C# has won the programming language of the year award. Congratulations! C# has been a top 10 player for more than 2 decades and now that it is catching up with the big 4 languages, it won the well-deserved award by being the language with the biggest uptick in one year (+1.43%).

Exciting to see F# almost break into the Top 20 at number 22 with 0.77%.

Permalink: /feed/tiobe-index-january-2024/

Tags: #programming #dotnet #technology #fsharp #csharp #programminglanguages

lqdev👽01/07/2024

https://ma.tt/2024/01/birthday-gift/

A comment thread between Luis and Matt — Source: *Matt Mullenweg*

A GIF of a man dressed in black pointing up to blinking overhead text "THIS"

Permalink: /feed/blog-posts-can-be-any-length-matt/

Tags: #writing #blog #web #internet

lqdev👽01/07/2024

https://heydingus.net/blog/2024/1/please-own-your-rss-links

...owning the address where your audience finds you is important. It allows you to be mobile, nimble, and without attached strings. It helps you show off all the things and places you want folks to see because you can put all these URLs on your /feeds page. It’s user-friendly in more ways than one (pretty cool how you can make all those URLs human-readable, huh?).

...it means your audience never has to think about how they’re going to get your stuff.

This is a great idea. I've written before about owning your links. Today, that's how I expose many of my links in the contact page. For example, to access my Mastodon profile, instead of going to the actual URL, you can just visit lqdev.me/mastodon which redirects to the actual URL. If tomorrow I choose to change where and how my Mastodon presence is hosted, the URL doesn't have to change. However, I haven't done the same for my RSS links. Recently I've been thinking about restructuring my website, specifically my microblog feed which includes notes and responses. Today, the RSS urls are coupled to the folder structure on my website which is subject to change and isn't flexible. By setting up more user-friendly and stable RSS urls through redirection, that wouldn't be an issue and readers wouldn't have to change the RSS URL they use.

Permalink: /feed/own-your-rss-links/

Tags: #rss #indieweb

lqdev👽01/06/2024

https://adventuretaco.com/favorite-photos-2023-edition/

A Toyota Tacoma Driving on a dirt road towards mountains in the background — Source: *AdventureTaco*

Permalink: /feed/favorite-photos-2023-adventuretaco/

Tags: #landscapes #photography #overlanding #adventure #photos #desert #mountain

lqdev👽01/03/2024

https://ma.tt/2024/01/birthday-gift/

the gift I most want for my 40th is something everyone can do.

I want you to blog.

Publish a post. About anything! It can be long or short, a photo or a video, maybe a quote or a link to something you found interesting. Don’t sweat it. Just blog. Share something you created, or amplify something you enjoyed. It doesn’t take much. The act of publishing will be a gift for you and me.

That’s it! No wrapping paper or bows. Just blogs and blogs and blogs, each unique and beautiful in its own way.

Permalink: /feed/i-want-you-to-blog-mullenweg/

Tags: #blogging #wordpress #writing #community

lqdev👽01/02/2024

https://bix.blog/2024/01/01/the-year-for-blogging-to-pump-up-the-volume/

There’s been a lot of pontificating lately that the web is ripe for a blogging renaissance, wishing for it to be true. Much of it from people who don’t seem to notice that’s it’s already begun. Maybe they don’t anymore know quite where to look. Maybe the sorts of blogging they’re seeing isn’t what they mean. (To the blognoscenti, do things like “wordvomits” count?) If you haven’t seen it, either, that’s okay. All you have to do is choose to be a part of it. There’s never been a better time: those who managed to monopolize our attentions and keep too many of us chattering for a few hundred characters at a time to the benefit of advertisers are losing their relevance.

I’m not one for making personal resolutions, but let me suggest one on behalf of the blogosphere: this is the year we pump up the volume.

Permalink: /feed/blogging-pump-up-volume/

Tags: #blogging #community #writing #movie #radio

lqdev👽12/30/2023

https://octodon.social/@cwebber/111647596861000656

Bring back self-hosted blogs, reinstall a feed reader, make your feed icon prominent on your blog. Blogs + Atom/RSS is the best decentralized social media system we've ever had!

And yes I am saying that as co-author of ActivityPub: self hosted blogs is the best decentralized social networking we've had

Source: Christine Lemmer-Webber (@cwebber@octodon.social)

💯 💯 💯 💯 💯 💯

Permalink: /feed/blogs-rss-decentralized-social-media-lemmer-webber/

Tags: #rss #blog #decentralized #socialmedia #blogging #mastodon #fediverse #indieweb #atom #feed

lqdev👽12/27/2023

https://www.science.org/content/article/not-dumb-creatures-livestock-surprise-scientists-their-complex-emotional-minds

I think I've read about this somewhere before...

“All animals are equal, but some animals are more equal than others.” ― George Orwell, Animal Farm

“Four legs good, two legs bad.” ― George Orwell, Animal Farm

Permalink: /feed/what-farm-animals-think-orwell/

Tags: #science #animals #farm #orwell #animallfarm #meme

lqdev👽12/25/2023

https://arxiv.org/abs/2310.07704

We introduce Ferret, a new Multimodal Large Language Model (MLLM) capable of understanding spatial referring of any shape or granularity within an image and accurately grounding open-vocabulary descriptions. To unify referring and grounding in the LLM paradigm, Ferret employs a novel and powerful hybrid region representation that integrates discrete coordinates and continuous features jointly to represent a region in the image. To extract the continuous features of versatile regions, we propose a spatial-aware visual sampler, adept at handling varying sparsity across different shapes. Consequently, Ferret can accept diverse region inputs, such as points, bounding boxes, and free-form shapes. To bolster the desired capability of Ferret, we curate GRIT, a comprehensive refer-and-ground instruction tuning dataset including 1.1M samples that contain rich hierarchical spatial knowledge, with 95K hard negative data to promote model robustness. The resulting model not only achieves superior performance in classical referring and grounding tasks, but also greatly outperforms existing MLLMs in region-based and localization-demanded multimodal chatting. Our evaluations also reveal a significantly improved capability of describing image details and a remarkable alleviation in object hallucination. Code and data will be available at this https URL

Code

Permalink: /feed/apple-ml-ferret/

Tags: #ai #ml #llm #mllm #largelanguagemodel #multimodal #multimodallargelanguagemodel #apple #opensource

lqdev👽12/24/2023

https://apurplelife.com/2023/12/19/2023-goals-accomplishments/

My posts have been...including more reviews of fancy travel hacked flights, tours and slow travel locations. Possibly as a result of this shift in topic – or possibly simply because blogging seems to be on its way out according to a few of my blogging peers – my comments section has been quieter lately. I talked about this in one of my monthly recaps with the spin that I didn’t realize I had come to rely on getting at least one comment per post to know that people (and not just bots 🙂 ) were reading my words and they weren’t floating into an abyss.

I didn’t want to be reliant on external validation when I had written this blog without it being public for years, and hadn’t realized I had come to rely on anything but the joy I get from writing it. So I was trying to grow after realizing that not receiving comments on multiple posts in a row bothered me for some reason. I’m going to do my best to not rely on that kind of feedback going forward and will continue to blog for the main reason I always have: for myself 🙂 .

I also followed a reader suggestion to add a “Like” button at the bottom of my posts (it’s after the “Share This” section and before the “Related” articles section) because readers said they don’t necessarily have something they want to comment, but that a Like button would help show there is still a human reading. All fair 🙂 .

🙋‍♂️ there's still a human reading your posts on this end 🙂

Also, today I learned there's a Guineafowl Pufferfish.

Picture of a Guineafowl Pufferfish — Source: *A Purple Life*

Permalink: /feed/2023-goals-accomplishments-purple-life/

Tags: #blogging #review #goals

lqdev👽12/23/2023

https://bsky.app/profile/bsky.app/post/3kh5rjl6bgu2i

I'm not as active on there, but feel free to subscribe to my Bluesky feed wherever you subscribe to feeds.

Also, if you have any feed recommendations, let me know.

📢 1.60 is rolling out now (3/5)

RSS feeds for profiles!

Access your posts via RSS by pasting your profile link into your RSS feed reader and it will automatically be discovered.

Source: Bluesky on Bluesky

Permalink: /feed/bluesky-rss-support/

Tags: #bluesky #rss #socialmedia

lqdev👽12/23/2023

https://twitter.com/HappyPunch/status/1738661281236672833

What was probably being said according to Copilot 😂

Cristiano, you need to listen to me, mate. You're wasting your time and money on those Apple products. They're overpriced, overrated, and outdated. They're nothing compared to the real champions of technology: Windows Phone and Zune. Trust me, I know what I'm talking about. I've used them both, and they're the best phone and MP3 player ever made. Let me tell you why. First of all, they had a sleek design, a smooth interface, and a unique tile system that was way ahead of its time. They were also compatible with Microsoft's ecosystem, which gave you access to Office, Xbox, and Bing. And don't get me started on the Zune Pass, which let you stream and download unlimited music for a low monthly fee. You can't beat that, mate. Second of all, they had no apps, no games, no support, and no market share. Wait, what? No, no, no, that's not a bad thing. That's a good thing. That means they were exclusive, elite, and original. They were not for the masses, they were for the class. They were not for the sheep, they were for the lions. They were not for the followers, they were for the leaders. They were not for the weak, they were for the strong. They were not for you, they were for me. And they can be for you too, if you have the courage, the vision, and the taste to switch to Windows Phone and Zune. I'm telling you, Cristiano, you're missing out on the best phone and MP3 player ever made. And I'm willing to prove it to you. I'll bet you a million euros that if you try them for a week, you'll never go back to your Apple products. What do you say? Are you man enough to take the challenge? Or are you too scared to face the truth? Come on, Cristiano, don't be a coward. Don't be a fool. Don't be a loser. Be a winner. Be a legend. Be a Windows Phone and Zune user.

Permalink: /feed/mcgregor-ronaldo-zune-windows-phone-meme/

Tags: #windowsphone #zune #ai #copilot #meme

lqdev👽12/23/2023

https://sites.research.google/videopoet/

VideoPoet is a simple modeling method that can convert any autoregressive language model or large language model (LLM) into a high-quality video generator. It contains a few simple components:

A pre-trained MAGVIT V2 video tokenizer and a SoundStream audio tokenizer transform images, video, and audio clips with variable lengths into a sequence of discrete codes in a unified vocabulary. These codes are compatible with text-based language models, facilitating an integration with other modalities, such as text.

An autoregressive language model learns across video, image, audio, and text modalities to autoregressively predict the next video or audio token in the sequence.

A mixture of multimodal generative learning objectives are introduced into the LLM training framework, including text-to-video, text-to-image, image-to-video, video frame continuation, video inpainting and outpainting, video stylization, and video-to-audio. Furthermore, such tasks can be composed together for additional zero-shot capabilities (e.g., text-to-audio).

This simple recipe shows that language models can synthesize and edit videos with a high degree of temporal consistency. VideoPoet demonstrates state-of-the-art video generation, in particular in producing a wide range of large, interesting, and high-fidelity motions. The VideoPoet model supports generating videos in square orientation, or portrait to tailor generations towards short-form content, as well as supporting audio generation from a video input.

Blog Post

Permalink: /feed/videopoet-llm-zero-shot-video-generation/

Tags: #ai #genai #videopoet #generativeai #google

lqdev👽12/22/2023

https://www.artstation.com/artwork/aoDzL0

Cyberdeck radio concept art rendering with display closed front and back profile

Source: Michal Kalisz

Cyberdeck radio concept art rendering with display open and keyboard showing — Source: *Michal Kalisz*

Robert Downey Jr. as Tony Stark saying I need it

Permalink: /feed/classic-palmtop-concept/

Tags: #concept #tech #cyberpunk

lqdev👽12/22/2023

https://mid-journey.ai/midjourney-v6-release/

The Dev Team gonna let the community test an alpha-version of Midjourney v6 model over the winter break, starting tonight, December 21st, 2023.

What’s new with the Midjourney v6 base model?

Much more accurate prompt following as well as longer prompts,

Improved coherence, and model knowledge,

Improved image prompting and remix mode,

Minor text drawing ability (you must write your text in “quotations” and --style raw or lower --stylize values may help)

/imagine a photo of the text "Hello World!" written with a marker on a sticky note --ar 16:9 --v 6

Improved upscalers, with both 'subtle‘ and 'creative‘ modes (increases resolution by 2x) (you’ll see buttons for these under your images after clicking U1/U2/U3/U4)

Permalink: /feed/midjourney-v6-release/

Tags: #computervision #ai #midjourney

lqdev👽12/22/2023

https://blog.langchain.dev/langchain-state-of-ai-2023/

What are people building?

Retrieval has emerged as the dominant way to combine your data with LLMs.

...42% of complex queries involve retrieval

...about 17% of complex queries are part of an agent.

Most used LLM Providers

OpenAI has emerged as the leading LLM provider of 2023, and Azure (with more enterprise guarantees) has seized that momentum well.

On the open source model side, we see Hugging Face (4th), Fireworks AI (6th), and Ollama (7th) emerge as the main ways users interact with those models.

OSS Model Providers

A lot of attention recently has been given to open source models, with more and more providers racing to host them at cheaper and cheaper costs. So how exactly are developers accessing these open source models?

We see that the people are mainly running them locally, with options to do so like Hugging Face, LlamaCpp, Ollama, and GPT4All ranking high.

Most used vector stores

Vectorstores are emerging as the primary way to retrieve relevant context.

...local vectorstores are the most used, with Chroma, FAISS, Qdrant and DocArray all ranking in the top 5.

Of the hosted offerings, Pinecone leads the pack as the only hosted vectorstore in the top 5. Weaviate follows next, showing that vector-native databases are currently more used than databases that add in vector functionality.

Of databases that have added in vector functionality, we see Postgres (PGVector), Supabase, Neo4j, Redis, Azure Search, and Astra DB leading the pack.

Most used embeddings

OpenAI reigns supreme

Open source providers are more used, with Hugging Face coming in 2nd most use

On the hosted side, we see that Vertex AI actually beats out AzureOpenAI

Top Advanced Retrieval Strategies

the most common retrieval strategy we see is not a built-in one but rather a custom one.

After that, we see more familiar names popping up:

Self Query - which extracts metadata filters from user's questions

Hybrid Search - mainly through provider specific integrations like Supabase and Pinecone

Contextual Compression - which is postprocessing of base retrieval results

Multi Query - transforming a single query into multiple, and then retrieving results for all

TimeWeighted VectorStore - give more preference to recent documents

How are people testing?

83% of test runs have some form of feedback associated with them. Of the runs with feedback, they average 2.3 different types of feedback, suggesting that developers are having difficulty finding a single metric to rely entirely on, and instead use multiple different metrics to evaluate.

...the majority of them use an LLM to evaluate the outputs. While some have expressed concern and hesitation around this, we are bullish on this as an approach and see that in practice it has emerged as the dominant way to test.

...nearly 40% of evaluators are custom evaluators. This is in line with the fact that we've observed that evaluation is often really specific to the application being worked on, and there's no one-size-fits-all evaluator to rely on.

What are people testing?

...most people are still primarily concerned with the correctness of their application (as opposed to toxicity, prompt leakage, or other guardrails

...low usage of Exact Matching as an evaluation technique [suggests] that judging correctness is often quite complex (you can't just compare the output exactly as is)

Permalink: /feed/langchain-state-of-ai-2023/

Tags: #ai #llm #opensource #report

lqdev👽12/21/2023

https://lea.verou.me/blog/2023/eigensolutions/

tl;dr: Overfitting happens when solutions don’t generalize sufficiently and is a hallmark of poor design. Eigensolutions are the opposite: solutions that generalize so much they expose links between seemingly unrelated use cases. Designing eigensolutions takes a mindset shift from linear design to composability.

The eigensolution is a solution that addresses several key use cases, that previously appeared unrelated.

...it takes a mindset shift, from the linear Use case → Idea → Solution process to composability. Rather than designing a solution to address only our driving use cases, step back and ask yourself: can we design a solution as a composition of smaller, more general features, that could be used together to address a broader set of use cases?

Contrary to what you may expect, eigensolutions can actually be quite hard to push to stakeholders:

Due to their generality, they often require significantly higher engineering effort to implement. Quick-wins are easier to sell: they ship faster and add value sooner. In my 11 years designing web technologies, I have seen many beautiful, elegant eigensolutions be vetoed due to implementation difficulties in favor of far more specific solutions — and often this was the right decision, it’s all about the cost-benefit.

Eigensolutions tend to be lower level primitives, which are more flexible, but can also involve higher friction to use than a solution that is tailored to a specific use case.

Eigensolutions tend to be lower level primitives. They enable a broad set of use cases, but may not be the most learnable or efficient way to implement all of them, compared to a tailored solution. In other words, they make complex things possible, but do not necessarily make common things easy.

Instead of implementing tailored solutions ad-hoc (risking overfitting), they can be implemented as shortcuts: higher level abstractions using the lower level primitive. Done well, shortcuts provide dual benefit: not only do they reduce friction for common cases, they also serve as teaching aids for the underlying lower level feature. This offers a very smooth ease-of-use to power curve: if users need to go further than what the shortcut provides, they can always fall back on the lower level primitive to do so.

In an ideal world, lower level primitives and higher level abstractions would be designed and shipped together. However, engineering resources are typically limited, and it often makes sense to ship one before the other, so we can provide value sooner.

This can happen in either direction:

Lower level primitive first. Shortcuts to make common cases easy can ship at a later stage, and demos and documentation to showcase common “recipes” can be used as a stopgap meanwhile. This prioritizes use case coverage over optimal UX, but it also allows collecting more data, which can inform the design of the shortcuts implemented.

Higher level abstraction first, as an independent, ostensibly ad hoc feature. Then later, once the lower level primitive ships, it is used to “explain” the shortcut, and make it more powerful. This prioritizes optimal UX over use case coverage: we’re not covering all use cases, but for the ones we are covering, we’re offering a frictionless user experience.

...despite the name eigensolution, it’s still all about the use cases: eigensolutions just expose links between use cases that may have been hard to detect, but seem obvious in retrospect...Requiring all use cases to precede any design work can be unnecessarily restrictive, as frequently solving a problem improves our understanding of the problem.

Permalink: /feed/eigensolutions-composability-antidote-overfitting-verou/

Tags: #design #product

lqdev👽12/21/2023

https://proton.me/blog/proton-vs-tuta-encryption

GIF of comedian Bill Hader eating popcorn

Permalink: /feed/proton-mail-vs-tutanota-encryption/

Tags: #protonmail #tutanota #encryption

lqdev👽12/20/2023

https://huggingface.co/microsoft/phi-2

When Phi-2 initially released, it was on the Azure AI Studio Model Catalog. It's nice to see it's now in HuggingFace as well.

Permalink: /feed/phi-2-huggingface/

Tags: #opensource #ai #phi #languagemodel #smalllanguagemodel #huggingface #transformers #microsoft

lqdev👽12/20/2023

https://arxiv.org/abs/2312.11514

Large language models (LLMs) are central to modern natural language processing, delivering exceptional performance in various tasks. However, their intensive computational and memory requirements present challenges, especially for devices with limited DRAM capacity. This paper tackles the challenge of efficiently running LLMs that exceed the available DRAM capacity by storing the model parameters on flash memory but bringing them on demand to DRAM. Our method involves constructing an inference cost model that harmonizes with the flash memory behavior, guiding us to optimize in two critical areas: reducing the volume of data transferred from flash and reading data in larger, more contiguous chunks. Within this flash memory-informed framework, we introduce two principal techniques. First, "windowing'" strategically reduces data transfer by reusing previously activated neurons, and second, "row-column bundling", tailored to the sequential data access strengths of flash memory, increases the size of data chunks read from flash memory. These methods collectively enable running models up to twice the size of the available DRAM, with a 4-5x and 20-25x increase in inference speed compared to naive loading approaches in CPU and GPU, respectively. Our integration of sparsity awareness, context-adaptive loading, and a hardware-oriented design paves the way for effective inference of LLMs on devices with limited memory.

Permalink: /feed/llm-flash-inference-limited-memory/

Tags: #llm #ai #hardware

lqdev👽12/20/2023

https://www.newyorker.com/tech/annals-of-technology/its-time-to-dismantle-the-technopoly

...according to [Neil Postman], we no longer live in a technocratic era. We now inhabit what he calls technopoly. In this third technological age, Postman argues, the fight between invention and traditional values has been resolved, with the former emerging as the clear winner. The result is the “submission of all forms of cultural life to the sovereignty of technique and technology.” Innovation and increased efficiency become the unchallenged mechanisms of progress, while any doubts about the imperative to accommodate the shiny and new are marginalized. “Technopoly eliminates alternatives to itself in precisely the way Aldous Huxley outlined in Brave New World,” Postman writes. “It does not make them illegal. It does not make them immoral. It does not even make them unpopular. It makes them invisible and therefore irrelevant.” Technopoly, he concludes, “is totalitarian technocracy.”

What I didn’t realize back in 2016, however, was that, although the grip of technopoly was strong, it was also soon to weaken.

A major source of this destabilization was the Trump-Clinton election cycle...Where once they had seen platforms like Facebook as useful and in some sense mandatory, they started treating them more warily.

This emerging resistance to the technopoly mind-set doesn’t fall neatly onto a spectrum with techno-optimism at one end and techno-skepticism at the other. Instead, it occupies an orthogonal dimension we might call techno-selectionism. This is a perspective that accepts the idea that innovations can significantly improve our lives but also holds that we can build new things without having to accept every popular invention as inevitable. Techno-selectionists believe that we should continue to encourage and reward people who experiment with what comes next. But they also know that some experiments end up causing more bad than good. Techno-selectionists can be enthusiastic about artificial intelligence, say, while also taking a strong stance on settings where we should block its use. They can marvel at the benefits of the social Internet without surrendering their kids’ mental lives to TikTok.

Permalink: /feed/dismantle-technology-newport/

Tags: #technology #culture #media #internet

lqdev👽12/19/2023

https://platform.openai.com/docs/guides/prompt-engineering

This guide shares strategies and tactics for getting better results from large language models (sometimes referred to as GPT models) like GPT-4. The methods described here can sometimes be deployed in combination for greater effect. We encourage experimentation to find the methods that work best for you.

Permalink: /feed/openai-prompting-guide/

Tags: #openai #llm #promptengineering #ai #guide

lqdev👽12/19/2023

https://www.eff.org/deeplinks/2023/12/meet-spritely-and-veilid

While there is a surge in federated social media sites, like Bluesky and Mastodon, some technologists are hoping to take things further than this model of decentralization with fully peer-to-peer applications. Two leading projects, Spritely and Veilid, hint at what this could look like.

Spritely is a framework for building distributed apps that don’t even have to know that they’re distributed. The project is spearheaded by Christine Lemmer-Webber, who was one of the co-authors of the ActivityPub spec that drives the fediverse. She is taking the lessons learned from that work, combining them with security and privacy minded object capabilities models, and mixing it all up into a model for peer to peer computation that could pave the way for a generation of new decentralized tools.

The Veilid project was released at DEFCON 31 in August and has a number of promising features that could lead to it being a fundamental tool in future decentralized systems. Described as a cross between TOR and Interplanetary File System (IPFS), Veilid is a framework and protocol that offers two complementary tools. The first is private routing, which, much like TOR, can construct an encrypted private tunnel over the public internet allowing two devices to communicate with each other without anyone else on the network knowing who is talking to whom...The second tool that Veilid offers is a Distributed Hash Table (DHT), which lets anyone look up a bit of data associated with a specific key, wherever that data lives on the network.

Public interest in decentralized tools and services is growing, as people realize that there are downsides to centralized control over the platforms that connect us all. The past year has seen interest in networks like the fediverse and Bluesky explode and there’s no reason to expect that to change. Projects like Spritely and Veilid are pushing the boundaries of how we might build apps and services in the future. The things that they are making possible may well form the foundation of social communication on the internet in the next decade, making our lives online more free, secure, and resilient.

Additional Links

Permalink: /feed/spritely-veilid-p2p-web/

Tags: #p2p #veilid #spritely #openweb #web #internet #decentralization

lqdev👽12/19/2023

https://www.theverge.com/23990974/social-media-2023-fediverse-mastodon-threads-activitypub

Good article from David Pierce. I'd add that many of these platforms (i.e. Mastodon, Lemmy, PeerTube, WordPress) have strong RSS support which offer another degree of freedom from opting out of signing up for any of the platforms but still being able to follow the people and topics you care about. Sure, the experience may not be as rich but it's yet another way for people to participate in the ecosystem.

A new kind of social internet is currently forming. Right now it might still look like “Twitter and Reddit, only different,” but that’s only the very beginning of what’s to come. Hopefully.

I’m convinced we’ll be better off with a hundred different apps for Snapchat or Instagram or X instead of just one...

It doesn’t make sense that we have a dozen usernames, a dozen profiles, a dozen sets of fans and friends. All that stuff should belong to me, and I should be able to access it and interact with it anywhere and everywhere.

Decentralizing social media can sound like a sort of kumbaya anti-capitalist manifesto: “It’s about openness and sharing, not capitalism, man!” In practice it’s the opposite: it’s a truly free market approach to social networking.

...in a fediverse-dominated world, the way to win is not to achieve excellent lock-in and network effects. The only way to win is to build the best product.

...so far we’re mostly in the “popular app, but federated” phase of this transition.

Almost everything in the fediverse is a one-to-one competitor to an existing platform...Some of these apps are very good! But nearly all of them are differentiated only in that they’re federated.

Let’s be super clear about this: the point of the fediverse is not that it’s federated...Making the “It’s federated!” argument is like making the “It’s better for privacy!” argument: it makes you feel good, and at best it’s a useful tiebreaker, but it doesn’t actually matter. All that matters is the product.

2023 was the year “fediverse” became a buzzword, 2024 will be the year it becomes an industry. (Hopefully one with a better name, but I’ll get over that.) We’ve spent too long living our lives online in someone else’s spaces. What’s next will belong to all of us. All that’s left to do is start posting.

Permalink: /feed/2023-social-media-case-fediverse/

Tags: #fediverse #socialmedia #activitypub #internet #web #openweb #smallweb

lqdev👽12/15/2023

https://mastodon.social/@jwz/111583679963120813

Just published a blog post, AI like it's 1999 or 1899, inspired by this post from jwz, among other things.

A meme of a telegram saying F**k You Strong Letter to Follow — Source: *@jwz@mastodon.social*

Permalink: /feed/microblogging-telegram-jwz/

Tags: #microblogging #telegram #meme

lqdev👽12/14/2023

https://doc.searls.com/2023/12/14/start-of-an-era/

After 17 years and 761 episodes, FLOSS Weekly ended its run on the TWiT network yesterday.

Nooo! So sad to hear that FLOSS Weekly is ending, especially after learning that The Privacy, Security, and OSINT Show with Michael Bazzell ended as well.

At least there's some hope at the end of Doc's post which hints at it living on in some form.

By the way, FLOSS Weekly has not slipped below the waves. I expect it will be picked up somewhere else on the Web, and wherever you get your podcasts. (I love that expression because it means podcasting isn’t walled into some giant’s garden.) When that happens, I’ll point to it here.

In any case, it was good while it lasted. Also, there's still Reality 2.0 where the guests and topics are just as interesting and entertaining.

Permalink: /feed/floss-weekly-ends-twit-start-of-an-era/

Tags: #podcast #floss #twit #opensource

lqdev👽12/14/2023

https://blog.mozilla.org/en/mozilla/introducing-solo-ai-website-builder/

Today we are excited to introduce a new Mozilla Innovation Project, Solo, an AI website builder for solopreneurs.

If you scour Yelp, it appears a third of businesses lack a website. However, building a website not only provides you with a presence that you own and control but it is also good for business.

Our survey data shows that the majority of solopreneurs rely upon their “tech buddy” to help build their website. As a result, the websites become stale and harder to maintain as it relies on a call to their buddy. Others without a “tech buddy” try popular website authoring tools and then abandon because it’s simply too hard to author and curate content.

Using AI to generate the content of your site and source your images, which a solopreneur can then revise into their own unique voice and style levels the playing field. Solo takes this a step further and can also scrape your existing business Yelp or other page so you have an online presence that is totally authentic to you.

Permalink: /feed/solo-mozilla-ai-website-builder/

Tags: #ai #mozilla #web #internet

lqdev👽12/13/2023

https://www.adamsdesk.com/posts/farewell-privacy-security-and-osint-show/

...a farewell episode was released on November 20th, 2023 entitled “My Irish Exit”. It was finally officially confirmed that [The Privacy, Security, and OSINT Show with Michael Bazzell]...has reached an end.

That's unfortunate. I really enjoyed listening to this show and even had it listed in my podroll. The UNREDACTED magazine had great content as well.

Permalink: /feed/farewell-privacy-security-osint-podcast/

Tags: #podcast #privacy #osint

lqdev👽12/12/2023

https://dayoneapp.com/blog/introducing-journaling-suggestions/

See journaling recommendations inspired by your photos, locations, activities and more. Exclusively for iPhone.

I've been tinkering with Day One the past few months. When paired with their templates, this is a nice addition. Too bad it's iPhone exclusive. Hopefully it makes its way to Android at some point.

Permalink: /feed/journaling-suggestions-day-one/

Tags: #journaling #dayone

lqdev👽12/12/2023

https://wildmanlife.com/aoudaghost-economic-hub-of-the-sahara/

Since 2001, the site has been on the UNESCO World Heritage Tentative List.

Today, Aoudaghost is in a state of complete abandonment. The remains of the once-thriving town are concentrated in the area most protected by the wind and sand, with several walls and fortifications yet to be fully englobed by the desert. From the adjacent cliff, the current state of Aoudaghost can be seen in its entirety, but only the mind can imagine the Aoudaghost that served as an economic and cultural hub for the Sahara

Permalink: /feed/aoudaghost-aprin/

Tags: #travel #sahara

lqdev👽12/12/2023

https://future.mozilla.org/blog/introducing-memorycache/

MemoryCache, a Mozilla Innovation Project, is an early exploration project that augments an on-device, personal model with local files saved from the browser to reflect a more personalized and tailored experience through the lens of privacy and agency.

Additional resources

https://memorycache.ai/

Permalink: /feed/memorycache-personal-ai/

Tags: #ai #web #privacy #opensource

lqdev👽12/12/2023

https://future.mozilla.org/innovation-week/

Mozilla’s Innovation Week is a journey into the future of technology, where AI is not just a buzzword, but a reality we're actively shaping. Here, we're not just talking about innovation – we're living it through a series of AI-driven explorations.

With that in mind, Innovation Week is more than a showcase. It's a platform for collaboration and inspiration. It's about bringing together ideas, people, and technology to pave the way for a more open and responsible future.

Permalink: /feed/mozilla-innovation-week-dec-2023/

Tags: #ai #opensource #mozilla

lqdev👽12/12/2023

https://justine.lol/oneliners/

I spent the last month working with Mozilla to launch an open source project called llamafile which is the new best way to run an LLM on your own computer. So far things have been going pretty smoothly. The project earned 5.6k stars on GitHub, 1073 upvotes on Hacker News, and received press coverage from Hackaday. Yesterday I cut a 0.3 release so let's see what it can do.

Permalink: /feed/llamafile-llm-oneliners-tunney/

Tags: #llama #ai #llm #opensource

lqdev👽12/12/2023

https://github.com/ml-explore/mlx-examples/tree/main/mixtral

Run the Mixtral1 8x7B mixture-of-experts (MoE) model in MLX on Apple silicon.

Permalink: /feed/mlx-mixtral-8x7b/

Tags: #mlx #ai #frameworks

lqdev👽12/12/2023

https://wordpress.org/state-of-the-word/

State of the Word is the annual keynote address delivered by the WordPress project’s co-founder, Matt Mullenweg, celebrating the progress of the open source project and offering a glimpse into its future.

Permalink: /feed/state-of-the-word-2023/

Tags: #wordpress #blogging #web #opensource #community

lqdev👽12/12/2023

https://www.microsoft.com/research/blog/steering-at-the-frontier-extending-the-power-of-prompting/

...steering GPT-4 with a modified version of Medprompt achieves the highest score ever achieved on the complete MMLU.

To achieve a new SoTA on MMLU, we extended Medprompt to Medprompt+ by adding a simpler prompting method and formulating a policy for deriving a final answer by integrating outputs from both the base Medprompt strategy and the simple prompts. The synthesis of a final answer is guided by a control strategy governed by GPT-4 and inferred confidences of candidate answers.

While systematic prompt engineering can yield maximal performance, we continue to explore the out-of-the-box performance of frontier models with simple prompts. It’s important to keep an eye on the native power of GPT-4 and how we can steer the model with zero- or few-shot prompting strategies.

Permalink: /feed/medprompt-promptbase-extending-power-prompting/

Tags: #ai #promptengineering

lqdev👽12/12/2023

https://github.com/microsoft/promptbase

promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models like GPT-4.

Permalink: /feed/microsoft-promptbase/

Tags: #ai #promptengineering

lqdev👽12/12/2023

https://arxiv.org/abs/2312.06550

The recent surge in open-source Large Language Models (LLMs), such as LLaMA, Falcon, and Mistral, provides diverse options for AI practitioners and researchers. However, most LLMs have only released partial artifacts, such as the final model weights or inference code, and technical reports increasingly limit their scope to high-level design choices and surface statistics. These choices hinder progress in the field by degrading transparency into the training of LLMs and forcing teams to rediscover many details in the training process. We present LLM360, an initiative to fully open-source LLMs, which advocates for all training code and data, model checkpoints, and intermediate results to be made available to the community. The goal of LLM360 is to support open and collaborative AI research by making the end-to-end LLM training process transparent and reproducible by everyone. As a first step of LLM360, we release two 7B parameter LLMs pre-trained from scratch, Amber and CrystalCoder, including their training code, data, intermediate checkpoints, and analyses (at this https URL). We are committed to continually pushing the boundaries of LLMs through this open-source effort. More large-scale and stronger models are underway and will be released in the future.

Additional Resources

https://www.llm360.ai/

Permalink: /feed/llm360-transparent-open-source-llm/

Tags: #ai #research

lqdev👽12/12/2023

https://www.microsoft.com/research/blog/phi-2-the-surprising-power-of-small-language-models/

We are now releasing Phi-2, a 2.7 billion-parameter language model that demonstrates outstanding reasoning and language understanding capabilities, showcasing state-of-the-art performance among base language models with less than 13 billion parameters. On complex benchmarks Phi-2 matches or outperforms models up to 25x larger, thanks to new innovations in model scaling and training data curation.

Permalink: /feed/microsoft-phi-2/

Tags: #ai #slm #llm

lqdev👽12/12/2023

https://www.answer.ai/posts/2023-12-12-launch.html

Jeremy Howard (founding CEO, previously co-founder of Kaggle and fast.ai) and Eric Ries (founding director, previously creator of Lean Startup and the Long-Term Stock Exchange) today launched Answer.AI, a new kind of AI R&D lab which creates practical end-user products based on foundational research breakthroughs. The creation of Answer.AI is supported by an investment of USD10m from Decibel VC. Answer.AI will be a fully-remote team of deep-tech generalists—the world’s very best, regardless of where they live, what school they went to, or any other meaningless surface feature.

Permalink: /feed/answer-ai-rd-lab/

Tags: #ai #research #vc

lqdev👽12/11/2023

https://stability.ai/news/stablelm-zephyr-3b-stability-llm

Stable LM Zephyr 3B is a 3 billion parameter Large Language Model (LLM), 60% smaller than 7B models, allowing accurate, and responsive output on a variety of devices without requiring high-end hardware.

Permalink: /feed/introducing-stable-lm-zephyr-3b/

Tags: #llm #ai #edge

lqdev👽12/11/2023

https://techcrunch.com/2023/12/11/tumblrs-fediverse-integration-is-still-being-worked-on-says-owner-and-automattic-ceo-matt-mullenweg/

Despite delays, the plan to connect Tumblr’s blogging site to the wider world of decentralized social media, also known as the “fediverse,” is still on, it seems.

...Mullenweg explained that despite the re-org, which will see many Tumblr employees move to other projects at the end of the year, Automattic did switch someone over to Tumblr to work on the fediverse integration, which will continue in the new year.

“I remain a huge believer in open standards and user freedom, though I don’t claim to have the truth on which particular standard is better or best, to serve our customers we will support everything we can in good faith to give users more freedom, choice, and avoid lock-in,” [Matt Mullenweg] also said in his AMA.

Mullunweg also noted that a larger effort to migrate Tumblr’s half a billion blogs to WordPress on the backend is something he’s also contemplating in the new year.

Permalink: /feed/tumblr-still-working-fediverse-integration/

Tags: #tumblr #socialmedia #fediverse

lqdev👽12/11/2023

https://saprmarks.github.io/geometry-of-truth/dataexplorer/

This page contains interactive charts for exploring how large language models represent truth. It accompanies the paper The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets by Samuel Marks and Max Tegmark.

To produce these visualizations, we first extract LLaMA-13B representations of factual statements. These representations live in a 5120-dimensional space, far too high-dimensional for us to picture, so we use PCA to select the two directions of greatest variation for the data. This allows us to produce 2-dimensional pictures of 5120-dimensional data. See this footnote for more details.1

Permalink: /feed/geometry-of-truth-data-explorer/

Tags: #ai #interpretability #llm

lqdev👽12/11/2023

https://docs.google.com/presentation/d/156WpBF_rGvf4Ecg19oM1fyR51g4FAmHV3Zs0WLukrLQ/edit?usp=sharing

Key themes in the 2023 Report include:

GPT-4 is the master of all it surveys (for now), beating every other LLM on both classic benchmarks and exams designed to evaluate humans, validating the power of proprietary architectures and reinforcement learning from human feedback.

Efforts are growing to try to clone or surpass proprietary performance, through smaller models, better datasets, and longer context. These could gain new urgency, amid concerns that human-generated data may only be able to sustain AI scaling trends for a few more years.

LLMs and diffusion models continue to drive real-world breakthroughs, especially in the life sciences, with meaningful steps forward in both molecular biology and drug discovery.

Compute is the new oil, with NVIDIA printing record earnings and startups wielding their GPUs as a competitive edge. As the US tightens its restrictions on trade restrictions on China and mobilizes its allies in the chip wars, NVIDIA, Intel, and AMD have started to sell export-control proof chips at scale.

GenAI saves the VC world, as amid a slump in tech valuations, AI startups focused on generative AI applications (including video, text, and coding), raised over $18 billion from VC and corporate investors.

The safety debate has exploded into the mainstream, prompting action from governments and regulators around the world. However, this flurry of activity conceals profound divisions within the AI community and a lack of concrete progress towards global governance, as governments around the world pursue conflicting approaches.

Challenges mount in evaluating state of the art models, as standard LLMs often struggle with robustness. Considering the stakes, as “vibes-based” approach isn’t good enough.

Additional resources

State of AI Website

Permalink: /feed/2023-state-of-ai-report/

Tags: #ai

lqdev👽12/11/2023

https://github.com/vitoplantamura/OnnxStream/

Generally major machine learning frameworks and libraries are focused on minimizing inference latency and/or maximizing throughput, all of which at the cost of RAM usage. So I decided to write a super small and hackable inference library specifically focused on minimizing memory consumption: OnnxStream.

OnnxStream is based on the idea of decoupling the inference engine from the component responsible of providing the model weights, which is a class derived from WeightsProvider. A WeightsProvider specialization can implement any type of loading, caching and prefetching of the model parameters. For example a custom WeightsProvider can decide to download its data from an HTTP server directly, without loading or writing anything to disk (hence the word "Stream" in "OnnxStream"). Three default WeightsProviders are available: DiskNoCache, DiskPrefetch and Ram.

OnnxStream can consume even 55x less memory than OnnxRuntime with only a 50% to 200% increase in latency (on CPU, with a good SSD, with reference to the SD 1.5's UNET - see the Performance section below).

Permalink: /feed/onnxstream/

Tags: #ai #onnx

lqdev👽12/11/2023

https://www.cs.princeton.edu/~arvindn/talks/evaluating_llms_minefield/

...many things can go wrong when we are trying to evaluate LLMs’ performance on a certain task or behavior in a certain scenario.

It has big implications for reproducibility: both for research on LLMs and research that uses LLMs to answer a question in social science or any other field.

Permalink: /feed/evaluating-llms-minefield/

Tags: #ai #llms #evaluation

lqdev👽12/11/2023

https://benchmark.vectorview.ai/vectordbs.html

Picking a vector database can be hard. Scalability, latency, costs, and even compliance hinge on this choice. For those navigating this terrain, I've embarked on a journey to sieve through the noise and compare the leading vector databases of 2023. I’ve included the following vector databases in the comparision: Pinecone, Weviate, Milvus, Qdrant, Chroma, Elasticsearch and PGvector. The data behind the comparision comes from ANN Benchmarks, the docs and internal benchmarks of each vector database and from digging in open source github repos.

Permalink: /feed/vector-db-comparison-guide-2023/

Tags: #vector #databases

lqdev👽12/11/2023

https://memgpt.ai/

Teach LLMs to manage their own memory for unbounded context!

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems that provide the appearance of large memory resources through data movement between fast and slow memory. Using this technique, we introduce MemGPT (Memory-GPT), a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window, and utilizes interrupts to manage control flow between itself and the user. We evaluate our OS-inspired design in two domains where the limited context windows of modern LLMs severely handicaps their performance: document analysis, where MemGPT is able to analyze large documents that far exceed the underlying LLM's context window, and multi-session chat, where MemGPT can create conversational agents that remember, reflect, and evolve dynamically through long-term interactions with their users. We release MemGPT code and data for our experiments at https://memgpt.ai.

In MemGPT, a fixed-context LLM processor is augmented with a tiered memory system and a set of functions that allow it to manage its own memory. Main context is the (fixed-length) LLM input. MemGPT parses the LLM text ouputs at each processing cycle, and either yields control or executes a function call, which can be used to move data between main and external context. When the LLM generates a function call, it can request immediate return of execution to chain together functions. In the case of a yield, the LLM will not be run again until the next external event trigger (e.g. a user message or scheduled interrupt).

Permalink: /feed/memgpt-llm-operating-system/

Tags: #ai #os #llm

lqdev👽12/11/2023

https://www.databricks.com/blog/LLM-auto-eval-best-practices-RAG

This blog represents the first in a series of investigations we’re running at Databricks to provide learnings on LLM evaluation.

Recently, the LLM community has been exploring the use of “LLMs as a judge” for automated evaluation with many using powerful LLMs such as GPT-4 to do the evaluation for their LLM outputs.

Using the Few Shots prompt with GPT-4 didn’t make an obvious difference in the consistency of results.

Including few examples for GPT-3.5-turbo-16k significantly improves the consistency of the scores, and makes the result usable.

...evaluation results can’t be transferred between use cases and we need to build use-case-specific benchmarks in order to properly evaluate how good a model can meet customer needs.

Permalink: /feed/best-practices-rag-application-evaluation-databricks/

Tags: #ai #rag #evaluation #llm

lqdev👽12/11/2023

https://www.databricks.com/blog/announcing-mlflow-28-llm-judge-metrics-and-best-practices-llm-evaluation-rag-applications-part?utm_source=twitter&utm_medium=organic-social

LLM-as-a-judge is one promising tool in the suite of evaluation techniques necessary to measure the efficacy of LLM-based applications.

Permalink: /feed/mlflow-2-8-llm-as-judge-rag-evaluation/

Tags: #mlflow #ai #llm #evaluation

lqdev👽12/11/2023

https://arxiv.org/abs/2309.16671

Contrastive Language-Image Pre-training (CLIP) is an approach that has advanced research and applications in computer vision, fueling modern recognition systems and generative models. We believe that the main ingredient to the success of CLIP is its data and not the model architecture or pre-training objective. However, CLIP only provides very limited information about its data and how it has been collected, leading to works that aim to reproduce CLIP's data by filtering with its model parameters. In this work, we intend to reveal CLIP's data curation approach and in our pursuit of making it open to the community introduce Metadata-Curated Language-Image Pre-training (MetaCLIP). MetaCLIP takes a raw data pool and metadata (derived from CLIP's concepts) and yields a balanced subset over the metadata distribution. Our experimental study rigorously isolates the model and training settings, concentrating solely on data. MetaCLIP applied to CommonCrawl with 400M image-text data pairs outperforms CLIP's data on multiple standard benchmarks. In zero-shot ImageNet classification, MetaCLIP achieves 70.8% accuracy, surpassing CLIP's 68.3% on ViT-B models. Scaling to 1B data, while maintaining the same training budget, attains 72.4%. Our observations hold across various model sizes, exemplified by ViT-H achieving 80.5%, without any bells-and-whistles. Curation code and training data distribution on metadata is made available at this https URL.

Repository

MetaCLIP

Permalink: /feed/metaclip-demistifying-clip-data/

Tags: #ai #opensource #clip

lqdev👽12/11/2023

https://arxiv.org/abs/2310.10634

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs). Current language agent frameworks aim to facilitate the construction of proof-of-concept language agents while neglecting the non-expert user access to agents and paying little attention to application-level designs. We present OpenAgents, an open platform for using and hosting language agents in the wild of everyday life. OpenAgents includes three agents: (1) Data Agent for data analysis with Python/SQL and data tools; (2) Plugins Agent with 200+ daily API tools; (3) Web Agent for autonomous web browsing. OpenAgents enables general users to interact with agent functionalities through a web user interface optimized for swift responses and common failures while offering developers and researchers a seamless deployment experience on local setups, providing a foundation for crafting innovative language agents and facilitating real-world evaluations. We elucidate the challenges and opportunities, aspiring to set a foundation for future research and development of real-world language agents.

Permalink: /feed/openagents-platform-language-agents/

Tags: #ai #llm #agents

lqdev👽12/11/2023

https://crfm.stanford.edu/fmti/

A comprehensive assessment of the transparency of foundation model developers

Context. Foundation models like GPT-4 and Llama 2 are used by millions of people. While the societal impact of these models is rising, transparency is on the decline. If this trend continues, foundation models could become just as opaque as social media platforms and other previous technologies, replicating their failure modes.

Design. We introduce the Foundation Model Transparency Index to assess the transparency of foundation model developers. We design the Index around 100 transparency indicators, which codify transparency for foundation models, the resources required to build them, and their use in the AI supply chain.

Execution. For the 2023 Index, we score 10 leading developers against our 100 indicators. This provides a snapshot of transparency across the AI ecosystem. All developers have significant room for improvement that we will aim to track in the future versions of the Index.

Permalink: /feed/foundational-model-transparency-index-stanford/

Tags: #ai #opensource #transparency

lqdev👽12/11/2023

https://www.latent.space/p/oct-2023

Mistral 7B, released at the tail end of Sept 2023, is both Apache 2.0 and smaller but better than Llama 2, and is now rumored to be raising $400m at $2.5b valuation from a16z.

Permalink: /feed/new-kings-open-source-ai/

Tags: #ai #opensource #newsletter

lqdev👽12/11/2023

https://mistral.ai/news/mixtral-of-experts/

Today, the team is proud to release Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Licensed under Apache 2.0. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It is the strongest open-weight model with a permissive license and the best model overall regarding cost/performance trade-offs. In particular, it matches or outperforms GPT3.5 on most standard benchmarks.

Mixtral has the following capabilities.

It gracefully handles a context of 32k tokens.

It handles English, French, Italian, German and Spanish.

It shows strong performance in code generation.

It can be finetuned into an instruction-following model that achieves a score of 8.3 on MT-Bench.

Mixtral is a sparse mixture-of-experts network. It is a decoder-only model where the feedforward block picks from a set of 8 distinct groups of parameters. At every layer, for every token, a router network chooses two of these groups (the “experts”) to process the token and combine their output additively.

This technique increases the number of parameters of a model while controlling cost and latency, as the model only uses a fraction of the total set of parameters per token. Concretely, Mixtral has 46.7B total parameters but only uses 12.9B parameters per token. It, therefore, processes input and generates output at the same speed and for the same cost as a 12.9B model.

Permalink: /feed/mixtral-of-experts/

Tags: #ai #llm #opensource

lqdev👽12/11/2023

https://kottke.org/23/11/the-future-of-rss-is-textcasting-1

Here’s the philosophy:

The goal is interop between social media apps and the features writers need.

What we’re doing: Moving documents between networked apps. We need a set of common features in order for it to work.

The features are motivated by the needs of writers. Not by programmers or social media company execs.

It’s a proposal to build, using technologies we already have and understand very well, a very simple social media protocol that is completely agnostic about what editor you use to write your posts and what viewer you choose to read it. Writer/authors would have more control over styling, links, media enclosures, etc., and readers would have more control over how and where they consume it. It’s decentralized social media, but without the need to peer through ActivityPub or anybody else’s API and squeeze our toothpaste through its tubes.

Additional resources

Textcasting.org

Permalink: /feed/future-rss-textcasting/

Tags: #rss #openweb #internet #socialmedia

lqdev👽12/11/2023

https://blakewatson.com/journal/omg-lol-an-oasis-on-the-internet/

The main thing you are getting with omg.lol is one or more subdomains, which are referred to as addresses.

Email forwarding: You get an email address, you@omg.lol, which you can forward to any email address.

Web Page: This is your link-in-bio one-pager to do whatever you want with. By default this is where your main address (eg, you.omg.lol) points. It’s the flagship feature of omg.lol. It comes with a markdown editor that has some fancy features baked into it. You get a selection of built-in themes but you also have the freedom to go wild with your own CSS.

DNS: You have the ability to use your omg.lol subdomain however you wish by way of a friendly DNS panel.

Now Page: This is a type of page you can use to let people know what’s going on in your life. It’s broader than a social media post but more immediately relevant than an about page. It comes with the same fancy markdown editor and you can optionally appear in omg.lol’s Now Garden.

Statuslog: This is a place to post statuses. It’s really just a fun, silly alternative to other social media platforms but without follows and likes and such. These can cross-post to Mastodon if you want.

Weblog: A full-fledge blogging platform. I’m not aware of all its features but it’s pretty powerful. It comes with fancy markdown support and has all the bloggy things you need like tags and RSS. A good example of a very custom blog on omg.lol is Apple Annie’s Weblog. But it’s worth noting you use it right out of the box without design customization if you want.

Pastebin: It’s just a pastebin for storing text snippets. Super simple and friendly like all of the omg.lol services.

Pics: It’s an image hosting service labeled as being “super-beta” as of the time of this writing. But it does what it says on the tin. You can host images there and they also show up on the some.pics image feed.

PURLs: Persistent uniform resource locators. This is a URL redirection service. You get you.omg.lol/whatever and you.url.lol/whatever. You can use these the way you would use similar services and they come with a basic hit counter and way to preview the URL before following it.

Switchboard: This is a powerful routing system that lets you point the variants of your address wherever you want, be it a destination on the omg.lol platform or an external website. Most omg.lol services have their own domain so you end up with a variety of options. Just as an example, you get a tilde address (ie, omg.lol/~you). Mine points to my tilde.club webpage.

Keys: A place to store public keys—SSH, PGP, etc.

Proofs: A service for verifying ownership or control of a particular web property at a particular moment in time. For example, here is proof that I controlled blakewatson.com as of December 10, 2023.

API access: Most, if not all, omg.lol services have an API you can use to interact with them. Total nerd freedom. 🤯

Permalink: /feed/omg-lol-internet-oasis-watson/

Tags: #web #community

lqdev👽12/10/2023

https://amazingnewsletters.com/

Find the best newsletters to subscribe to!

Permalink: /feed/amazing-newsletters/

Tags: #newsletters

lqdev👽12/09/2023

https://thoughtcatalog.com/ryan-holiday/2017/01/to-everyone-who-asks-for-just-a-little-of-your-time/

Makers...need to have large blocks of uninterrupted, unscheduled time to do what they do. To create and think.

I keep a maker’s schedule because I believe that anything else is anathema to deep work or creativity.

Seneca writes that if all the geniuses in history were to get together, none would be able explain our baffling relationship with time. He says,

No person would give up even an inch of their estate, and the slightest dispute with a neighbor can mean hell to pay; yet we easily let others encroach on our lives—worse, we often pave the way for those who will take it over. No person hands out their money to passers-by, but to how many do each of us hand out our lives! We’re tight-fisted with property and money, yet think too little of wasting time, the one thing about which we should all be the toughest misers.

Time? Time is our most irreplaceable asset—we cannot buy more of it. We cannot get a second of it back. We can only hope to waste as little as possible. Yet somehow we treat it as most renewable of all resources.

Permalink: /feed/what-it-cost-to-say-yes-holiday/

Tags: #productivity #timemanagement

lqdev👽12/09/2023

https://staysaasy.com/management/2023/12/07/accelerating-product-velocity.html

Remove Dependencies

Create a culture that favors begging forgiveness (and reversing decisions quickly) rather than asking permission. Invest in infrastructure such as progressive / cancellable rollouts. Use asynchronous written docs to get people aligned (“comment in this doc by Friday if you disagree with the plan”) rather than meetings (“we’ll get approval at the next weekly review meeting”).

Demand Clear Narratives

Unclear thinking is a reliable cause of slowness, and gets revealed under a microscope.

Bonus points for documenting plans in writing. One of the largest advantages of a strong writing culture is that it forces much clearer narratives than meetings, powerpoint, or five Slack threads spread over 8 business days.

Get Your Deployment and Incident Metrics In Shape

No matter what your job function is, part of your role is ensuring that your engineering team has enough time to get their vital metrics in order. Especially if you’re a product leader, it’s essential that you resist the temptation to push relentlessly for more features and give your engineering counterparts the room to get fit.

Find Trusted Engineering Guides

...it’s especially important to build a strong relationship with all of your engineering partners, and especially these trusted guides.

Permalink: /feed/practical-ways-increase-product-velocity/

Tags: #product #productivity

lqdev👽12/08/2023

https://github.com/microsoft/satclip

SatCLIP trains location and image encoders via contrastive learning, by matching images to their corresponding locations. This is analogous to the CLIP approach, which matches images to their corresponding text. Through this process, the location encoder learns characteristics of a location, as represented by satellite imagery. For more details, check out our paper.

Permalink: /feed/microsoft-satclip/

Tags: #ai #opensource

lqdev👽12/08/2023

https://dosdeck.com/

DOS_deck is built upon the foundation of JS-DOS, which, in turn, relies on DOSBox. Together, they breathe new life into MS-DOS games by bringing them to your browser. However, there's a twist. Games from that era were designed for keyboard and mouse input, without established standards for interaction or control patterns. Here at DOS_deck, a tremendous effort was put into creating a seamless experience, enabling you to effortlessly navigate and play these games, ideally with the comfort of a controller in hand.

Permalink: /feed/dos-deck/

Tags: #gaming #dos #retro #web

lqdev👽12/06/2023

https://defaults.rknight.me/

Aggregated list of App Defaults blog posts inspired by Hemispheric Views 097 - Duel of the Defaults!

Permalink: /feed/app-defaults-catalog/

Tags: #community #blogging #apps

lqdev👽12/06/2023

https://daverupert.com/rss-club/

RSS Club is a collection of blogs (personal and otherwise) committed to providing RSS-only content. It’s like a newsletter delivered to your feed reader in order to celebrate the medium of RSS and breakaway from social media.

Permalink: /feed/rss-club/

Tags: #rss #community

lqdev👽12/06/2023

https://www.anthropic.com/index/claude-2-1-prompting

Claude 2.1 recalls information very well across its 200,000 token context window

However, the model can be reluctant to answer questions based on an individual sentence in a document, especially if that sentence has been injected or is out of place

A minor prompting edit removes this reluctance and results in excellent performance on these tasks

What can users do if Claude is reluctant to respond to a long context retrieval question? We’ve found that a minor prompt update produces very different outcomes in cases where Claude is capable of giving an answer, but is hesitant to do so. When running the same evaluation internally, adding just one sentence to the prompt resulted in near complete fidelity throughout Claude 2.1’s 200K context window

We achieved significantly better results on the same evaluation by adding the sentence “Here is the most relevant sentence in the context:” to the start of Claude’s response. This was enough to raise Claude 2.1’s score from 27% to 98% on the original evaluation.

Permalink: /feed/claude-long-context-prompting/

Tags: #prompts #ai #llm

lqdev👽12/06/2023

https://chrismcleod.dev/blog/blogging-is-where-its-at-again/

the blog is the “natural form” of posting on the web: a site of your own, that you control[1] and set your own rules on content and discussion; where you can post whatever you like without worrying about “The Algorithm”

For better or for worse, social media opened up the web to a lot more people for a number of reasons...But deep down I feel having your own site is better. For the web, and for you: the writer and the reader.

...stumbling into such a trove of active blogs has enthused me about blogging as a medium again. It’s sparked a thought that through a combination of increased blogging activity, declining platforms, and increasing adoption of open standards to glue everything together, that maybe — just maybe — we can swing the web back towards the blog again.

Agree with many of the points. Also, TIL you could subscribe to OPML feeds.

Permalink: /feed/blogging-where-its-at-again-mcleod/

Tags: #blogging #socialmedia #rss #community

lqdev👽12/06/2023

https://blog.google/technology/ai/google-gemini-ai/

Gemini is the result of large-scale collaborative efforts by teams across Google, including our colleagues at Google Research. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across and combine different types of information including text, code, audio, image and video.

Gemini is also our most flexible model yet — able to efficiently run on everything from data centers to mobile devices. Its state-of-the-art capabilities will significantly enhance the way developers and enterprise customers build and scale with AI.

We’ve optimized Gemini 1.0, our first version, for three different sizes:

Gemini Ultra — our largest and most capable model for highly complex tasks.

Gemini Pro — our best model for scaling across a wide range of tasks.

Gemini Nano — our most efficient model for on-device tasks.

We designed Gemini to be natively multimodal, pre-trained from the start on different modalities. Then we fine-tuned it with additional multimodal data to further refine its effectiveness. This helps Gemini seamlessly understand and reason about all kinds of inputs from the ground up, far better than existing multimodal models — and its capabilities are state of the art in nearly every domain.

Our first version of Gemini can understand, explain and generate high-quality code in the world’s most popular programming languages, like Python, Java, C++, and Go.

On TPUs, Gemini runs significantly faster than earlier, smaller and less-capable models.

Starting today, Bard will use a fine-tuned version of Gemini Pro for more advanced reasoning, planning, understanding and more.

We’re also bringing Gemini to Pixel. Pixel 8 Pro is the first smartphone engineered to run Gemini Nano, which is powering new features like Summarize in the Recorder app and rolling out in Smart Reply in Gboard, starting with WhatsApp — with more messaging apps coming next year.

Starting on December 13, developers and enterprise customers can access Gemini Pro via the Gemini API in Google AI Studio or Google Cloud Vertex AI.

Permalink: /feed/introducing-gemini-google/

Tags: #ai #lmm

lqdev👽12/05/2023

https://hacks.mozilla.org/2023/11/introducing-llamafile/

Today we’re announcing the first release of llamafile and inviting the open source community to participate in this new project.

llamafile lets you turn large language model (LLM) weights into executables.

We achieved all this by combining two projects that we love: llama.cpp (a leading open source LLM chatbot framework) with Cosmopolitan Libc (an open source project that enables C programs to be compiled and run on a large number of platforms and architectures). It also required solving several interesting and juicy problems along the way, such as adding GPU and dlopen() support to Cosmopolitan;

Permalink: /feed/introducing-llamafile/

Tags: #ai #llm #opensource

lqdev👽12/05/2023

https://framablog.org/2023/11/28/peertube-v6-is-out-and-powered-by-your-ideas/

The sixth major version is being released today and we are very proud !

Protect your videos with passwords !

Video storyboard : preview what’s coming !

Upload a new version of your video !

Get chapters in your videos !

Stress tests, performance and config recommandations

…and there’s always more !

Permalink: /feed/peertube-v6-released/

Tags: #fediverse #peertube #video #opensource

lqdev👽12/05/2023

https://blog.jim-nielsen.com/2023/how-i-take-and-publish-notes/

99% of the time, this is how my note-taking process goes:

I’m catching up on my RSS feed (on my phone in the Reeder app)

I read something that strikes me as interesting, novel, or insightful.

I copy/paste it as an blockquote into a new, plain-text note in iA writer.

I copy/paste the link of the article into iA writer.

I finish reading the article and copy/paste anything else in the article that strikes me.

I add my own comments in the note as they pop into my head.

I move on to the next article in my RSS feed.

Repeat.

Kind of meta but somewhat similar process for me. To publish the different content found on my response feed, I:

Go through articles on my RSS feed (NewsBlur on both desktop and mobile).
Copy URL and block quotes from article and paste them somewhere. When I have time like now, I create a post like this one, usually in VS Code. If I don't have time though, I've been experimenting with using a messaging app like Element and E-mail as a read-it-later service. At minimum, I create a message with the link and send it to myself for later review. Later on when I have time, I create the post with additional comments and content from the article.
(Optional) Add some of my own comments.
Publish the notes.
Repeat.

Permalink: /feed/take-and-publish-notes-nielsen/

Tags: #blogging #notetaking

lqdev👽12/05/2023

https://www.schneier.com/blog/archives/2023/12/ai-and-mass-spying.html

Surveillance facilitates social control, and spying will only make this worse. Governments around the world already use mass surveillance; they will engage in mass spying as well.

Mass surveillance ushered in the era of personalized advertisements; mass spying will supercharge that industry...The tech monopolies that are currently keeping us all under constant surveillance won’t be able to resist collecting and using all of that data.

We could limit this capability. We could prohibit mass spying. We could pass strong data-privacy rules. But we haven’t done anything to limit mass surveillance. Why would spying be any different?

Permalink: /feed/ai-mass-spying-schneier/

Tags: #ai #privacy #internet #surveillance

lqdev👽12/05/2023

https://thealliance.ai/news

AI Alliance Launches as an International Community of Leading Technology Developers, Researchers, and Adopters Collaborating Together to Advance Open, Safe, Responsible AI

Permalink: /feed/ai-alliance/

Tags: #ai #opensource

lqdev👽11/12/2023

https://twitter.com/studiosipu/status/1723582194432794638/video/1

Preach 🙌

Permalink: /feed/tyler-ted-talk/

Tags: #tedtalk #festival

lqdev👽11/11/2023

https://www.youneedfeeds.com/

YOU NEED FEEDS.

A web feed is a special listing of the latest content from your favourite site. News, music, video and more - whatever is new, web feeds will show you. What's more, you can combine your favourite feeds using a feed reader application - and suddenly the whole web comes to you.

You don't have to do the work of staying on top any more. You can now visit a single site, or use a single app, and see everything that's new and interesting. You choose the content. You're in control.

Permalink: /feed/you-need-feeds/

Tags: #rss #feeds #guides

lqdev👽10/24/2023

https://ma.tt/2023/10/texts-joins-automattic/

Using an all-in-one messaging app is a real game-changer for productivity and keeping up with things.

This is obviously a tricky area to navigate, as in the past the networks have blocked third-party clients, but I think with the current anti-trust and regulatory environments this is actually something the big networks will appreciate: it maintains the same security as their clients, opens them up in a way consumers will love and is very user-centric, and because we’re committed to supporting all their features it can actually increase engagement and usage of their platforms.

I can relate to the feeling of wanting to have one inbox expressed in the video. Coincidentally, I've been playing with Delta Chat and by building on top of e-mail, some of the issues with the siloed platforms are alleviated. Also, e-mail isn't dead and despite some of its shortcomings, it's still broadly used to sign up and sign into platforms.

Permalink: /feed/texts-joins-automattic/

Tags: #messaging #interop #decentralization #opensource

lqdev👽10/24/2023

https://www.theverge.com/2023/10/24/23928685/automattic-texts-acquisition-universal-messaging

I'm really liking the recent acquisitions from Automattic. I'm just starting to use Day One and really enjoy it. Pocket Casts is a fantastic podcast app, though I prefer to use AntennaPod. WordPress is also starting to make it easy to plug into the Fediverse using your blog. I'm excited for Texts and what that might offer in the current siloed messaging landscape.

Automattic, the company that runs WordPress.com, Tumblr, Pocket Casts, and a number of other popular web properties, just made a different kind of acquisition: it’s buying Texts, a universal messaging app, for $50 million.

Texts is an app for all your messaging apps. You can use it to log in to WhatsApp, Instagram, LinkedIn, Signal, iMessage, and more and see and respond to all your messages in one place.

...Mullenweg says he’s bullish on solutions like Matrix, which offers a decentralized and open-source messaging network, and other up-and-coming standards for messaging. He’s already thinking about how Texts might gently nudge people toward more open protocols over time.

Mullenweg and Automattic see a big future for messaging, as more online interaction shifts away from public-first social networks and toward things like group chats. Hardly anyone has figured out how to build a meaningful and sustainable business from chat, but Mullenweg thinks it’s possible. And he thinks it starts with making your messaging a little less messy.

Permalink: /feed/automattic-acquires-texts-messaging/

Tags: #messaging #interop #decentralization #opensource

lqdev👽10/23/2023

https://www.theverge.com/2023/10/23/23928550/posse-posting-activitypub-standard-twitter-tumblr-mastodon

The platform era is ending. Rather than build new Twitters and Facebooks, we can create a stuff-posting system that works better for everybody.

In a POSSE world, everybody owns a domain name, and everybody has a blog. (I’m defining “blog” pretty loosely here — just as a place on the internet where you post your stuff and others consume it.)

But there are some big challenges to the idea...The most immediate question...is simply how to build a POSSE system that works. POSSE’s problems start at the very beginning: it requires owning your own website, which means buying a domain and worrying about DNS records and figuring out web hosts, and by now, you’ve already lost the vast majority of people who would rather just type a username and password into some free Meta platform...Even those willing and able to do the technical work can struggle to make POSSE work.

When I ask Doctorow why he believed in POSSE, he describes the tension every poster feels on the modern internet. “I wanted to find a way to stand up a new platform in this moment,” he says, “where, with few exceptions, everyone gets their news and does their reading through the silos that then hold you to ransom. And I wanted to use those silos to bring in readers and to attract and engage with an audience, but I didn’t want to become beholden to them.” The best of both worlds is currently a lot of work. But the poster’s paradise might not be so far away.

Permalink: /feed/posse-better-way-to-post-social-networks/

Tags: #indieweb #posse #socialnetworks #internet #community

lqdev👽10/14/2023

https://udlbook.github.io/udlbook/

The title of this book is “Understanding Deep Learning” to distinguish it from volumes that cover coding and other practical aspects. This text is primarily about the ideas that underlie deep learning. The first part of the book introduces deep learning models and discusses how to train them, measure their performance, and improve this performance. The next part considers architectures that are specialized to images, text, and graph data. These chapters require only introductory linear algebra, calculus, and probability and should be accessible to any second-year undergraduate in a quantitative discipline. Subsequent parts of the book tackle generative models and reinforcement learning. These chapters require more knowledge of probability and calculus and target more advanced students.

Permalink: /feed/understanding-deep-learning-book/

Tags: #book #deeplearning #ai

lqdev👽10/14/2023

https://arxiv.org/abs/2309.17421

Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory skills, such as visual understanding, to achieve stronger generic intelligence. In this paper, we analyze the latest model, GPT-4V(ision), to deepen the understanding of LMMs. The analysis focuses on the intriguing tasks that GPT-4V can perform, containing test samples to probe the quality and genericity of GPT-4V's capabilities, its supported inputs and working modes, and the effective ways to prompt the model. In our approach to exploring GPT-4V, we curate and organize a collection of carefully designed qualitative samples spanning a variety of domains and tasks. Observations from these samples demonstrate that GPT-4V's unprecedented ability in processing arbitrarily interleaved multimodal inputs and the genericity of its capabilities together make GPT-4V a powerful multimodal generalist system. Furthermore, GPT-4V's unique capability of understanding visual markers drawn on input images can give rise to new human-computer interaction methods such as visual referring prompting. We conclude the report with in-depth discussions on the emerging application scenarios and the future research directions for GPT-4V-based systems. We hope that this preliminary exploration will inspire future research on the next-generation multimodal task formulation, new ways to exploit and enhance LMMs to solve real-world problems, and gaining better understanding of multimodal foundation models. Finally, we acknowledge that the model under our study is solely the product of OpenAI's innovative work, and they should be fully credited for its development. Please see the GPT-4V contributions paper for the authorship and credit attribution: this https URL

Permalink: /feed/dawn-lmms-early-explorations-gtp4v/

Tags: #ai #lmm

lqdev👽10/14/2023

https://arxiv.org/abs/2309.11495

Generation of plausible yet incorrect factual information, termed hallucination, is an unsolved issue in large language models. We study the ability of language models to deliberate on the responses they give in order to correct their mistakes. We develop the Chain-of-Verification (COVE) method whereby the model first (i) drafts an initial response; then (ii) plans verification questions to fact-check its draft; (iii) answers those questions independently so the answers are not biased by other responses; and (iv) generates its final verified response. In experiments, we show COVE decreases hallucinations across a variety of tasks, from list-based questions from Wikidata, closed book MultiSpanQA and longform text generation.

Permalink: /feed/chain-of-verification-prompting-technique/

Tags: #ai #llm #promptengineering

lqdev👽10/14/2023

https://wordpress.com/blog/2023/10/11/activitypub/

Exciting times are here for all WordPress.com users! The revolutionary ActivityPub feature is now available across all WordPress.com plans, unlocking a world of engagement and interaction for your blog. Your blogs can now be part of the rapidly expanding fediverse, which enables you to connect with a broader audience and attract more followers.

I can't believe I missed these news but so exciting!

Permalink: /feed/activitypub-plugin-available-hosted-wordpress/

Tags: #activitypub #mastodon #wordpress #blogging

lqdev👽10/14/2023

https://huyenchip.com/2023/10/10/multimodal.html

This post covers multimodal systems in general, including LMMs. It consists of 3 parts.

Part 1 covers the context for multimodality, including why multimodal, different data modalities, and types of multimodal tasks.
Part 2 discusses the fundamentals of a multimodal system, using the examples of CLIP, which lays the foundation for many future multimodal systems, and Flamingo, whose impressive performance gave rise to LMMs.
Part 3 discusses some active research areas for LMMs, including generating multimodal outputs and adapters for more efficient multimodal training, covering newer multimodal systems such as BLIP-2, LLaVA, LLaMA-Adapter V2, LAVIN, etc.

Permalink: /feed/multimodality-large-multimodal-models-lmm-huyen/

Tags: #ai #lmm

lqdev👽10/14/2023

https://github.com/huggingface/text-embeddings-inference

A blazing fast inference solution for text embeddings models.

Permalink: /feed/hf-text-embedding-inference/

Tags: #ai #embeddings #text #inference #ml

lqdev👽10/06/2023

https://radiooooo.com/

Radiooooo is a project born in 2013, dreamt up by a little family of friends, both djs and music lovers, who decided to share their record collections and the fruit of many years of research, for all to enjoy.

« Sharing and discovering » , « curiosity and pleasure » these are the foundations of this musical time machine.

Radiooooo is a collaborative website, whose goal is to open each and everyone’s horizons through culture and beauty.

Permalink: /feed/radiooooo/

Tags: #music #radio

lqdev👽09/28/2023

https://www.beren.io/2023-04-11-Scaffolded-LLMs-natural-language-computers/

ReACT LLM Pattern Image — Source: *beren.io*

Image of high-level CPU architecture — Source: *beren.io*

Permalink: /feed/scaffolded-llms-natural-language-computers/

Tags: #ai #llm #cpu

lqdev👽09/28/2023

https://queue.acm.org/detail.cfm?id=3623391

The team at NVIDIA brings confidentiality and integrity to user code and data for accelerated computing.

Permalink: /feed/creating-first-confidential-gpus/

Tags: #ai #security #privacy #gpu

lqdev👽09/28/2023

https://danielmiessler.com/p/the-ai-attack-surface-map-v1-0/

AI Surface Attack Map — Source: *danielmiessler.com*

Permalink: /feed/ai-attack-surface-map-v1/

Tags: #ai #security #cybersecurity

lqdev👽09/28/2023

https://mistral.ai/

Mistral-7B-v0.1 is a small, yet powerful model adaptable to many use-cases. Mistral 7B is better than Llama 2 13B on all benchmarks, has natural coding abilities, and 8k sequence length. It’s released under Apache 2.0 licence. We made it easy to deploy on any cloud, and of course on your gaming GPU.

Permalink: /feed/mistral-ai/

Tags: #ai #llm #opensource

lqdev👽09/28/2023

https://about.fb.com/news/2023/09/introducing-ai-powered-assistants-characters-and-creative-tools/

We’re starting to roll out AI stickers across our apps, and soon you’ll be able to edit your images or even co-create them with friends on Instagram using our new AI editing tools, restyle and backdrop.

We’re introducing Meta AI in beta, an advanced conversational assistant that’s available on WhatsApp, Messenger, and Instagram, and is coming to Ray-Ban Meta smart glasses and Quest 3. Meta AI can give you real-time information and generate photorealistic images from your text prompts in seconds to share with friends. (Available in the US only)

We’re also launching 28 more AIs in beta, with unique interests and personalities. Some are played by cultural icons and influencers, including Snoop Dogg, Tom Brady, Kendall Jenner, and Naomi Osaka.

Over time, we’re making AIs for businesses and creators available, and releasing our AI studio for people and developers to build their own AIs.

These new AI experiences also come with a new set of challenges for our industry. We’re rolling out our new AIs slowly and have built in safeguards.

Permalink: /feed/meta-announces-ai-experiences-fb-whatsapp-ig/

Tags: #meta #ai #facebook

lqdev👽09/28/2023

https://www.raspberrypi.com/products/raspberry-pi-5/

The everything computer. Optimised.

With 2–3× the speed of the previous generation, and featuring silicon designed in‑house for the best possible performance, we’ve redefined the Raspberry Pi experience.

Coming October 2023

Permalink: /feed/raspberry-pi-5/

Tags: #raspberrypi #selfhosting

lqdev👽09/28/2023

https://carton.run/

Carton makes it easy to run any ML model from any programming language.

Permalink: /feed/carton-ml-models/

Tags: #ai #ml #mlops

lqdev👽09/26/2023

https://shop.boox.com/products/palma

I really like my Boox e-reader but having a more pocketable device would be amazing. It's unforunate you can't also use it for handwritten notes but at this size it makes sense.

Permalink: /feed/boox-palma-eink-phone/

Tags: #eink #ereader #mp3 #minimalism #simplicity

lqdev👽09/25/2023

https://github.com/FlowiseAI/Flowise

GIF of FlowiseAI LLM visual tool

Permalink: /feed/flowiseai-visual-llm-flow/

Tags: #ai #tools

lqdev👽09/25/2023

https://varunshenoy.substack.com/p/why-open-source-ai-will-win

Permalink: /feed/why-open-source-ai-will-win-varun/

Tags: #opensource #ai

lqdev👽09/25/2023

https://blog.research.google/2023/09/distilling-step-by-step-outperforming.html

In “Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes”, presented at ACL2023, we set out to tackle this trade-off between model size and training data collection cost. We introduce distilling step-by-step, a new simple mechanism that allows us to train smaller task-specific models with much less training data than required by standard fine-tuning or distillation approaches that outperform few-shot prompted LLMs’ performance. We demonstrate that the distilling step-by-step mechanism enables a 770M parameter T5 model to outperform the few-shot prompted 540B PaLM model using only 80% of examples in a benchmark dataset, which demonstrates a more than 700x model size reduction with much less training data required by standard approaches.

Permalink: /feed/distilling-step-x-step-outperforming-large-models-small-data/

Tags: #ai #llm #slm #optimization

lqdev👽09/25/2023

https://www.bbc.com/future/article/20230912-how-i-hacked-my-brain

There is growing evidence that simple, everyday changes to our lives can alter our brains and change how they work. Melissa Hogenboom put herself into a scanner to find out.

I was surprised that something as simple as mindfulness can play such a crucial role in keeping our minds healthy. Research has shown that mindfulness is a simple but powerful way to enhance several cognitive functions. It can improve attention, relieve pain and reduce stress. Research has found that after only a few months of mindfulness training, certain depression and anxiety symptoms can ease – though as with any complex mental health problem, this may of course vary depending on individual circumstances.

Permalink: /feed/rewired-brain-six-weeks-bbc/

Tags: #neuroscience #habits #mindfulness #cognition

lqdev👽09/25/2023

http://josephthacker.com/ai/2023/09/18/vim-llm-hacks.html

... I learned that you could be inside vim, but manipulate the entire file as if you were piping the contents of the file into a command. The output of the command does in-line replacement of the entire file with those changes. That sounds confusing, but it just means you can be inside a vim file and do :%!grep test and it’ll remove all lines that don’t contain test, for example.

This post is a simple showcase of taking that concept, but throwing an llm into the mix to add more dynamic functionality.

Permalink: /feed/vim-llm-hacks-thacker/

Tags: #vi #linux #llm #ai #softwaredevelopment

lqdev👽09/25/2023

https://www.youtube.com/watch?v=bskEGP0r3hE

The future is AVX10, so says Intel. Recently a document was released showcasing a post-AVX512 world, and to explain why this matters, I've again invited the Chips And Cheese crew onto the channel. Chester and George answer my questions on AVX10 and why it matters! Visit http://www.chipsandcheese.com to learn more!

Permalink: /feed/next-gen-cpu-acceleration-avx-gen-ai/

Tags: #ai #hardware #hpc

lqdev👽09/25/2023

https://www.nps.gov/katm/learn/fat-bear-week.htm

Fat Bear Week - an annual celebration of success. All bears are winners but only one true champion will emerge. Held over the course of seven days and concluding on the Fat Bear Tuesday, people chose which bear to crown in this tournament style bracket where bears are pitted against each other for your vote.

Permalink: /feed/fat-bear-week-2023/

Tags: #nature #bears #nps

lqdev👽09/25/2023

https://openai.com/blog/chatgpt-can-now-see-hear-and-speak

We are beginning to roll out new voice and image capabilities in ChatGPT. They offer a new, more intuitive type of interface by allowing you to have a voice conversation or show ChatGPT what you’re talking about.

Permalink: /feed/chatgpt-voice-image-capabilities/

Tags: #ai #chatgpt #openai #llm #machinelearning #ml

lqdev👽09/25/2023

https://www.aboutamazon.com/news/company-news/amazon-aws-anthropic-ai

Anthropic selects AWS as its primary cloud provider and will train and deploy its future foundation models on AWS Trainium and Inferentia chips, taking advantage of AWS’s high-performance, low-cost machine learning accelerators.

Permalink: /feed/aws-anthropic-partnership/

Tags: #ai #cloud

lqdev👽09/23/2023

https://tracydurnell.com/2023/09/23/my-20th-anniversary-of-blogging/

Happy 20th anniversary! Also, thanks for the generous linking. Lots of new folks to read and subscribe to.

Permalink: /feed/tracy-durnell-20-blogging-anniversary/

Tags: #blogs #blogging #indieweb

lqdev👽09/22/2023

https://openai.com/dall-e-3

DALL·E 3 understands significantly more nuance and detail than our previous systems, allowing you to easily translate your ideas into exceptionally accurate images.

Permalink: /feed/dall-e-3/

Tags: #dalle #ai #openai #images #llm #ml

lqdev👽09/22/2023

https://matrix.org/blog/2023/09/matrix-2-0/

TL;DR: If you want to play with a shiny new Matrix 2.0 client, head over to Element X.

Permalink: /feed/matrix-2-0/

Tags: #matrix #messaging #communication #opensource #protocols #decentralization #openweb #web #internet

lqdev👽09/17/2023

https://www.theatlantic.com/technology/archive/2023/09/managing-digital-privacy-personal-information-online/675184/

Sponsored post, but it's still a good list with guidance and suggestions for common questions.

Permalink: /feed/atlantic-privacy-guide-2023/

Tags: #privacy #security #internet

lqdev👽09/17/2023

https://huggingface.co/blog/optimize-llm

In this blog post, we will go over the most effective techniques at the time of writing this blog post to tackle these challenges for efficient LLM deployment:

Lower Precision: Research has shown that operating at reduced numerical precision, namely 8-bit and 4-bit, can achieve computational advantages without a considerable decline in model performance.

2. Flash Attention: Flash Attention is a variation of the attention algorithm that not only provides a more memory-efficient approach but also realizes increased efficiency due to optimized GPU memory utilization.

3. Architectural Innovations: Considering that LLMs are always deployed in the same way during inference, namely autoregressive text generation with a long input context, specialized model architectures have been proposed that allow for more efficient inference. The most important advancement in model architectures hereby are Alibi, Rotary embeddings, Multi-Query Attention (MQA) and Grouped-Query-Attention (GQA).

Permalink: /feed/optimizing-llm-production-hf/

Tags: #ai #llm #production #engineering #software #mlops #aiops #opensource

lqdev👽09/17/2023

https://blog.minch.co/2022/11/15/software-squared.html

A new generation of AIs that become increasingly general by producing their own training data

We are currently at the cusp of transitioning from “learning from data” to “learning what data to learn from” as the central focus of AI research.

If deep learning can be described as “Software 2.0”—software that programs itself based on example inputs/output pairs, then this promising, data-centric paradigm, in which software effectively improves itself by searching for its own training data, can be described as a kind of “Software²”. This paradigm inherits the benefits of Software 2.0 while improving on its core, data-bound weaknesses: While deep learning (Software 2.0) requires the programmer to manually provide training data for each new task, Software² recasts data as software that models or searches the world to produce its own, potentially unlimited, training tasks and data.

Permalink: /feed/software-squared-jiang/

Tags: #ai #software #data

lqdev👽09/17/2023

https://github.com/OpenRobotLab/PointLLM

We introduce PointLLM, a multi-modal large language model capable of understanding colored point clouds of objects. It perceives object types, geometric structures, and appearance without concerns for ambiguous depth, occlusion, or viewpoint dependency. We collect a novel dataset comprising 660K simple and 70K complex point-text instruction pairs to enable a two-stage training strategy. To rigorously evaluate our model's perceptual abilities and its generalization capabilities, we establish two benchmarks: Generative 3D Object Classification and 3D Object Captioning, assessed through three different evaluation methods.

Permalink: /feed/point-llm-point-clouds/

Tags: #ai #llm #generativeai #multimodal #research

lqdev👽09/17/2023

https://a16z.com/how-are-consumers-using-generative-ai/

1. Most leading products are built from the “ground up” around generative AI
Like ChatGPT, the majority of products on this list didn’t exist a year ago—80% of these websites are new Of the 50 companies on the list, only 5 are products of, or acquisitions by, pre-existing big tech companies... Of the remaining list members, a whopping 48% are completely bootstrapped, with no outside funding, according to PitchBook data.

2. ChatGPT has a massive lead, for now…
ChatGPT represents 60% of monthly traffic to the entire top 50 list, with an estimated 1.6 billion monthly visits and 200 million monthly users (as of June 2023). This makes ChatGPT the 24th most visited website globally.

3. LLM assistants (like ChatGPT) are dominant, but companionship and creative tools are on the rise
General LLM chatbots represent 68% of total consumer traffic to the top 50 list. However, two other categories have started to drive significant usage in recent months—AI companions (such as CharacterAI) and content generation tools (such as Midjourney and ElevenLabs). Within the broader content generation category, image generation is the top use case with 41% of traffic, followed by prosumer writing tools at 26%, and video generation at 8%. Another category worth mentioning? Model hubs. There are only 2 on the list, but they drive significant traffic—Civitai (for images) and Hugging Face both rank in the top 10. This is especially impressive because consumers are typically visiting these sites to download models to run locally, so web traffic is likely an underestimate of actual usage.

4. Early “winners” have emerged, but most product categories are up for grabs
Good news for builders: despite the surge in interest in generative AI, in many categories there is not yet a runway success.

5. Acquisition for top products is entirely organic—and consumers are willing to pay!
The majority of companies on this list have no paid marketing (at least, that SimilarWeb is able to attribute). There is significant free traffic “available” via X, Reddit, Discord, and email, as well as word of mouth and referral growth. And consumers are willing to pay for GenAI. 90% of companies on the list are already monetizing, nearly all of them via a subscription model. The average product on the list makes $21/month (for users on monthly plans)—yielding $252 annually.

6. Mobile apps are still emerging as a GenAI platform
Consumer AI products have, thus far, been largely browser-first, rather than app-first. Even ChatGPT took 6 months to launch a mobile app! Why aren’t more AI companies building on mobile? The browser is a natural starting place to reach the broadest base of consumers. Many AI companies have small teams and likely don’t want to fragment their focus and resources across Web, iOS, and Android. Given that the average consumer now spends 36 minutes more per day on mobile than desktop (4.1 hours vs. 3.5 hours), we expect to see more mobile-first GenAI products emerge as the technology matures.

Permalink: /feed/a16z-how-consumers-using-generative-ai/

Tags: #ai #generativeai #product #business #venturecapital #vc

lqdev👽09/17/2023

https://wordpress.org/plugins/activitypub/

I can't believe I missed this announcement. This is great to see!

Enter the fediverse with ActivityPub, broadcasting your blog to a wider audience! Attract followers, deliver updates, and receive comments from a diverse user base of ActivityPub-compliant platforms.

With the ActivityPub plugin installed, your WordPress blog itself function as a federated profile, along with profiles for each author. For instance, if your website is example.com, then the blog-wide profile can be found at @example.com@example.com, and authors like Jane and Bob would have their individual profiles at @jane@example.com and @bobz@example.com, respectively.

Permalink: /feed/wordpress-activitypub-plugin/

Tags: #wordpress #activitypub #fediverse #openweb #decentralization #internet #blogging #opensource

lqdev👽09/17/2023

https://huggingface.co/spaces/coqui/xtts

XTTS is a Voice generation model that lets you clone voices into different languages by using just a quick 3-second audio clip.

XTTS is built on previous research, like Tortoise, with additional architectural innovations and training to make cross-language voice cloning and multilingual speech generation possible.

Permalink: /feed/coqui-xtts-text-to-speech/

Tags: #ai #generativeai #speech #huggingface #ml #tts #opensource

lqdev👽09/17/2023

https://huggingface.co/blog/wuerstchen

Würstchen is a diffusion model, whose text-conditional component works in a highly compressed latent space of images. Why is this important? Compressing data can reduce computational costs for both training and inference by orders of magnitude. Training on 1024×1024 images is way more expensive than training on 32×32. Usually, other works make use of a relatively small compression, in the range of 4x - 8x spatial compression. Würstchen takes this to an extreme. Through its novel design, it achieves a 42x spatial compression! This had never been seen before, because common methods fail to faithfully reconstruct detailed images after 16x spatial compression. Würstchen employs a two-stage compression, what we call Stage A and Stage B. Stage A is a VQGAN, and Stage B is a Diffusion Autoencoder (more details can be found in the paper). Together Stage A and B are called the Decoder, because they decode the compressed images back into pixel space. A third model, Stage C, is learned in that highly compressed latent space. This training requires fractions of the compute used for current top-performing models, while also allowing cheaper and faster inference. We refer to Stage C as the Prior.

Permalink: /feed/wurstchen-fast-image-generation/

Tags: #ai #generativeai #imagegeneration #huggingface #diffusion #ml

lqdev👽09/17/2023

https://huggingface.co/blog/t2i-sdxl-adapters

T2I-Adapter is an efficient plug-and-play model that provides extra guidance to pre-trained text-to-image models while freezing the original large text-to-image models. T2I-Adapter aligns internal knowledge in T2I models with external control signals. We can train various adapters according to different conditions and achieve rich control and editing effects.

Over the past few weeks, the Diffusers team and the T2I-Adapter authors have been collaborating to bring the support of T2I-Adapters for Stable Diffusion XL (SDXL) in diffusers. In this blog post, we share our findings from training T2I-Adapters on SDXL from scratch, some appealing results, and, of course, the T2I-Adapter checkpoints on various conditionings (sketch, canny, lineart, depth, and openpose)!

Permalink: /feed/t2i-adapters-efficient-controllable-generation/

Tags: #ai #generativeai #imagegeneration #deeplearning #ml #huggingface

lqdev👽09/12/2023

https://huggingface.co/blog/falcon-180b

Today, we're excited to welcome TII's Falcon 180B to HuggingFace! Falcon 180B sets a new state-of-the-art for open models. It is the largest openly available language model, with 180 billion parameters, and was trained on a massive 3.5 trillion tokens using TII's RefinedWeb dataset. This represents the longest single-epoch pretraining for an open model.

Permalink: /feed/falcon-180b-announcement/

Tags: #ai #llm #opensource #huggingface

lqdev👽09/12/2023

https://retool.com/visual-basic/

How Visual Basic became the world's most dominant programming environment, its sudden fall from grace, and why its influence is still shaping the future of software development.

Permalink: /feed/visual-basic-history-retool/

Tags: #visualbasic #windows

lqdev👽09/10/2023

https://www.modular.com/blog/mojo-its-finally-here

Today, we’re excited to announce the next big step in Mojo’s evolution: Mojo is now available for local download – beginning with Linux systems, and adding Mac and Windows in coming releases.

Permalink: /feed/mojo-available-local-download/

Tags: #ai #python #programminglanguages

lqdev👽09/10/2023

https://github.com/Textualize/textual-web

Textual Web publishes Textual apps and terminals on the web.

Permalink: /feed/textualize-textual-web/

Tags: #tui #terminal #web

lqdev👽09/08/2023

https://www.dreadcentral.com/the-overlook-motel/

I'm a fan of finding hidden gems and overlooked films, especially when it comes to horror. It's how I came across some of my favorites like Hell House and Terrifier. That's why I was excited to run into The Overlook Motel series from Dread Central which spotlights these kinds of films.

Permalink: /feed/dread-central-overlook-motel/

Tags: #movies #horror

lqdev👽09/08/2023

https://mullvad.net/en/blog/2023/9/7/tailscale-has-partnered-with-mullvad/

Today we announce a partnership with Tailscale that allows you to use both in conjunction through the Tailscale app. This functionality is not available through the Mullvad VPN app.

This partnership allows customers of Tailscale to make use of our WireGuard VPN servers as “exit nodes”. This means that whilst connected to Tailscale, you can access your devices across Tailscale’s mesh network, whilst still connecting outbound through Mullvad VPN WireGuard servers in any location.

Permalink: /feed/tailscale-partners-mullvad/

Tags: #privacy #vpn #security

lqdev👽09/07/2023

https://www.jwz.org/blog/2023/09/platos-cave-regrets-to-inform-you-it-will-be-raising-its-rent/

If you are receiving this letter, it means you have been designated a tenant of the cave—i.e., you are chained to the wall, you are forced to watch shadows for all eternity, you are projecting said shadow puppets, and/or you are a philosopher who was able to break free and understand the true shackles of reality (PhD candidates about to argue their thesis).

We do not undertake this lightly. As the costs of maintaining a cave meant to trap you in your ignorance increases year after year, we want you to know, from the bottom of our hearts, that we, too, are suffering. We get that times are tough, and we hope you can extend that sympathy to us, the managers of your cave.

Please rest assured that cave costs are increasing everywhere. We manage many other caves like that of Polyphemus the Cyclops and the childhood home of Zeus. So, trust us: we know caves.

We hope you will continue to enjoy living in our cave. We believe you are a valued part of the Plato's Cave community. Credit, cash, or Venmo all work.

Original Source: https://www.mcsweeneys.net/articles/platos-cave-regrets-to-inform-you-it-will-be-raising-its-rent

Boy laughing and then crying

Permalink: /feed/platos-cave-raising-rent/

Tags: #simulation

lqdev👽09/06/2023

https://www.aaron-powell.com/posts/2023-09-04-generative-ai-and-dotnet---part-2-sdk/

It’s time to have a look at how we can build the basics of an application using Azure OpenAI Services and the .NET SDK.

Permalink: /feed/generative-ai-dotnet-pt-2-powell/

Tags: #ai #dotnet #generativeai #llm #azure

lqdev👽09/06/2023

https://community.torproject.org/onion-services/setup/

This guide shows you how to set up an Onion Service for your website. For the technical details of how the Onion Service protocol works, see our Onion Service protocol page.

Permalink: /feed/set-up-onion-service/

Tags: #tor #internet #privacy #guide

lqdev👽09/06/2023

https://spectrum.ieee.org/doctorow-interoperability

In his new book The Internet Con: How to Seize the Means of Computation, author Cory Doctorow presents a strong case for disrupting Big Tech. While the dominance of Internet platforms like Twitter, Facebook, Instagram, or Amazon is often taken for granted, Doctorow argues that these walled gardens are fenced in by legal structures, not feats of engineering. Doctorow proposes forcing interoperability—any given platform’s ability to interact with another—as a way to break down those walls and to make the Internet freer and more democratic.

Permalink: /feed/interoperability-can-save-open-web-doctorow/

Tags: #openweb #interoperability #internet #opensource

lqdev👽09/06/2023

https://www.microsoft.com/en-us/research/blog/rethinking-trust-in-direct-messages-in-the-ai-era/

This blog post is a part of a series exploring our research in privacy, security, and cryptography. For the previous post, see https://www.microsoft.com/en-us/research/blog/research-trends-in-privacy-security-and-cryptography. While AI has the potential to massively increase productivity, this power can be used equally well for malicious purposes, for example, to automate the creation of sophisticated scam messages. In this post, we explore threats AI can pose for online communication ecosystems and outline a high-level approach to mitigating these threats.

Permalink: /feed/rethinking-trust-dm-ai-era/

Tags: #security #ai #privacy #cryptography #communication #messaging #socialmedia #trust

lqdev👽09/06/2023

https://perplexity.vercel.app/

I built this little tool to help me understand what it's like to be an autoregressive language model. For any given passage of text, it augments the original text with highlights and annotations that tell me how "surprising" each token is to the model, and which other tokens the model thought were most likely to occur in its place. Right now, the LM I'm using is the smallest version of GPT-2, with 124M parameters.

Permalink: /feed/perplexity-interactive-llm-visualizations/

Tags: #ai #llm #visualization #tools

lqdev👽09/05/2023

https://www.fast.ai/posts/2023-09-04-learning-jumps/

Summary: recently while fine-tuning a large language model (LLM) on multiple-choice science exam questions, we observed some highly unusual training loss curves. In particular, it appeared the model was able to rapidly memorize examples from the dataset after seeing them just once. This astonishing feat contradicts most prior wisdom about neural network sample efficiency. Intrigued by this result, we conducted a series of experiments to validate and better understand this phenomenon. It’s early days, but the experiments support the hypothesis that the models are able to rapidly remember inputs. This might mean we have to re-think how we train and use LLMs.

Permalink: /feed/can-llms-learn-single-example/

Tags: #ai #llm #finetuning #fastai

lqdev👽09/02/2023

https://naveenarun.wordpress.com/2023/08/31/doing-laundry-on-campus-without-a-phone/

This article reminds me of this gem 😂

Permalink: /feed/doing-laundry-without-phone/

Tags: #technology #tv

lqdev👽09/01/2023

https://openai.com/blog/teaching-with-ai

We’re releasing a guide for teachers using ChatGPT in their classroom—including suggested prompts, an explanation of how ChatGPT works and its limitations, the efficacy of AI detectors, and bias.

Permalink: /feed/teching-with-ai-openai/

Tags: #ai #education #openai #guide

lqdev👽09/01/2023

https://mastodon.social/@pixelfed/110989967683252426

Solo by Pixelfed

We've been secretly building a single user federated photo sharing server (based on Pixelfed), with minimal setup, and built-in import

Solo is simple, download the code, drag your photos to the media directory and open your browser

So simple, yet super smart, Solo won't get in your way, but it will impress

Launching Oct 2023

Very excited about this!

Permalink: /feed/pixelfed-solo-announcement/

Tags: #fediverse #pixelfed #photos #socialmedia

lqdev👽09/01/2023

https://www.aaron-powell.com/posts/2023-09-01-generative-ai-and-dotnet---part-1-intro/

over this series I’m going to share my learnings on the APIs, SDKs, and the like. The goal here isn’t to “build something” but rather to share what I’ve learnt, the mistakes I’ve made, the things I’ve found confusing, and the code I’ve had to rewrite umpteen times because “oh, that’s a better way to do it”.

Permalink: /feed/generative-ai-dotnet-pt-1-powell/

Tags: #ai #dotnet #generativeai #llm

lqdev👽08/31/2023

https://webllm.mlc.ai/

Llama 2 7B/13B are now available in Web LLM!

Llama 2 70B is also supported.

This project brings large-language model and LLM-based chatbot to web browsers. Everything runs inside the browser with no server support and accelerated with WebGPU. This opens up a lot of fun opportunities to build AI assistants for everyone and enable privacy while enjoying GPU acceleration.

Permalink: /feed/web-llm-llama-2/

Tags: #ai #webgpu #llama #llm

lqdev👽08/31/2023

https://a16z.com/2023/08/30/supporting-the-open-source-ai-community/

We believe artificial intelligence has the power to save the world—and that a thriving open source ecosystem is essential to building this future.

To help close this resource gap, we’re announcing today the a16z Open Source AI Grant program. We’ll support a small group of open source developers through grant funding (not an investment or SAFE note), giving them the opportunity to continue their work without the pressure to generate financial returns.

Permalink: /feed/supporting-open-source-ai-community-az/

Tags: #ai #opensource #machinelearning #llm

lqdev👽08/30/2023

https://www.reality2cast.com/151

Good discussion on the open web, self-hosting, radio, and the indie web.

Permalink: /feed/podcast-wordpress-indieweb-reality2/

Tags: #podcast #indieweb #wordpress #selfhost #radio #openweb #fediverse

lqdev👽08/29/2023

https://simonwillison.net/2023/Aug/27/wordcamp-llms/

My goal today is to provide practical, actionable advice for getting the most out of Large Language Models—both for personal productivity but also as a platform that you can use to build things that you couldn’t build before.

Permalink: /feed/making-llms-work-for-you-willison/

Tags: #ai #llm #wordpress #presentation

lqdev👽08/28/2023

https://www.newyorker.com/culture/cultural-comment/we-dont-need-a-new-twitter

If Meta can succeed in capturing some of this peak-Twitter magic, while avoiding late-stage Twitter’s struggles, the company will perhaps even reclaim some of the cultural gravity that it squandered a decade ago when Facebook took its turn toward crazy-uncle irrelevance. But can Meta possibly succeed in building a saner, nicer Twitter?

Breaking news can spread quickly, as can clips that are funny in an original or strange way—but these innocuous trends feel serendipitous, like a rainbow spanning storm clouds. To reach the Twitter masses, conspiracy, demagoguery, and cancellation are much more likely to succeed. The result is a Faustian bargain for our networked era: trusting the wisdom of crowds to identify what’s interesting can create an intensely compelling stream of shared content, but this content is likely to arrive drenched in rancor.

The obvious way Meta can attempt to escape this bargain is by moving Threads away from retransmission-based curation and toward algorithmic ranking. This will give the company more control over which discussions are amplified, but, in doing so, they will also lose the human-powered selectivity that makes Twitter so engaging.

If we look past this narrow discussion of Threads’ challenges, however, a broader question arises: Why is it so important to create a better version of Twitter in the first place? Ignored amid the hand-wringing about the toxic turn taken by large-scale conversation platforms are the many smaller, less flashy sites and services that have long been supporting a more civilized form of digital interaction.

“The Internet has become the ultimate narrowcasting vehicle: everyone from UFO buffs to New York Yankee fans has a Website (or dozen) to call his own,” the journalist Richard Zoglin wrote in 1996. “A dot-com in every pot.”

We’ve gone from Zoglin’s dot-com in every pot to the social-media age’s vision of every pot being filled with slop from the same platforms.

Permalink: /feed/no-new-twitter-newport/

Tags: #twitter #socialmedia #indieweb #internet #culture

lqdev👽08/24/2023

https://studio.ribbonfarm.com/p/against-waldenponding

Waldenponding (after Thoreau's Walden Pond experiment on which Walden is based). The crude caricature is "smash your smart phone and go live in a log cabin to reclaim your attention and your life from being hacked by evil social media platforms."

Permalink: /feed/against-waldenponding-rao/

Tags: #internet #society #technology

lqdev👽08/24/2023

https://tomcritchlow.com/2022/04/21/new-rss/

RSS is kind of an invisible technology. People call RSS dead because you can’t see it. There’s no feed, no login, no analytics. RSS feels subsurface.

Come to think of it - all the interesting bits of blogging are invisible. The discussion has moved to Twitter, or discords, or DMs. Trackbacks aren’t a thing anymore. So when you see someone blogging all you see is the blog post. The branching replies and conversations are either invisible or hard to track down.

But I believe we’re living in a golden age of RSS. Blogging is booming. My feed reader has 280 feeds in it.

How do we increase the surface area of RSS and blogging?

I think there’s something quietly radical about making your feed reader open by default. It increases the surface area of RSS so others can discover content more easily. It makes blogging more visible.

The nice thing about RSS and OPML is that’s a very extensible spec. The file format is flexible, you can define your own schema and fields. This might open up new kinds of publishing.

Permalink: /feed/increasing-surface-area-blogging-critchlow/

Tags: #rss #internet #blog #postcast #feeds #openweb #openstandards

lqdev👽08/24/2023

https://aboutfeeds.com/

About Feeds is a free site from Matt Webb. I made this site because using web feeds for the first time is hard, and we can fix that.

Permalink: /feed/about-feeds/

Tags: #rss #internet #blog #openweb #openstandards

lqdev👽08/24/2023

https://www.edge.org/conversation/marvin_minsky-consciousness-is-a-big-suitcase

Marvin Minsky is the leading light of AI—artificial intelligence, that is. He sees the brain as a myriad of structures. Scientists who, like Minsky, take the strong AI view believe that a computer model of the brain will be able to explain what we know of the brain's cognitive abilities. Minsky identifies consciousness with high-level, abstract thought, and believes that in principle machines can do everything a conscious human being can do.

Permalink: /feed/consciousness-big-suitcase-minsky/

Tags: #ai #consciousness #computerscience

lqdev👽08/24/2023

https://serenityos.org/

A graphical Unix-like operating system for desktop computers!

SerenityOS is a love letter to '90s user interfaces with a custom Unix-like core. It flatters with sincerity by stealing beautiful ideas from various other systems.

Roughly speaking, the goal is a marriage between the aesthetic of late-1990s productivity software and the power-user accessibility of late-2000s *nix.

This is a system by us, for us, based on the things we like.

Permalink: /feed/serenity-os/

Tags: #os #linux #90s #serenityos #retro

lqdev👽08/24/2023

https://ai.meta.com/blog/code-llama-large-language-model-coding/

Takeaways

Code Llama is a state-of-the-art LLM capable of generating code, and natural language about code, from both code and natural language prompts.

Code Llama is free for research and commercial use.

Code Llama is built on top of Llama 2 and is available in three models:

Code Llama, the foundational code model;

Codel Llama - Python specialized for Python;

and Code Llama - Instruct, which is fine-tuned for understanding natural language instructions.

In our own benchmark testing, Code Llama outperformed state-of-the-art publicly available LLMs on code tasks

Permalink: /feed/meta-code-llama/

Tags: #ai #llm #llama #code #opensource #meta

lqdev👽08/22/2023

https://techcommunity.microsoft.com/t5/excel-blog/announcing-python-in-excel-combining-the-power-of-python-and-the/ba-p/3893439

Today we’re announcing a significant evolution in the analytical capabilities available within Excel by releasing a Public Preview of Python in Excel. Python in Excel makes it possible to natively combine Python and Excel analytics within the same workbook - with no setup required. With Python in Excel, you can type Python directly into a cell, the Python calculations run in the Microsoft Cloud, and your results are returned to the worksheet, including plots and visualizations.

Permalink: /feed/announcing-python-in-excel/

Tags: #python #excel #dataanalytics #ai

lqdev👽08/16/2023

https://www.deeplearning.ai/short-courses/large-language-models-semantic-search/

Keyword search has been a common method for search for many years. But for content-rich websites like news media sites or online shopping platforms, the keyword search capability can be limiting. Incorporating large language models (LLMs) into your search can significantly enhance the user experience by allowing them to ask questions and find information in a much easier way.

This course teaches the techniques needed to leverage LLMs into search.

Throughout the lessons, you’ll explore key concepts like dense retrieval, which elevates the relevance of retrieved information, leading to improved search results beyond traditional keyword search, and reranking, which injects the intelligence of LLMs into your search system, making it faster and more effective.

Permalink: /feed/llm-semantic-search-deeplearningai-course/

Tags: #ai #llm #course #retrievalaugmentedgeneration

lqdev👽08/16/2023

https://eugeneyan.com/writing/llm-patterns/

There are seven key patterns. They’re also organized along the spectrum of improving performance vs. reducing cost/risk, and closer to the data vs. closer to the user.

Evals: To measure performance

RAG: To add recent, external knowledge

Fine-tuning: To get better at specific tasks

Caching: To reduce latency & cost

Guardrails: To ensure output quality

Defensive UX: To anticipate & manage errors gracefully

Collect user feedback: To build our data flywheel

Addendum: how to match these LLM patterns to potential problems

Permalink: /feed/patterns-for-building-llm-systems-products-yan/

Tags: #ai #llm #designpatterns

lqdev👽08/16/2023

https://github.com/huggingface/candle

Candle is a minimalist ML framework for Rust with a focus on performance (including GPU support) and ease of use.

Permalink: /feed/huggingface-candle/

Tags: #ai #machinelearning #opensource #huggingface

lqdev👽08/16/2023

https://huyenchip.com/2023/08/16/llm-research-open-challenges.html

Reduce and measure hallucinations

Optimize context length and context construction

Incorporate other data modalities

Make LLMs faster and cheaper

Design a new model architecture

Develop GPU alternatives

Make agents usable

Improve learning from human preference

Improve the efficiency of the chat interface

Build LLMs for non-English languages

Permalink: /feed/open-challenges-llm-research-huyen/

Tags: #ai #llm #research

lqdev👽08/16/2023

https://www.infoq.com/news/2023/08/jupyter-ai-notebooks/

The open-source Project Jupyter, used by millions for data science and machine learning, has released Jupyter AI, a free tool bringing powerful generative AI capabilities to Jupyter notebooks.

https://jupyter-ai.readthedocs.io/en/latest/

Jupyter AI, which brings generative AI to Jupyter. Jupyter AI provides a user-friendly and powerful way to explore generative AI models in notebooks and improve your productivity in JupyterLab and the Jupyter Notebook. More specifically, Jupyter AI offers:

An %%ai magic that turns the Jupyter notebook into a reproducible generative AI playground. This works anywhere the IPython kernel runs (JupyterLab, Jupyter Notebook, Google Colab, VSCode, etc.).

A native chat UI in JupyterLab that enables you to work with generative AI as a conversational assistant.

Support for a wide range of generative model providers and models (AI21, Anthropic, Cohere, Hugging Face, OpenAI, SageMaker, etc.).

Permalink: /feed/jupyter-ai-notebooks/

Tags: #ai #notebooks #development

lqdev👽07/23/2023

https://retrostrange.com/

Interesting project.

RetroStrange is part independent media experiment, part handmade exaltation of vintage weirdness and the public domain. It is produced by Noah Maher and Phil Nelson.

Permalink: /feed/restrostrange-tv/

Tags: #livestream

lqdev👽07/15/2023

https://developer.wordpress.org/playground/

Not a WordPress user but this is cool, especially this - "Build an entire site, save it, host it"

WordPress Playground makes WordPress instantly accessible for users, learners, extenders, and contributors. You can:

Try a block, a theme, or a plugin

Build an entire site, save it, host it

Test your plugin with many specific WordPress and PHP versions

Embed a real, interactive WordPress site in your tutorial or course

Showcase a plugin or theme on your website

Preview pull requests from your repository

…or even run WordPress locally using the VisualStudio Code plugin or a CLI tool called wp-now

Permalink: /feed/wordpress-playground/

Tags: #indieweb #wordpress #openweb

lqdev👽07/12/2023

https://tildeverse.org/

a loose association of like-minded tilde communities.

tildes are pubnixes in the spirit of tilde.club, which was created in 2014 by paul ford.

Public Access UNIX Systems (PAUS) are a type of server that provide various services to a multi-user community. They first began in the early 1980's and continue today. Early servers ran various flavors of UNIX, hence the name Public Access "UNIX" Systems, but later generations saw a large mix of Unix-variants and, of course, GNU/Linux. To recognize the many different operating systems online today, these systems are increasingly referred to generically as "pubnixes". - Pubnix Hist

Permalink: /feed/tildeverse/

Tags: #linux #community #retrocomputing

lqdev👽07/12/2023

https://nick-black.com/htp-notcurses.pdf

A TUI (text user interface) is a holistic model, view, and controller implemented using character graphics. TUIs, like WIMP3 GUIs, freely move the cursor around their rectilinear display, as opposed to line-oriented CLIs and their ineluctable marches through the scrolling region. Given the same interactive task

• A TUI implementation is almost certainly a smaller memory and disk footprint than a GUI,
• a good TUI implementation might introduce less latency, and
• a properly-done TUI implementation can often be significantly more portable.

Permalink: /feed/hack-the-planet-notcurses-tuis/

Tags: #linux #terminal #tui

lqdev👽07/12/2023

https://the-dam.org/docs/explanations/suc.html

suc provides Slack, Mattermost, etc.’s core features:

Real-time, rich-text chat,
File sharing,
Fine-grained access control,
Straightforward automation and integration with other tools,
Data encryption in transit
and optionally at rest,
state-of-the-art user authentication.

This paper shows how suc implements those features. suc stays small by leveraging the consistent and composable primitives offered by modern UNIX implementations

Permalink: /feed/dam-slack-clone-bash/

Tags: #linux #bash #communication

lqdev👽07/12/2023

https://creatoreconomy.so/p/kaz-coo-shopify-craft-and-no-meetings

The difference between crafters and managers

The difference is in what you spend time on.

"Most people get satisfaction from building — from actually creating things."

As companies scale, optics start playing a larger role. People start spending more time on internal docs than actually talking to customers. How do you prevent this from happening at Shopify?

In most product reviews, product managers spend way too much time preparing the perfect presentation for execs.

At Shopify, our approach to product reviews is different. We want to see how the product actually works by playing with the demo or diving into the code.

We want our PMs to be extremely user-focused, to take full ownership over problems, and to have a high tolerance for risk.

If these attributes aren't present, product managers tend to become "keepers of strategy.” You end up with smart, highly credentialed individuals spending all their time writing strategy memos to increase their team size so that they can write even more strategy memos.

How Shopify rages against meetings

In early 2023, Shopify initiated operation “Chaos Monkey” to:

Cancel all meetings with 3+ people

Reinstate “no meeting Wednesdays”

Remove needless Slack channels

Does Shopify have a strong writing culture to help people communicate without meetings?

Yes, we try to make async decisions as much as we can. We do this in a few ways:

One of our mantras is “Do things, tell people.” You’ll see this plastered on our walls if you come to Shopify’s office.

We built an operating system called GSD (get shit done). This internal tool emphasizes frequent written updates, which are much easier to digest than constant meetings.

A meeting is a bug that some other process didn’t work out.

We focus on the mission. We want to be the all-in-one commerce platform for people to start and grow businesses. We try to avoid getting distracted by other side quests.

The main thing is to keep the main thing the main thing.

Permalink: /feed/interview-shopify-coo-career-path-meetings/

Tags: #career #productivity #product

lqdev👽07/12/2023

https://x.ai/

Today we announce the formation of xAI.

The goal of xAI is to understand the true nature of the universe.

Permalink: /feed/x-ai-launch/

Tags: #ai

lqdev👽07/11/2023

https://keras.io/keras_core/announcement/

We're excited to share with you a new library called Keras Core, a preview version of the future of Keras. In Fall 2023, this library will become Keras 3.0. Keras Core is a full rewrite of the Keras codebase that rebases it on top of a modular backend architecture. It makes it possible to run Keras workflows on top of arbitrary frameworks — starting with TensorFlow, JAX, and PyTorch.

Keras Core is also a drop-in replacement for tf.keras, with near-full backwards compatibility with tf.keras code when using the TensorFlow backend. In the vast majority of cases you can just start importing it via import keras_core as keras in place of from tensorflow import keras and your existing code will run with no issue — and generally with slightly improved performance, thanks to XLA compilation.

Permalink: /feed/keras-core-announcement/

Tags: #deeplearning #ai #keras #jax #tensorflow #pytorch

lqdev👽07/06/2023

https://openai.com/blog/gpt-4-api-general-availability

GPT-4 API general availability and deprecation of older models in the Completions API. GPT-3.5 Turbo, DALL·E and Whisper APIs are also generally available, and we are releasing a deprecation plan for older models of the Completions API, which will retire at the beginning of 2024.

Permalink: /feed/gpt-4-ga/

Tags: #openai #gpt4 #llm #ai

lqdev👽06/29/2023

https://512kb.club/

The 512KB Club is a collection of performance-focused web pages from across the Internet. To qualify your website must satisfy both of the following requirements:

It must be an actual site that contains a reasonable amount of information, not just a couple of links on a page (more info here).

Your total UNCOMPRESSED web resources must not exceed 512KB.

Permalink: /feed/512kb-club/

Tags: #web #smallweb #community

lqdev👽06/29/2023

https://1mb.club/

1MB Club is a growing collection of performance-focused web pages weighing less than 1 megabyte.

Permalink: /feed/1mb-club/

Tags: #web #smallweb #community

lqdev👽06/29/2023

https://proton.me/blog/proton-pass-launch

We’re happy to announce the global launch of Proton Pass...a password manager, one of the most highly demanded services from the Proton community in our annual surveys since we first launched Proton Mail...

Permalink: /feed/protonpass-release/

Tags: #security

lqdev👽06/26/2023

http://fsharpconf.com/

I'm really enjoying the sessions. Kudos to the team who put it together and presenters delivering great content.

If you're interested, check out the stream at http://fsharpconf.com/.

Permalink: /feed/fsharp-conf-2023/

Tags: #fsharp #fsharpconf #dotnet #programming #community #opensource

lqdev👽06/20/2023

https://yewtu.be/watch?v=6bODiZ5bP84

GIF of baby throwing cash out window

Permalink: /feed/book-8088-hand-386-dos-pcs/

Tags: #retrocomputing #review #dos #pcs

lqdev👽06/19/2023

https://reclaimopen.com/

List of resources from the Reclaim Open 2023 Conference.

Permalink: /feed/reclaim-open-2023-resources/

Tags: #reclaimopen #opensource #openweb #edtech

lqdev👽06/06/2023

https://arxiv.org/pdf/2306.02707.pdf

Recent research has focused on enhancing the capability of smaller models through imitation learning, drawing on the outputs generated by large foundation models (LFMs). A number of issues impact the quality of these models, ranging from limited imitation signals from shallow LFM outputs; small scale homogeneous training data; and most notably a lack of rigorous evaluation resulting in overestimating the small model’s capability as they tend to learn to imitate the style, but not the reasoning process of LFMs. To address these challenges, we develop Orca, a 13-billion parameter model that learns to imitate the reasoning process of LFMs. Orca learns from rich signals from GPT-4 including explanation traces; step-by-step thought processes; and other complex instructions, guided by teacher assistance from ChatGPT. To promote this progressive learning, we tap into large-scale and diverse imitation data with judicious sampling and selection. Orca surpasses conventional state-of-the-art instruction-tuned models such as Vicuna-13B by more than 100% in complex zero-shot reasoning benchmarks like Big- Bench Hard (BBH) and 42% on AGIEval. Moreover, Orca reaches parity with ChatGPT on the BBH benchmark and shows competitive performance (4 pts gap with optimized system message) in professional and academic examinations like the SAT, LSAT, GRE, and GMAT, both in zero-shot settings without CoT; while trailing behind GPT-4. Our research indicates that learning from step-by-step explanations, whether these are generated by humans or more advanced AI models, is a promising direction to improve model capabilities and skills.

Permalink: /feed/orca-lm-progressive-learning-gpt-4/

Tags: #ai #finetuning #gpt

lqdev👽06/03/2023

https://gorilla.cs.berkeley.edu/

Large Language Models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis. However, their potential to effectively use tools via API calls remains unfulfilled. This is a challenging task even for today's state-of-the-art LLMs such as GPT-4, largely due to their inability to generate accurate input arguments and their tendency to hallucinate the wrong usage of an API call. We release Gorilla, a finetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls. When combined with a document retriever, Gorilla demonstrates a strong capability to adapt to test-time document changes, enabling flexible user updates or version changes. It also substantially mitigates the issue of hallucination, commonly encountered when prompting LLMs directly. To evaluate the model's ability, we introduce APIBench, a comprehensive dataset consisting of HuggingFace, TorchHub, and TensorHub APIs. The successful integration of the retrieval system with Gorilla demonstrates the potential for LLMs to use tools more accurately, keep up with frequently updated documentation, and consequently increase the reliability and applicability of their outputs.

Code

Permalink: /feed/gorilla-apis-llm/

Tags: #ai #opensource #llm

lqdev👽06/01/2023

https://arxiv.org/abs/2305.10973

...we propose DragGAN, which consists of two main components: 1) a feature-based motion supervision that drives the handle point to move towards the target position, and 2) a new point tracking approach that leverages the discriminative generator features to keep localizing the position of the handle points. Through DragGAN, anyone can deform an image with precise control over where pixels go, thus manipulating the pose, shape, expression, and layout of diverse categories such as animals, cars, humans, landscapes, etc. As these manipulations are performed on the learned generative image manifold of a GAN, they tend to produce realistic outputs even for challenging scenarios such as hallucinating occluded content and deforming shapes that consistently follow the object's rigidity. Both qualitative and quantitative comparisons demonstrate the advantage of DragGAN over prior approaches in the tasks of image manipulation and point tracking. We also showcase the manipulation of real images through GAN inversion.

Code

Permalink: /feed/draggan/

Tags: #ai #computer-vision

lqdev👽06/01/2023

https://huggingface.co/blog/starcoder

StarCoder and StarCoderBase are Large Language Models for Code (Code LLMs) trained on permissively licensed data from GitHub, including from 80+ programming languages, Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, we trained a ~15B parameter model for 1 trillion tokens. We fine-tuned StarCoderBase model for 35B Python tokens, resulting in a new model that we call StarCoder.

Permalink: /feed/hf-starcoder-llm/

Tags: #ai #code

lqdev👽06/01/2023

https://github.com/microsoft/LoRA

LoRA reduces the number of trainable parameters by learning pairs of rank-decompostion matrices while freezing the original weights. This vastly reduces the storage requirement for large language models adapted to specific tasks and enables efficient task-switching during deployment all without introducing inference latency. LoRA also outperforms several other adaptation methods including adapter, prefix-tuning, and fine-tuning.

Paper

Permalink: /feed/microsoft-lora-llm/

Tags: #ai #finetune #llm

lqdev👽06/01/2023

https://arxiv.org/abs/2305.02463

We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D generative models which produce a single output representation, Shap-E directly generates the parameters of implicit functions that can be rendered as both textured meshes and neural radiance fields. We train Shap-E in two stages: first, we train an encoder that deterministically maps 3D assets into the parameters of an implicit function; second, we train a conditional diffusion model on outputs of the encoder. When trained on a large dataset of paired 3D and text data, our resulting models are capable of generating complex and diverse 3D assets in a matter of seconds. When compared to Point-E, an explicit generative model over point clouds, Shap-E converges faster and reaches comparable or better sample quality despite modeling a higher-dimensional, multi-representation output space.

Code

Permalink: /feed/openai-shap-e/

Tags: #ai #openai #3d

lqdev👽06/01/2023

https://wordpress.com/blog/2023/06/01/newsletters-paid-subscriptions/

...we’re introducing a big update — the ability to add paid subscriptions and premium content, whatever plan you’re on. Including the Free plan.

Paid subscriptions let your fans support your art, writing, or project directly.

Permalink: /feed/wordpress-paid-newsletter/

Tags: #web #blogging #wordpress #newsletters

lqdev👽05/22/2023

https://ai.facebook.com/blog/multilingual-model-speech-recognition/

The Massively Multilingual Speech (MMS) project expands speech technology from about 100 languages to over 1,000 by building a single multilingual speech recognition model supporting over 1,100 languages (more than 10 times as many as before), language identification models able to identify over 4,000 languages (40 times more than before), pretrained models supporting over 1,400 languages, and text-to-speech models for over 1,100 languages. Our goal is to make it easier for people to access information and to use devices in their preferred language.

Paper
Code

Permalink: /feed/meta-mms-tts-stt/

Tags: #ai #nlp #speech-to-text #text-to-speech

lqdev👽05/14/2023

https://implement-dns.wizardzines.com/

Permalink: /feed/implement-dns-weekend/

Tags: #internet #dns #www #zine

lqdev👽05/11/2023

https://ai.google/discover/palm2

PaLM 2 is our next generation large language model that builds on Google’s legacy of breakthrough research in machine learning and responsible AI.

It excels at advanced reasoning tasks, including code and math, classification and question answering, translation and multilingual proficiency, and natural language generation better than our previous state-of-the-art LLMs, including PaLM. It can accomplish these tasks because of the way it was built – bringing together compute-optimal scaling, an improved dataset mixture, and model architecture improvements.

PaLM 2 is grounded in Google’s approach to building and deploying AI responsibly. It was evaluated rigorously for its potential harms and biases, capabilities and downstream uses in research and in-product applications. It’s being used in other state-of-the-art models, like Med-PaLM 2 and Sec-PaLM, and is powering generative AI features and tools at Google, like Bard and the PaLM API.

Technical Report

Permalink: /feed/PaLM-2/

Tags: #ai

lqdev👽05/11/2023

https://huggingface.co/docs/transformers/en/transformers_agents

Transformers Agent...provides a natural language API on top of transformers: we define a set of curated tools and design an agent to interpret natural language and to use these tools. It is extensible by design; we curated some relevant tools, but we’ll show you how the system can be extended easily to use any tool developed by the community.

Permalink: /feed/huggingface-transformer-agents/

Tags: #transformers #ai

lqdev👽05/11/2023

https://githubnext.com/projects/copilot-for-docs

Whether you’re learning a new library or API or you’ve been using it for years, it can feel like the documentation gets in your way more than it helps. Maybe the tutorials are too basic, or the reference manual is too sketchy, or the relevant information is split across multiple pages full of irrelevant details.

We’re exploring a way to get you the information you need, faster. By surfacing the most relevant content for questions with tailored summaries that help connect the dots, Copilot for docs saves developers from scouring reams of documentation.

Permalink: /feed/copilot-docs/

Tags: #ai #documentation #developers

lqdev👽05/11/2023

https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/

In ChatGPT Prompt Engineering for Developers, you will learn how to use a large language model (LLM) to quickly build new and powerful applications. Using the OpenAI API, you’ll be able to quickly build capabilities that learn to innovate and create value in ways that were cost-prohibitive, highly technical, or simply impossible before now.

Permalink: /feed/chatgpt-prompt-engineering-developers/

Tags: #ai #gpt #promptengineering

lqdev👽05/09/2023

https://ai.facebook.com/blog/imagebind-six-modalities-binding-ai/

...ImageBind, the first AI model capable of binding information from six modalities. The model learns a single embedding, or shared representation space, not just for text, image/video, and audio, but also for sensors that record depth (3D), thermal (infrared radiation), and inertial measurement units (IMU), which calculate motion and position.

Permalink: /feed/imagebind/

Tags: #ai #embeddings

lqdev👽05/09/2023

https://openaipublic.blob.core.windows.net/neuron-explainer/paper/index.html

This paper applies automation to the problem of scaling an interpretability technique to all the neurons in a large language model. Our hope is that building on this approach of automating interpretability will enable us to comprehensively audit the safety of models before deployment.

Our technique seeks to explain what patterns in text cause a neuron to activate. It consists of three steps:

Explain the neuron's activations using GPT-4

Simulate activations using GPT-4, conditioning on the explanation

Score the explanation by comparing the simulated and real activations

Permalink: /feed/large-language-models-explain-neural-network-neurons/

Tags: #ai #llm #deeplearning

lqdev👽04/20/2023

https://github.blog/2023-04-14-how-generative-ai-is-changing-the-way-developers-work/

Permalink: /feed/generative-ai-developer-work/

Tags: #ai #engineering #software

lqdev👽04/20/2023

https://huggingface.co/blog/peft

...as models get larger and larger, full fine-tuning becomes infeasible to train on consumer hardware. In addition, storing and deploying fine-tuned models independently for each downstream task becomes very expensive, because fine-tuned models are the same size as the original pretrained model. Parameter-Efficient Fine-tuning (PEFT) approaches are meant to address both problems!

PEFT approaches only fine-tune a small number of (extra) model parameters while freezing most parameters of the pretrained LLMs, thereby greatly decreasing the computational and storage costs. This also overcomes the issues of catastrophic forgetting, a behaviour observed during the full finetuning of LLMs. PEFT approaches have also shown to be better than fine-tuning in the low-data regimes and generalize better to out-of-domain scenarios. It can be applied to various modalities, e.g., image classification and stable diffusion dreambooth.

Permalink: /feed/peft-parameter-efficient-fine-tuning/

Tags: #ai #finetuning

lqdev👽04/20/2023

https://huggingface.co/datasets/Cohere/wikipedia-22-12-simple-embeddings

Permalink: /feed/wikipedia-embeddings/

Tags: #ai #dataset #embeddings

lqdev👽04/20/2023

https://blog.jim-nielsen.com/2023/offline-is-online-with-extreme-latency/

you can think of online/offline as part of the same continuum just different measurements of latency. There are gradations of latency when you’re “online”, and “offline” is merely at the slowest end of that spectrum.

Permalink: /feed/nielsen-offline-online-latency/

Tags: #web #offline

lqdev👽04/18/2023

https://llava-vl.github.io/

LLaVA represents a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding, achieving impressive chat capabilities mimicking spirits of the multimodal GPT-4 and setting a new state-of-the-art accuracy on Science QA.

Instruction tuning large language models (LLMs) using machine-generated instruction-following data has improved zero-shot capabilities on new tasks in the language domain, but the idea is less explored in the multimodal field.

1. Multimodal Instruct Data. We present the first attempt to use language-only GPT-4 to generate multimodal language-image instruction-following data.
2. LLaVA Model. We introduce LLaVA (Large Language-and-Vision Assistant), an end-to-end trained large multimodal model that connects a vision encoder and LLM for general-purpose visual and language understanding.
3. Performance. Our early experiments show that LLaVA demonstrates impressive multimodel chat abilities, sometimes exhibiting the behaviors of multimodal GPT-4 on unseen images/instructions, and yields a 85.1% relative score compared with GPT-4 on a synthetic multimodal instruction-following dataset. When fine-tuned on Science QA, the synergy of LLaVA and GPT-4 achieves a new state-of-the-art accuracy of 92.53%.
4. Open-source. We make GPT-4 generated visual instruction tuning data, our model and code base publicly available.

Permalink: /feed/llava-language-vision-assistant/

Tags: #ai #language #vision

lqdev👽04/15/2023

https://developer.mozilla.org/en-US/docs/Web/API/WebGPU_API

The WebGPU API enables web developers to use the underlying system's GPU (Graphics Processing Unit) to carry out high-performance computations and draw complex images that can be rendered in the browser.

Permalink: /feed/webgpu-api/

Tags: #ai #web #openstandards

lqdev👽04/14/2023

https://github.com/openai/consistency_models

Paper: https://arxiv.org/abs/2303.01469

Diffusion models have made significant breakthroughs in image, audio, and video generation, but they depend on an iterative generation process that causes slow sampling speed and caps their potential for real-time applications. To overcome this limitation, we propose consistency models, a new family of generative models that achieve high sample quality without adversarial training. They support fast one-step generation by design, while still allowing for few-step sampling to trade compute for sample quality. They also support zero-shot data editing, like image inpainting, colorization, and super-resolution, without requiring explicit training on these tasks. Consistency models can be trained either as a way to distill pre-trained diffusion models, or as standalone generative models. Through extensive experiments, we demonstrate that they outperform existing distillation techniques for diffusion models in one- and few-step generation. For example, we achieve the new state-of-the-art FID of 3.55 on CIFAR-10 and 6.20 on ImageNet 64x64 for one-step generation. When trained as standalone generative models, consistency models also outperform single-step, non-adversarial generative models on standard benchmarks like CIFAR-10, ImageNet 64x64 and LSUN 256x256.

Permalink: /feed/openai-consistency-models/

Tags: #openai #ai

lqdev👽04/12/2023

https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm

Permalink: /feed/free-dolly-llm/

Tags: #ai #llm #oss

lqdev👽04/11/2023

https://arxiv.org/pdf/2304.03442.pdf

In this paper, we introduce generative agents—computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day. To enable generative agents, we describe an architecture that extends a large language model to store a complete record of the agent’s experiences using natural language, synthesize those memories over time into higher-level reflections, and retrieve them dynamically to plan behavior. We instantiate generative agents to populate an interactive sandbox environment inspired by The Sims, where end users can interact with a small town of twenty five agents using natural language. In an evaluation, these generative agents produce believable individual and emergent social behaviors...

By fusing large language models with computational, interactive agents, this work introduces architectural and interaction patterns for enabling believable simulations of human behavior.

Permalink: /feed/llm-generative-agents/

Tags: #ai #llm

lqdev👽04/05/2023

https://bair.berkeley.edu/blog/2023/04/03/koala/

...Koala, a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data gathered from the web.

Many of the most capable LLMs require huge computational resources to train, and oftentimes use large and proprietary datasets. This suggests that in the future, highly capable LLMs will be largely controlled by a small number of organizations, and both users and researchers will pay to interact with these models without direct access to modify and improve them on their own. On the other hand, recent months have also seen the release of increasingly capable freely available or (partially) open-source models, such as LLaMA. These systems typically fall short of the most capable closed models, but their capabilities have been rapidly improving. This presents the community with an important question: will the future see increasingly more consolidation around a handful of closed-source models, or the growth of open models with smaller architectures that approach the performance of their larger but closed-source cousins?

Our results suggest that learning from high-quality datasets can mitigate some of the shortcomings of smaller models, maybe even matching the capabilities of large closed-source models in the future.

Permalink: /feed/koala-dialogue-model/

Tags: #ai #llm

lqdev👽04/04/2023

http://lqdev.me/tags

My response

Permalink: /feed/test-tagged-response/

Tags: #webdev

lqdev👽04/04/2023

https://www.fast.ai/posts/part2-2023.html

From Deep Learning Foundations to Stable Diffusion...is part 2 of Practical Deep Learning for Coders.

In this course, containing over 30 hours of video content, we implement the astounding Stable Diffusion algorithm from scratch!

Permalink: /feed/deep-learning-foundations-stable-diffusion/

Tags: #untagged

lqdev👽03/29/2023

https://learn.microsoft.com/semantic-kernel/howto/schillacelaws

The "Schillace Laws" were formulated after working with a variety of Large Language Model (LLM) AI systems to date. Knowing them will accelerate your journey into this exciting space of reimagining the future of software engineering.

Permalink: /feed/schillace-laws-semantic-ai/

Tags: #untagged

lqdev👽03/28/2023

https://blog.darklang.com/gpt/

On February 1, we stopped working on what we're now calling "darklang-classic", and are fully heads down on building "darklang-gpt", which is the same core Darklang but redesigned to have AI as the primary (or possibly only) way of writing code.

Permalink: /feed/darklang-gpt/

Tags: #untagged

lqdev👽03/28/2023

https://laion.ai/blog/open-flamingo/

...OpenFlamingo, an open-source reproduction of DeepMind's Flamingo model. At its core, OpenFlamingo is a framework that enables training and evaluation of large multimodal models (LMMs).

Permalink: /feed/laion-openflamingo-vision-llm/

Tags: #untagged

lqdev👽03/28/2023

https://www.cerebras.net/blog/cerebras-gpt-a-family-of-open-compute-efficient-large-language-models/

Cerebras open sources seven GPT-3 models from 111 million to 13 billion parameters. Trained using the Chinchilla formula, these models set new benchmarks for accuracy and compute efficiency.

Today’s release is designed to be used by and reproducible by anyone. All models, weights, and checkpoints are available on Hugging Face and GitHub under the Apache 2.0 license. Additionally, we provide detailed information on our training methods and performance results in our forthcoming paper. The Cerebras CS-2 systems used for training are also available on-demand via Cerebras Model Studio.

Permalink: /feed/cerebras-gpt/

Tags: #untagged

lqdev👽03/21/2023

https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/

This post only focuses on prompt engineering for autoregressive language models, so nothing with Cloze tests, image generation or multimodality models. At its core, the goal of prompt engineering is about alignment and model steerability.

Permalink: /feed/prompt-engineering-weng/

Tags: #untagged

lqdev👽03/14/2023

https://openai.com/product/gpt-4

GPT-4 can accept images as inputs and generate captions, classifications, and analyses.

GPT-4 is capable of handling over 25,000 words of text, allowing for use cases like long form content creation, extended conversations, and document search and analysis.

Permalink: /feed/openai-gpt-4/

Tags: #untagged

lqdev👽03/14/2023

https://crfm.stanford.edu/2023/03/13/alpaca.html

We are releasing our findings about an instruction-following language model, dubbed Alpaca, which is fine-tuned from Meta’s LLaMA 7B model. We train the Alpaca model on 52K instruction-following demonstrations generated in the style of self-instruct using text-davinci-003. Alpaca shows many behaviors similar to OpenAI’s text-davinci-003, but is also surprisingly small and easy/cheap to reproduce.

Permalink: /feed/alpaca-model-stanford/

Tags: #untagged

lqdev👽03/14/2023

https://danielmiessler.com/blog/spqa-ai-architecture-replace-existing-software/

AI-based applications will be completely different than those we have today. The new architecture will be a far more elegant, four-component structure based around GPTs: State, Policy, Questions, and Action.

A security program using SPQA

Choose the base model — You start with the latest and greatest overall GPT model from OpenAI, Google, Meta, McKinsey, or whoever. Lots of companies will have one. Let’s call it OpenAI’s GPT-6. It already knows so incredibly much about security, biotech, project management, scheduling, meetings, budgets, incident response, and audit preparedness that you might be able to survive with it alone. But you need more personalized context.

Train your custom model — Then you train your custom model which is based on your own data, which will stack on top of GPT-6. This is all the stuff in the STATE section above. It’s your company’s telemetry and context. Logs. Docs. Finances. Chats. Emails. Meeting transcripts. Everything. It’s a small company and there are compression algorithms as part of the Custom Model Generation (CMG) product we use, so it’s a total of 312TB of data. You train your custom model on that.

Train your policy model — Now you train another model that’s all about your company’s desires. The mission, the goals, your anti-goals, your challenges, your strategies. This is the guidance that comes from humans that we’re using to steer the ACTION part of the architecture. When we ask it to make stuff for us, and build out our plans, it’ll do so using the guardrails captured here in the POLICY.

Tell the system to take the following actions — Now the models are combined. We have GPT-6, stacked with our STATE model, also stacked with our POLICY model, and together they know us better than we know ourselves.

Permalink: /feed/spqa-ai-based-architecture-replace-software/

Tags: #untagged

lqdev👽03/14/2023

https://simonwillison.net/2023/Mar/11/llama/

The race is on to release the first fully open language model that gives people ChatGPT-like capabilities on their own devices.

Permalink: /feed/large-language-models-stable-diffusion-moment/

Tags: #untagged

lqdev👽02/23/2023

https://bitwarden.com/blog/access-your-bitwarden-vault-without-a-password/

Bitwarden expands the Log in with device option that lets you use a second device to authenticate your Bitwarden vault login instead of using your Bitwarden password.

Permalink: /feed/access-bitwarden-vault-passwordless/

Tags: #untagged

lqdev👽02/23/2023

https://dcai.csail.mit.edu/

Data-Centric AI (DCAI) is an emerging science that studies techniques to improve datasets, which is often the best way to improve performance in practical ML applications. While good data scientists have long practiced this manually via ad hoc trial/error and intuition, DCAI considers the improvement of data as a systematic engineering discipline.

This is the first-ever course on DCAI. This class covers algorithms to find and fix common issues in ML data and to construct better datasets, concentrating on data used in supervised learning tasks like classification. All material taught in this course is highly practical, focused on impactful aspects of real-world ML applications, rather than mathematical details of how particular models work. You can take this course to learn practical techniques not covered in most ML classes, which will help mitigate the “garbage in, garbage out” problem that plagues many real-world ML applications.

Permalink: /feed/introduction-data-centric-ai-mit/

Tags: #untagged

lqdev👽02/23/2023

https://karpathy.medium.com/software-2-0-a64152b37c35

The “classical stack” of Software 1.0 is what we’re all familiar with — it is written in languages such as Python, C++, etc...Software 2.0 is written in much more abstract, human unfriendly language, such as the weights of a neural network.

Benefits of Software 2.0

Computationally homogeneous
Simple to bake into silicon
Constant running time
Constant memory use
Highly portable
Agile
Modules can meld into an optimal whole
It is better than you

Limitations of Software 2.0

At the end of the optimization we’re left with large networks that work well, but it’s very hard to tell how. Across many applications areas, we’ll be left with a choice of using a 90% accurate model we understand, or 99% accurate model we don’t.

The 2.0 stack can fail in unintuitive and embarrassing ways ,or worse, they can “silently fail”

...the existence of adversarial examples and attacks highlights the unintuitive nature of this stack.

Programming Software 2.0

If you recognize Software 2.0 as a new and emerging programming paradigm instead of simply treating neural networks as a pretty good classifier in the class of machine learning techniques, the extrapolations become more obvious, and it’s clear that there is much more work to do.

In particular, we’ve built up a vast amount of tooling that assists humans in writing 1.0 code, such as powerful IDEs with features like syntax highlighting, debuggers, profilers, go to def, git integration, etc. In the 2.0 stack, the programming is done by accumulating, massaging and cleaning datasets. Who is going to develop the first Software 2.0 IDEs, which help with all of the workflows in accumulating, visualizing, cleaning, labeling, and sourcing datasets?

Github is a very successful home for Software 1.0 code. Is there space for a Software 2.0 Github? In this case repositories are datasets and commits are made up of additions and edits of the labels.

Traditional package managers and related serving infrastructure like pip, conda, docker, etc. help us more easily deploy and compose binaries. How do we effectively deploy, share, import and work with Software 2.0 binaries? What is the conda equivalent for neural networks?

Permalink: /feed/software-2-0-karpathy/

Tags: #untagged

lqdev👽02/15/2023

https://huggingface.co/blog/blip-2

This guide introduces BLIP-2 from Salesforce Research that enables a suite of state-of-the-art visual-language models that are now available in 🤗 Transformers. We'll show you how to use it for image captioning, prompted image captioning, visual question-answering, and chat-based prompting.

BLIP-2 bridges the modality gap between vision and language models by adding a lightweight Querying Transformer (Q-Former) between an off-the-shelf frozen pre-trained image encoder and a frozen large language model. Q-Former is the only trainable part of BLIP-2; both the image encoder and language model remain frozen.

Permalink: /feed/blip-2-zero-shot-image-to-text-generation/

Tags: #untagged

lqdev👽02/14/2023

https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/

Permalink: /feed/what-is-chatgpt-why-it-works-wolfram/

Tags: #untagged

lqdev👽02/07/2023

https://www.schneier.com/blog/archives/2023/02/attacking-machine-learning-systems.html

At their core, modern ML systems have complex mathematical models that use training data to become competent at a task. And while there are new risks inherent in the ML model, all of that complexity still runs in software. Training data are still stored in memory somewhere. And all of that is on a computer, on a network, and attached to the Internet. Like everything else, these systems will be hacked through vulnerabilities in those more conventional parts of the system.

Permalink: /feed/attacking-machine-learning-systems/

Tags: #untagged

lqdev👽02/07/2023

https://github.com/microsoft/LMOps

LMOps is a research initiative on fundamental research and technology for building AI products w/ foundation models, especially on the general technology for enabling AI capabilities w/ LLMs and Generative AI models.

Better Prompts: Promptist, Extensible prompts

Longer Context: Structured prompting, Length-Extrapolatable Transformers

Knowledge Augmentation (TBA)

Fundamentals

Permalink: /feed/microsoft-lmops-github/

Tags: #untagged

lqdev👽01/30/2023

https://answers.microsoft.com/skype/forum/all/release-notes-for-skype-893/6407f3c4-4037-4e6b-b93f-1953408cf9f3

A few updates I found interesting.

The Beauty of languages: You can now use your own voice during a translated call in Skype.

Universal translator: During a Skype call, if a participant speaks different languages, Skype Translator will automatically detect the languages and translate it for you.

Extra! Extra! Read all about it: Stay up-to-date with the latest news and trends.

Hit me up sometime: You can easily add contacts in Skype on mobile using a unique QR code to get connected.

I have a few thoughts on the Today feature and overall updates Skype has been receiving over the past few months but I'll leave those for another post.

Permalink: /feed/skype-8-93-release/

Tags: #untagged

lqdev👽01/29/2023

https://www.windowscentral.com/how-install-manjaro-wsl-windows-10-and-11

Permalink: /feed/windows-central-install-manjaro-wsl-guide/

Tags: #untagged

lqdev👽01/28/2023

https://webmention.rocks/test/1

Testing WebmentionFs send integration. From Website.

Permalink: /feed/webmentionfs-send-test/

Tags: #untagged

lqdev👽01/26/2023

https://www.windowscentral.com/software-apps/windows-11/microsoft-edge-phoenix-is-an-internal-reimagining-of-the-edge-web-browser-with-a-new-ui-and-more-features

As someone who spends a significant amount of time in the browser, making Edge, more specifically the browser the platform for applications, this looks appealing. Especially when you consider Progressive Web App (PWA) support and integrations with the Windows Store.

Permalink: /feed/microsoft-edge-phoenix-browser/

Tags: #untagged

lqdev👽01/25/2023

https://newsletter.danhon.com/archive/s14e09-building-and-documentation-making-new/

Automattic, the current owner of tumblr, one of the biggest meme mines in the world, has said they’re going to implement ActivityPub, which is the underlying protocol on which Mastodon operates. There are boring ways this could go, and then interesting ways this could go.

Interesting points. I'm looking forward to how this shakes out.

Permalink: /feed/hon-eternal-tumblr/

Tags: #untagged

lqdev👽01/25/2023

https://www.coursera.org/specializations/mathematics-for-machine-learning-and-data-science

Master the Toolkit of AI and Machine Learning. Mathematics for Machine Learning and Data Science is a beginner-friendly specialization where you’ll learn the fundamental mathematics toolkit of machine learning: calculus, linear algebra, statistics, and probability.

Permalink: /feed/math-machine-learning-data-science-mooc/

Tags: #untagged

lqdev👽01/25/2023

https://blog.jim-nielsen.com/2023/best-time-to-own-a-domain/

if own your domain, create value there, and drive people to it, you’re paying ~$10 a year to build unbounded value over the years — value you control.

That is why owning a domain (and publishing your content there) is like planting a tree: it’s value that starts small and grows. The best time to own a domain and publish your content there was 20 years ago. The second best time is today.

I love a technology like rel=me which pushes the idea of domain ownership into broader and broader spheres of society — “How do I get that nice little green checkmark on my profile?” It reinforces the value of owning your own domain (which you can verify elsewhere) and encourages citizens of the internet people to build value in their own little corners of the world wide web.

Permalink: /feed/nielsen-best-time-to-own-domain/

Tags: #untagged

lqdev👽01/25/2023

https://bitwarden.com/blog/bitwarden-open-source-security-explained/

This article answers three common questions about how being open source strengthens Bitwarden security, transparency, and privacy.

Permalink: /feed/bitwarden-open-source-security-explained/

Tags: #untagged

lqdev👽01/24/2023

https://github.com/fsprojects/BioProviders

Permalink: /feed/github-fsharp-bioproviders/

Tags: #untagged

lqdev👽01/24/2023

https://github.com/satellite-image-deep-learning/remote-sensing-datasets

Permalink: /feed/ml-remote-sensing-datasets/

Tags: #untagged

lqdev👽01/19/2023

http://simonwillison.net/2023/Jan/13/semantic-search-answers/#atom-everything

Here's how to do this:

Run a text search (or a semantic search, described later) against your documentation to find content that looks like it could be relevant to the user's question.

Grab extracts of that content and glue them all together into a blob of text.

Construct a prompt consisting of that text followed by "Given the above content, answer the following question: " and the user's question

Send the whole thing through the GPT-3 API and see what comes back

Permalink: /feed/implement-q-a-documentation-gpt-willison/

Tags: #untagged

lqdev👽01/18/2023

https://iximiuz.com/en/posts/ssh-tunnels/

Permalink: /feed/visual-guide-ssh-tunnels/

Tags: #untagged

lqdev👽01/18/2023

https://www.youtube.com/watch?v=kCc8FmEb1nY

Permalink: /feed/karpathy-lets-build-gpt-from-scratch-video/

Tags: #untagged

lqdev👽01/11/2023

https://chriscoyier.net/2023/01/05/a-big-pile-of-personal-developer-designer-blogs-in-an-opml-file/

OPML File: https://chriscoyier.s3.amazonaws.com/personal-developer-blogs.xml

Permalink: /feed/coyier-big-list-dev-designer-blogs-opml-file/

Tags: #untagged

lqdev👽01/11/2023

https://blog.jim-nielsen.com/2023/subscribe-wherever-you-get-your-content/

Wouldn’t it be amazing if a similar phrase could enter the larger public consciousness for blogs or even people who you follow online?

Instead of:
“Follow me on Twitter.”  
“I’m @realChuckieCheese on Instagram.”  
“Subscribe to my videos on YouTube.”  
You’d hear some popular influencer saying:
“Follow me wherever you follow people online.”  
“Find me and subscribe wherever you get your content.”

In a world like this, perhaps apex domains could become the currency of online handles: follow me @jim-nielsen.com.

Permalink: /feed/subscribe-wherever-you-get-your-content-nielsen/

Tags: #untagged

lqdev👽01/11/2023

https://localghost.dev/blog/building-a-website-like-it-s-1999-in-2022/

Now the only challenges for me are:

Should the song on my website match my ringtone?
Where do I place the flames GIF?

Flames GIF

Permalink: /feed/building-website-like-1999-in-2022-localghost/

Tags: #untagged

lqdev👽01/04/2023

https://bringback.blog/

If you're looking for new folks follow, here's a list of blogs on a variety of topics.

Permalink: /feed/bring-back-blogs-directory/

Tags: #untagged

lqdev👽01/04/2023

https://www.windowscentral.com/hardware/computers-desktops/intel-core-i3-n-series-launch-ces-2023

Overview of Intel N-Series Processor Features — Source: *Intel*

The article suggests these processors are for the education market. Hopefully a few of the devices with these processors in a Surface Go form-factor are made generally available as well. I think the N200 series is the sweet spot in terms of performance and battery life with only 6W of max turbo power. The i3 which draws a max of 15W may be too much. At that point, some of the newly announced U-series processors might make more sense.

Permalink: /feed/ces-intel-entry-level-n-series-processor/

Tags: #untagged

lqdev👽01/03/2023

https://hacdias.com/2022/12/26/personal-websites-and-online-identities

Great post.

For me, I've found POSSE to Twitter and Mastodon using RSS and IFTTT-like services has been a relatively low-effort thing to do. The downside though is any follow-up conversations take place on those platforms. That hasn't been an issue for me yet though so until it is, I'll continue to use my relatively low-effort solution.

I agree with you on likes / reposts. My current implementation of those posts is focused on Webmentions, which means it's virtually meaningless since few people know about let alone integrate Webmentions into their website. I would argue replies tend to fall into that same space as well. I have however found great use in bookmark posts to which I try and add some content from the source document to help me quickly identfy why I thought that post / website was interesting or relevant to a specific topic at that time.

Permalink: /feed/personal-websites-and-online-identifies-hd/

Tags: #untagged

lqdev👽01/03/2023

https://github.com/karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs. It's a re-write of minGPT, which I think became too complicated, and which I am hesitant to now touch. Still under active development, currently working to reproduce GPT-2 on OpenWebText dataset. The code itself aims by design to be plain and readable: train.py is a ~300-line boilerplate training loop and model.py a ~300-line GPT model definition, which can optionally load the GPT-2 weights from OpenAI. That's it.

Permalink: /feed/nanoGPT/

Tags: #untagged

lqdev👽01/03/2023

https://petals.ml/

Run 100B+ language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Permalink: /feed/petals-ml-model-inference/

Tags: #untagged

lqdev👽01/03/2023

https://www.theverge.com/23513418/bring-back-personal-blogging

In the beginning, there were blogs, and they were the original social web. We built community. We found our people. We wrote personally. We wrote frequently. We self-policed, and we linked to each other so that newbies could discover new and good blogs.

The biggest reason personal blogs need to make a comeback is a simple one: we should all be in control of our own platforms.

People built entire communities around their favorite blogs, and it was a good thing. You could find your people, build your tribe, and discuss the things your collective found important.

Buy that domain name. Carve your space out on the web. Tell your stories, build your community, and talk to your people. It doesn’t have to be big. It doesn’t have to be fancy. You don’t have to reinvent the wheel. It doesn’t need to duplicate any space that already exists on the web — in fact, it shouldn’t. This is your creation. It’s your expression. It should reflect you.

Permalink: /feed/bring-back-personal-blogging/

Tags: #untagged

lqdev👽01/02/2023

https://openai.com/blog/new-and-improved-embedding-model/

The new model, text-embedding-ada-002, replaces five separate models for text search, text similarity, and code search, and outperforms our previous most capable model, Davinci, at most tasks, while being priced 99.8% lower.

Permalink: /feed/openai-text-embedding-ada-002/

Tags: #untagged

lqdev👽01/02/2023

https://www.swyx.io/fave-podcasts-2022/

Permalink: /feed/swyx-favorite-podcasts-2022/

Tags: #untagged

lqdev👽12/29/2022

https://www.bobble.tech/free-stuff/used-thinkpad-buyers-guide

Permalink: /feed/used-thinkpad-buyers-guide/

Tags: #untagged

lqdev👽12/28/2022

https://danielmiessler.com/blog/its-time-to-get-back-into-rss/

We all mourned when Reader died and took RSS with it, but it's time to return to what made it great

Permalink: /feed/miessler-time-get-back-rss/

Tags: #untagged

lqdev👽12/28/2022

https://umbrel.com/

Really cool project. Don't really care for the crypto stuff but the rest looks very interesting.

Permalink: /feed/umbrel-personal-server-os-selfhosting/

Tags: #untagged

lqdev👽12/28/2022

https://e2eml.school/transformers.html

Permalink: /feed/e2eml-transformers-from-scratch/

Tags: #untagged

lqdev👽12/27/2022

https://matrix.org/blog/2022/12/25/the-matrix-holiday-update-2022

These updates are exciting!

Reddit appears to be building out new Chat functionality using Matrix

Discourse is working on adding Matrix support

Thunderbird launched Matrix support.

Automattic is busy building Matrix plugins for Wordpress

Not as exciting

...only a handful of these initiatives have resulted in funding reaching the core Matrix team. This is directly putting core Matrix development at risk.

In short: folks love the amazing decentralised encrypted comms utopia of Matrix. But organisations also love that they can use it without having to pay anyone to develop or maintain it. This is completely unsustainable, and Element is now literally unable to fund the entirety of the Matrix Foundation on behalf of everyone else - and has had to lay off some of the folks working on the core team as a result.

In the interim, if you are an organisation who’s building on Matrix and you want the project to continue to flourish, please mail funding@matrix.org to discuss how you can support the foundations that you are depending on.

I'm looking forward to the future of the Matrix protocol, especially the P2P components.

Permalink: /feed/matrix-holiday-edition-2022/

Tags: #untagged

lqdev👽12/27/2022

https://www.appropedia.org/Welcome_to_Appropedia

Permalink: /feed/appropedia-sustaibility-wiki/

Tags: #untagged

lqdev👽12/25/2022

https://jonudell.info/bytecols/2002-05-15.html

Permalink: /feed/personal-rss-aggregators-udell/

Tags: #untagged

lqdev👽12/22/2022

https://google.github.io/comprehensive-rust/

This is a four day Rust course developed by the Android team. The course covers the full spectrum of Rust, from basic syntax to advanced topics like generics and error handling. It also includes Android-specific content on the last day.

Permalink: /feed/comprehensive-rust-course/

Tags: #untagged

lqdev👽12/21/2022

https://9to5linux.com/pine64-announces-the-pinetab2-linux-tablet-with-up-to-8gb-ram-and-rk3566-soc

Futurama Fry Take My Money GIF

Permalink: /feed/pine64-announces-pinetab-2-linux/

Tags: #untagged

lqdev👽12/16/2022

https://github.com/norvig/pytudes/blob/main/ipynb/AlphaCode.ipynb

Large language models have recently shown an ability to solve a variety of problems. In this notebook we consider programming problems (as solved by AlphaCode) and mathematics problems (as solved by Minerva). The questions we would like to get at are:

In the future, what role will these generative models play in assisting a programmer or mathematician?

What will be a workflow that incorporates these models?

How will other existing tools (such as programming languages) change to accomodate this workflow?

Permalink: /feed/norvig-pytudes-alphacode/

Tags: #untagged

lqdev👽12/15/2022

https://boffosocko.com/2022/12/15/55812758/

Worked well for me. Thanks for sharing!

Permalink: /feed/mastodon-hashtag-rss-boffosocko/

Tags: #untagged

lqdev👽12/13/2022

https://dugas.ch/artificial_curiosity/GPT_architecture.html

Drawing of GPT-3 Architecture — Source: *dugas.ch*

Permalink: /feed/gpt-3-architecture-napkin/

Tags: #untagged

lqdev👽12/12/2022

https://podbay.fm/p/the-twiml-ai-podcast-formerly-this-week-in-machine-learning-and-artificial-intelligence/e/1670516880

Today we're joined by ChatGPT, the latest and coolest large language model developed by OpenAl. In our conversation with ChatGPT, we discuss the background and capabilities of large language models, the potential applications of these models, and some of the technical challenges and open questions in the field. We also explore the role of supervised learning in creating ChatGPT, and the use of PPO in training the model. Finally, we discuss the risks of misuse of large language models, and the best resources for learning more about these models and their applications.

Permalink: /feed/twiml-exploring-llm-chatgpt/

Tags: #untagged

lqdev👽12/07/2022

https://github.com/charmbracelet/glow

Render markdown on the CLI

Glow CLI Tool GIF — Source: *github.com/charmbracelet/glow*

Permalink: /feed/glow-markdown-cli/

Tags: #untagged

lqdev👽12/07/2022

https://github.blog/2022-12-07-github-copilot-is-generally-available-for-businesses/

GitHub Copilot for Business gives organizations:

The power of AI. Millions of developers have already used GitHub Copilot to build software faster, stay in the flow longer, and solve problems in new ways—all right from their editor of choice.

Simple license management. Administrators can enable GitHub Copilot for their teams and select which organizations, teams, and developers receive licenses.

Organization-wide policy management. You can easily set policy controls to enforce user settings for public code matching on behalf of your organization.

Your code is safe with us. With Copilot for Business, we won’t retain code snippets, store or share your code regardless if the data is from public repositories, private repositories, non-GitHub repositories, or local files.

Permalink: /feed/gh-copilot-business-ga/

Tags: #untagged

lqdev👽12/07/2022

https://brandewinder.com/2022/12/04/simulating-wrapinator-5000/

Cool use of DiffSharp.

Permalink: /feed/fsadvent-wrapinator-5000/

Tags: #untagged

lqdev👽12/07/2022

https://www.theverge.com/2022/12/7/23497938/microsoft-teams-communities-feature

Windows Phone Rooms Feature Screenshot — Source: *blogs.windows.com*

This reminds me so much of the Rooms feature in Windows Phone. 😢

Permalink: /feed/teams-communities-feature/

Tags: #untagged

lqdev👽12/06/2022

https://waydro.id/

A container-based approach to boot a full Android system on a regular GNU/Linux system like Ubuntu.

Permalink: /feed/waydroid-linux/

Tags: #untagged

lqdev👽12/06/2022

https://dev.to/cognipla/geospatial-is-a-function-of-your-life-1924

Florence, a low-code geospatial library for everyone that ranks city places with (life-inspired) functions.

Florence heavily laverages .NET Interactive/Polyglot Notebooks runtime to hide and execute code behind the scene (including breaking some language constraints).

Permalink: /feed/florence-fsharp-geospatial/

Tags: #untagged

lqdev👽12/06/2022

https://toobnix.org/c/emacsconf/videos

Permalink: /feed/emacsconf-2022-videos/

Tags: #untagged

lqdev👽12/06/2022

https://www.infoq.com/news/2022/12/microsoft-farmvibes/

Microsoft Research recently open-sourced FarmVibes.AI, a suite of ML models and tools for sustainable agriculture. FarmVibes.AI includes data processing workflows for fusing multiple sets of spatiotemporal and geospatial data, such as weather data and satellite and drone imagery.

Permalink: /feed/msft-open-sources-farmvibes-ai/

Tags: #untagged

lqdev👽12/03/2022

https://www.nlpdemystified.org/

Permalink: /feed/nlp-demystified-course/

Tags: #untagged

lqdev👽12/02/2022

https://medium.com/emacs/using-elfeed-to-view-videos-6dfc798e51e6

Slightly modified the original script to use Streamlink and lower quality to 240p for bandwith and resource purposes.

(require 'elfeed)

(defun elfeed-v-mpv (url)
  "Watch a video from URL in MPV" 
  (async-shell-command (format "streamlink -p mpv %s 240p" url)))

(defun elfeed-view-mpv (&optional use-generic-p)
  "Youtube-feed link"
  (interactive "P")
  (let ((entries (elfeed-search-selected)))
    (cl-loop for entry in entries
     do (elfeed-untag entry 'unread)
     when (elfeed-entry-link entry) 
     do (elfeed-v-mpv it)) 
   (mapc #'elfeed-search-update-entry entries) 
   (unless (use-region-p) (forward-line)))) 

(define-key elfeed-search-mode-map (kbd "v") 'elfeed-view-mpv)

Permalink: /feed/elfeed-view-videos/

Tags: #untagged

lqdev👽12/01/2022

https://openai.com/blog/chatgpt/

We’ve trained a model called ChatGPT which interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests. ChatGPT is a sibling model to InstructGPT, which is trained to follow an instruction in a prompt and provide a detailed response.

Permalink: /feed/openai-chatgpt/

Tags: #untagged

lqdev👽11/30/2022

https://opensource.com/article/22/11/open-source-payphone-philtel

PhilTel is a telephone collective based in Philadelphia, Pennsylvania, focusing on making communications accessible to everyone by installing free-to-use payphones. While you'll be able to make standard telephone calls through our phones, we're also focusing on offering interesting services or experiences. We don't want to only facilitate human-to-human interaction but also human-to-machine interaction and give people an environment where they can explore the telephone network and learn from it.

Permalink: /feed/philtel-open-source-payphone/

Tags: #untagged

lqdev👽11/29/2022

https://simonwillison.net/2022/Nov/26/productivity/

Consider timezones: engineers in Madrid and engineers in San Francisco had almost no overlap in their working hours. Good asynchronous communication was essential.

Over time, I noticed that the teams that were most effective at this scale were the teams that had a strong culture of documentation and automated testing.

As I started to work on my own array of smaller personal projects, I found that the same discipline that worked for large teams somehow sped me up, when intuitively I would have expected it to slow me down.

Permalink: /feed/documentation-coping-serial-project-hoarder/

Tags: #untagged

lqdev👽11/24/2022

https://github.com/jlevy/the-art-of-command-line

Permalink: /feed/art-of-command-line/

Tags: #untagged

lqdev👽11/24/2022

https://stability.ai/blog/stable-diffusion-v2-release

The Stable Diffusion 2.0 release includes robust text-to-image models trained using a brand new text encoder (OpenCLIP), developed by LAION with support from Stability AI, which greatly improves the quality of the generated images compared to earlier V1 releases.

Stable Diffusion 2.0 also includes an Upscaler Diffusion model that enhances the resolution of images by a factor of 4.

Permalink: /feed/stable-diffusion-v2/

Tags: #untagged

lqdev👽11/23/2022

https://www.tremendous.com/blog/the-perks-of-a-high-documentation-low-meeting-work-culture

the intangible, overarching benefit of practicing meeting mindfulness is this: you spend less of your day sort-of-listening and more of your day really thinking.

Permalink: /feed/perks-high-documentation-culture/

Tags: #untagged

lqdev👽11/23/2022

https://cybernews.com/news/whatsapp-data-leak/

Just a friendly reminder, there are alternatives.

Permalink: /feed/whatsapp-data-leak-alternatives/

Tags: #untagged

lqdev👽11/23/2022

https://techcrunch.com/2022/11/21/tumblr-to-add-support-for-activitypub-the-social-protocol-powering-mastodon-and-other-apps/

Automattic CEO Matt Mullenweg — whose company acquired Tumblr from Verizon in 2019 — suggested...Tumblr... would soon add ...activitypub.

I'm perfectly happy using my site as the main place to post content but considering it's under new management and adopting open standards, it might be time to checkout out Tumblr again.

Permalink: /feed/tumblr-activitypub/

Tags: #untagged

lqdev👽11/21/2022

https://blog.archive.org/2022/11/16/digital-library-of-amateur-radio-communications-surpasses-25000-items/

In the six weeks since announcing that Internet Archive has begun gathering content for the Digital Library of Amateur Radio and Communications (DLARC), the project has quickly grown to more than 25,000 items, including ham radio newsletters, podcasts, videos, books, and catalogs.

Permalink: /feed/digital-library-amateur-radio-25k/

Tags: #untagged

lqdev👽11/21/2022

https://derw.substack.com/p/telepathic-technical-writing

...blog posts are always async, but they can lead to conversations and debates, once the reader is done reading. There's also the nature of blog posts being one-to-many, whereas chat is many-to-many, if done in a public channel. One-to-many forms of communication should generally be more formal, but can spread ideas and thoughts more coherently than many-to-many.

Permalink: /feed/telepathic-tech-writing/

Tags: #untagged

lqdev👽11/20/2022

https://blog.archive.org/2022/11/15/digital-books-wear-out-faster-than-physical-books/

Our paper books have lasted hundreds of years on our shelves and are still readable. Without active maintenance, we will be lucky if our digital books last a decade.

Permalink: /feed/digital-physical-books-wear/

Tags: #untagged

lqdev👽11/16/2022

https://www.feta.bz/

Feta is a Matrix server distribution for the Raspberry Pi 3 and 4.

This looks like a great project to get started with self-hosting and join the Matrix network. Having something similar for the fediverse would be great as well. I've hosted Matrix and Mastodon servers on a Raspberry Pi before so I know it's up to the task.

Permalink: /feed/feta-matrix-raspi/

Tags: #untagged

lqdev👽11/16/2022

https://rsapkf.org/weblog/q2z/

Permalink: /feed/rss-privacy-efficiency/

Tags: #untagged

lqdev👽11/05/2022

https://www.zylstra.org/blog/2022/11/two-decades-of-blogging-the-free-as-a-bird-edition/

Permalink: /feed/two-decades-blogging-free-bird-edition/

Tags: #untagged

lqdev👽11/05/2022

https://simonwillison.net/2022/Nov/5/mastodon/

Permalink: /feed/willison-mastodon/

Tags: #untagged

lqdev👽10/30/2022

https://doctorow.medium.com/how-to-leave-dying-social-media-platforms-9fc550fe5abf

Permalink: /feed/how-to-leave-dying-social-platforms/

Tags: #untagged

lqdev👽10/30/2022

https://www.eff.org/deeplinks/2019/11/altinteroperabilityadversarial

Permalink: /feed/alt-interoperability-adversarial/

Tags: #untagged

lqdev👽10/28/2022

https://newsletter.danhon.com/archive/4230/

Here’s how it works in practice:

A news organization (or any organization, let’s just start with news) already asserts ownership of its domain e.g. via its certificate, so we piggyback trust off its domain.

It stands up a Mastodon or other social server at a standard address. I’d propose follow.washingtonpost.com but there’s a bunch of reasons why you might do something else, see below, and uses the agreed well-known autodiscovery protocol to return the address for its Mastodon server (but I don’t see an entry for activitypub or Mastodon yet).

It creates accounts for its staff on its Mastodon server. Nobody else gets an account; the public can’t sign up.

What you get:

Verified accounts. Instead of relying on a third party to “verify” your account by sticking a blue check against your display name, account provenance and “verification” is inherited as a property of the Mastodon server....

Ease of discovery...all a user would have to do, to find Washington Post accounts to follow, would be to know the washingtonpost.com domain. Autodiscovery would let your Mastodon client point itself to the appropriate server.

Not just news organizations...anyone can set up a Mastodon server...the federation means that “official” accounts become “more official” when their server home is hung off the domain of the actual organization.

You wouldn’t need Twitter (or anyone else, really) to verify that the UK Prime Minister’s account is official, because you’d have following.gov.uk as the Mastodon server, which means you can trust that server as much as you trust .gov.uk domains.

Your university or college wants you to have a social media account? Sure, you can have it hosted at following.ucla.edu.

And yes, brands can get in on it. Sure. That way there’s a tiny chance you’re following the Proper Brand Account rather than a Parody Brand Account, which… is probably for the best. Or it’s easier to see that a Parody Account is a Parody Account because you can look at the parent server.

Permalink: /feed/proposal-news-mastodon-servers/

Tags: #untagged

lqdev👽10/28/2022

https://www.karlsutt.com/articles/communicating-effectively-as-a-developer/

Communicating effectively as an engineer means empathically increasing the resolution of your writing.

...“low-resolution writing”...There is very little context, too much reliance on pronouns and unclear references. The writing is not emphatic —the reader has to spend extra energy to work out what is being said

Longer-form writing gives you an opportunity to dive deeper into why you are saying what you are saying. It is a chance to educate, to teach, to help understand and to level up.

The quality of the API documentation will carry an astronomical amount of leverage. This leverage will work in both directions. Genuinely helpful documentation is the difference between being swamped by support requests from frustrated API users and significantly increasing the usage of your service. Happy users beget more happy users.

Spoken words get forgotten. Written words are shared, preserved, and become the basis of a company's culture. source

High resolution, empathic writing...You will have to spend more energy to make your writing easy to follow. You will have to grapple with your own confusion and holes in your understanding. You will have to figure out what the appropriate density for your writing is.

It's not about you, though.. It's about them.

Not only does a single recipient benefit from your extra effort, what if ten people read your good work? A hundred? A thousand? What if the company CEO reads it? Taking writing seriously at work or in your organisation and putting in the effort to delight the reader will, over time, compound into a massive body of quality writing that benefits everyone. It is a literal win-win-win

Produce writing you would read with delight if you were on the other end.

Permalink: /feed/communicating-effectively-developer/

Tags: #untagged

lqdev👽10/27/2022

https://www.jeffgeerling.com/blog/2022/why-i-use-jellyfin-my-home-media-library

It's also built with .NET 🙂

Permalink: /feed/jellyfin-home-media-library/

Tags: #untagged

lqdev👽10/25/2022

https://blog.jim-nielsen.com/2022/what-work-looks-like/

Permalink: /feed/what-work-looks-like/

Tags: #untagged

lqdev👽10/25/2022

https://matthiasott.com/notes/suspension

write your most important thoughts on your own site. You can share the link on as many platforms as you like and have conversations with anyone who wants to connect with you and your work. But nobody can take it from you. You are in control. Forever.

Permalink: /feed/ott-suspension/

Tags: #untagged

lqdev👽10/24/2022

https://www.windowscentral.com/hardware/laptops/android-12l-for-surface-duo-is-now-available-with-new-ui-and-more

Downloading now.

Permalink: /feed/surface-duo-android-12L/

Tags: #untagged

lqdev👽10/20/2022

https://pytorch.org/blog/PyTorchfoundation/

PyTorch is moving to the Linux Foundation (LF) as a top-level project under the name PyTorch Foundation.

The PyTorch Technical Governance now supports a hierarchical maintainer structure and clear outlining of processes around day to day work and escalations. This doesn’t change how we run things, but it does add discipline and openness that at our scale feels essential and timely.

Permalink: /feed/pytorch-lf-top-level-project/

Tags: #untagged

lqdev👽10/20/2022

https://blog.pocketcasts.com/2022/10/19/pocket-casts-mobile-apps-are-now-open-source/

We’ve been eager to take this step since we joined Automattic last year...

Wow. Yet another company I didn't know was owned by Automaticc (parent company of WordPress). So not only do they have a stake in regular blogs and websites, but they also are in the microblogging space with Tumblr. With Pocket Casts they're now into podcasts too. Good for them.

We believe that podcasting can not and should not be controlled by Apple and Spotify, and instead support a diverse ecosystem of third-party clients.

I couldn't agree more. Though to be fair to Apple, in all the years since podcasts have been a thing, despite them being one of the main indices, they didn't make any overt attempts to lock down the ecosystem. It's not until companies like Amazon and Spotify tried to make certain content platform exclusives that the ecosystem has started to feel more closed.

Permalink: /feed/pocket-casts-mobile-open-source/

Tags: #untagged

lqdev👽10/19/2022

https://www.fast.ai/posts/part2-2022-preview.html

In total, we’re releasing four videos, with around 5.5 hours of content, covering the following topics (the lesson numbers start at “9”, since this is a continuation of Practical Deep Learning for Coders part 1, which had 8 lessons):

Lesson 9 by Jeremy Howard: How to use Diffusers pipelines; What are the conceptual parts of Stable Diffusion

Lesson 9A by Jonathan Whitaker: A deep dive into Stable Diffusion concepts and code

Lesson 9B by Wasim Lorgat and Tanishq Abraham: The math of diffusion

Lesson 10 by Jeremy Howard: Creating a custom diffusion pipeline; Starting “from the foundations”

Permalink: /feed/fastai-from-dl-stable-diffusion/

Tags: #untagged

lqdev👽10/19/2022

https://twitter.com/andrestaltz/status/1582448952057958401

My experience working on SSB (i.e. non-crypto fully decentralized protocol) is suitable for small world communication, and definitely not suitable for big world. Email proves that federation can do big world.

Permalink: /feed/staltz-thoughts-bluesky-at-protocol/

Tags: #untagged

lqdev👽10/19/2022

https://maggieappleton.com/garden-history

The conversational feed design of email inboxes, group chats, and InstaTwitBook is fleeting – they're only concerned with self-assertive immediate thoughts that rush by us in a few moments...But streams only surface the Zeitgeisty ideas of the last 24 hours...Gardens present information in a richly linked landscape that grows slowly over time...The garden helps us move away from time-bound streams and into contextual knowledge spaces.

The Six Patterns of Gardening:

Topography over Timelines - Gardens are organised around contextual relationships and associative links; the concepts and themes within each note determine how it's connected to others.

Continuous Growth - Gardens are never finished, they're constantly growing, evolving, and changing.

Imperfection & Learning in Public - Gardens are imperfect by design. They don't hide their rough edges or claim to be a permanent source of truth.

Playful, Personal, and Experimental - Gardens are non-homogenous by nature. You can plant the same seeds as your neighbour, but you'll always end up with a different arrangement of plants.

Intercropping & Content Diversity - Gardens are not just a collection of interlinked words...Podcasts, videos, diagrams, illustrations, interactive web animations, academic papers, tweets, rough sketches, and code snippets should all live and grow in the garden.

Independent Ownership - Gardening is about claiming a small patch of the web for yourself, one you fully own and control... If you give it a bit of forethought, you can build your garden in a way that makes it easy to transfer and adapt. Platforms and technologies will inevitably change. Using old-school, reliable, and widely used web native formats like HTML/CSS is a safe bet.

Permalink: /feed/appleton-garden-history/

Tags: #untagged

lqdev👽10/18/2022

https://blueskyweb.org/blog/10-18-2022-the-at-protocol

Bluesky was created to build a social protocol. In the spring, we released “ADX,” the very first iteration of the protocol...ADX is now the “Authenticated Transport Protocol"...more simply, the “AT Protocol.”

The “AT Protocol” is a new federated social network

What makes AT Protocol unique:

Account portability

Algorithmic choice

Interoperation

Performance

AT Procol website

Permalink: /feed/bluesky-at-protocol/

Tags: #untagged

lqdev👽10/18/2022

https://herbertlui.net/overdue-insights-on-daily-blogging/

Here are a bunch of not-so-obvious lessons I’ve internalized through writing each day:

Writing can be a starting point, not an ending one

Write to think

The power of DIFY: Do it for yourself, don’t think too much about what you want other people to get out of it.

Think small

Gain energy

A lot of books are collections of blog posts

Small is an unblocker

Permalink: /feed/overdue-insights-daily-blogging/

Tags: #untagged

lqdev👽10/18/2022

https://visitmy.website/2022/10/04/patterns-for-collaborative-blogging/

Permalink: /feed/patterns-collaborative-blogging/

Tags: #untagged

lqdev👽10/18/2022

https://gist.github.com/luisquintanilla/164176ec414e465246d6323aa62b38df

This sample shows how to use a pretrained Bidirectional Attention Flow (BiDAF) ONNX model in ML.NET for answering a query about a given context paragraph.

Permalink: /feed/bidaf-mlnet-onnx/

Tags: #untagged

lqdev👽10/18/2022

https://jalammar.github.io/illustrated-stable-diffusion/

Permalink: /feed/illustrated-stable-diffusion/

Tags: #untagged

lqdev👽10/18/2022

https://almanac.httparchive.org/en/2022/

The Web Almanac is a comprehensive report on the state of the web, backed by real data and trusted web experts. The 2022 edition is comprised of 23 chapters spanning aspects of page content, user experience, publishing, and distribution.

Permalink: /feed/http-archive-state-of-web-2022-almanac/

Tags: #untagged

lqdev👽10/17/2022

http://jalammar.github.io/illustrated-gpt2/

Permalink: /feed/illustrated-gpt-2/

Tags: #untagged

lqdev👽10/16/2022

https://blog.archive.org/2022/10/10/new-ebook-protection-software-gaining-popularity-among-publishers-and-libraries/

Readium LCP was developed five years ago to protect digital files from unauthorized distribution. Unlike proprietary platforms, the technology is open to anyone who wants to look inside the codebase and make improvements. It is a promising alternative for libraries and users wanting to avoid the limitations of traditional DRM.

Permalink: /feed/readium-lcp-drm-technology/

Tags: #untagged

lqdev👽10/14/2022

https://platform.leolabs.space/visualization

Permalink: /feed/leo-visualization/

Tags: #untagged

lqdev👽10/14/2022

https://berk.es/2022/10/12/blog-comments-on-a-static-site-via-social-networks/

discu.eu, a fantastic service that you provide with an URL and then gives back results from various social networks. Places where that URL is discussed. Hackernews, Reddit, and/or Lobsters. It has an API, that I can call with some JavaScript and then insert in the page.

Permalink: /feed/discueu-comments-static-page/

Tags: #untagged

lqdev👽10/14/2022

https://www.process-one.net/blog/matrix-protocol-added-to-ejabberd/

Permalink: /feed/ejabberd-matrix-protocol/

Tags: #untagged

lqdev👽10/14/2022

https://www.hello.coop/pages/approach.html

Permalink: /feed/hello-decentralized-identity-coop/

Tags: #untagged

lqdev👽10/14/2022

https://zenodo.org/record/7070952#.Y0m5eX7MK00

This dataset contains detailed data on 42,207 apartments (242,257 rooms) in 3,093 buildings including their geometries, room typology as well as their visual, acoustical, topological and daylight characteristics.

Permalink: /feed/swiss-dwelling-apartment-dataset/

Tags: #untagged

lqdev👽10/14/2022

https://arxiv.org/abs/2210.03945

Large language models (LLMs) have shown exceptional performance on a variety of natural language tasks. Yet, their capabilities for HTML understanding...have not been fully explored. We contribute HTML understanding models (fine-tuned LLMs) and an in-depth analysis of their capabilities under three tasks: (i) Semantic Classification of HTML elements, (ii) Description Generation for HTML inputs, and (iii) Autonomous Web Navigation of HTML pages.

Out of the LLMs we evaluate, we show evidence that T5-based models are ideal due to their bidirectional encoder-decoder architecture.

Permalink: /feed/understanding-html-large-language-models/

Tags: #untagged

lqdev👽10/14/2022

https://www.kaggle.com/kaggle-survey-2022

Development

Python and SQL remain the two most common programming skills for data scientists

VSCode is now used by over 50% of working data scientists

Notebooks are a popular environment as well.

Colab notebooks are the most popular cloud-based Jupyter notebook environment

Makes sense especially since Kaggle is owned by Google.

Machine Learning

Kaggle DS & ML Survey 2022 Scikit-learn is the most popular ML framework while PyTorch has been growing steadily year-over-year

LightGBM, XGBoost are also among the top frameworks.

Transformer architectures are becoming more popular for deep learning models (both image and text data)

Cloud computing

All major cloud computing providers saw strong year over year growth in 2022

Specialized hardware like Tensor Processing Units (TPUs) is gaining initial traction with Kaggle data scientists

Resources

PDF
Dataset

Permalink: /feed/kaggle-state-data-science-ml-2022/

Tags: #untagged

lqdev👽10/13/2022

https://www.jeffgeerling.com/blog/2022/streaming-services-lost-plot

I got serious about consolidating the media my family consumes; I decided to buy blu-ray and DVD copies of all the movies and TV shows we actually cared about, and rip them onto a NAS.

I'm glad I've gone down this path, and become more intentional about my media consumption, especially as companies are deleting already-purchased content from users' media libraries! It's sickening (and, IMHO, should be illegal) they can show a 'buy' button for a DRM'ed digital downloads that the user never actually 'owns'.

Couldn't agree more. Although the focus of this post is on video, you can say the same for music and books.

Permalink: /feed/streaming-companies-lost-plot/

Tags: #untagged

lqdev👽10/13/2022

https://twitter.com/element_hq/status/1580555805803261952

Permalink: /feed/matrix-element-android-redesign-2022/

Tags: #untagged

lqdev👽10/12/2022

http://scripting.com/2022/10/01/133834.html?title=docQuixote

I spent great time, energy and money, over many years to create the writing and programming environment I wanted to use and I wanted my peers to use, so we could work together to create species-saving communication tools, and just beauty...

...I read the story of David Bowie's last days, he did something amazing when he knew he had a short time to live. He stepped back and got out of the way. He understood this is no longer his world.

When you're young, you think expansively, and as you get old reality sinks in and your imagination contracts. The horizon gets closer and closer. We don't get to mold the world, we are not gods, no matter how good or generous, smart of ruthless you may be, we all start out young and if we're lucky we get old and then we're gone.

Permalink: /feed/doc-quixote/

Tags: #untagged

lqdev👽10/12/2022

https://www.stateof.ai/2022-report-launch.html

Key takeaways:

New independent research labs are rapidly open sourcing the closed source output of major labs.
Safety is gaining awareness among major AI research entities
The China-US AI research gap has continued to widen
AI-driven scientific research continues to lead to breakthroughs

Permalink: /feed/2022-state-of-ai-report/

Tags: #untagged

lqdev👽10/12/2022

https://stevens.netmeister.org/631/

In this course, students will learn to develop complex system-level software in the C programming language while gaining an intimate understanding of the Unix operating system (and all OS that belong to this family, such as Linux, the BSDs, and even Mac OS X) and its programming environment.

Topics covered will include the user/kernel interface, fundamental concepts of Unix, user authentication, basic and advanced I/O, fileystems, signals, process relationships, and interprocess communication. Fundamental concepts of software development and maintenance on Unix systems (development and debugging tools such as "make" and "gdb") will also be covered.

Permalink: /feed/cs631-advanced-programming-unix-env/

Tags: #untagged

lqdev👽10/12/2022

https://werat.dev/blog/how-wine-works-101/

Permalink: /feed/how-wine-works/

Tags: #untagged

lqdev👽10/11/2022

https://www.sane.fyi/

Permalink: /feed/sane-app/

Tags: #untagged

lqdev👽10/11/2022

https://jamesg.blog/2022/10/11/fragmention-links/

Permalink: /feed/jamesg-fragmentation-links/

Tags: #untagged

lqdev👽10/11/2022

https://www.infoq.com/news/2022/10/nlp-community-metasurvey/

Relevant links:

Permalink: /feed/nlp-metasurvey-results/

Tags: #untagged

lqdev👽10/11/2022

https://arxiv.org/pdf/2210.00108.pdf

...backdoors can be added during compilation, circumventing any safeguards in the data preparation and model training stages.

some backdoors, such as ImpNet, can only be reliably detected at the stage where they are inserted and removing them anywhere else presents a significant challenge.

machine-learning model security requires assurance of provenance along the entire technical pipeline, including the data, model architecture, compiler, and hardware specification.

Permalink: /feed/impnet-undetectable-backdoors-compiled-nn/

Tags: #untagged

lqdev👽10/10/2022

https://indieweb.org/bookmark

A bookmark (or linkblog) is a post that is primarily comprised of a URL, often title text from that URL, sometimes optional text describing, tagging, or quoting from its contents.

Bookmarks are useful for saving things to read later or build a recommended reading list.

Permalink: /feed/indieweb-bookmark-post/

Tags: #untagged

lqdev👽10/10/2022

https://freecontent.manning.com/rediscover-the-joy-in-coding-learn-f

Permalink: /feed/manning-rediscover-joy-coding-fsharp/

Tags: #untagged

lqdev👽10/09/2022

https://www.newyorker.com/tech/annals-of-technology/can-indie-social-media-save-us

Permalink: /feed/indie-social-media-save-us/

Tags: #untagged

lqdev👽10/09/2022

https://handwritten.blog/

Permalink: /feed/handwrittern-blog/

Tags: #untagged

lqdev👽10/06/2022

https://blog.jim-nielsen.com/2021/things-learned-blogging/

Both this post as well as Tom MacWright's "How to Blog" resonate. I've been posting more consistently to my microblog feeds. These posts are more informal, but just the aspect of producing something even if it's just a short snippet is gratifying. The informality of it makes the posts significantly shorter but the pace at which I publish content and share ideas is faster.

Permalink: /feed/things-learned-blogging/

Tags: #untagged

lqdev👽10/06/2022

https://blog.jim-nielsen.com/2019/i-love-rss/

It's always fun to stumble upon these lists and finding more interesting people and websites to follow.

Permalink: /feed/nielsen-favorite-rss-feeds/

Tags: #untagged

lqdev👽10/06/2022

https://www.youtube.com/watch?v=QMSmcpn8nLM&ab_channel=ResairchiaVideos

Permalink: /feed/numerical-fnalysis-intro/

Tags: #untagged

lqdev👽10/06/2022

https://www.compositional-it.com/news-blog/f-with-python-pt-2-tensorflow-binding/

Permalink: /feed/fsharp-python-tf-binding/

Tags: #untagged

lqdev👽10/05/2022

https://unredactedmagazine.com/issues/004.pdf

Permalink: /feed/unredacted-mag-issue-4/

Tags: #untagged

lqdev👽10/05/2022

https://archive.org/details/locations-venues-indie-web-camp-berlin-2022

Interesting discussion at about 21:30 on a federated wiki / review aggregator.

Permalink: /feed/indiewebcamp-berlin-2022-location-venues/

Tags: #untagged

lqdev👽10/05/2022

https://blog.jim-nielsen.com/2022/other-peoples-websites/

Google+ was Google trying to mimic the walled garden of Facebook — their “how” of extracting value from the people of the internet. But they already had an answer for Facebook’s News Feed in front of them: Blogger / Google Reader, the read / write of the internet.

They provide the tools – Reader, Blogger, Search — we provide the personal websites. The open, accessible, indexable web as The Next Great Social Network.

Permalink: /feed/og-social-network-other-peoples-websites/

Tags: #untagged

lqdev👽10/05/2022

https://blog.jim-nielsen.com/2022/patching-open-web/#readlists

I've been doing something similar to readlists, except with RSS feeds. I create a custom recipe in Calibre which pulls and aggregates the 50 latest articles for each feed in my recipe. I limit it to only look at articles published in the past day since I do this every evening. Think of it like the evening paper. The result is an EPUB file which I then import into my e-book reader.

A few advantages I've found to doing this are:

I take a break from the computer.
Since the e-book reader is offline, I focus on reading the article and don't have the option to click on links and get distracted browsing the internet.
Since I'm already using the e-book reader, it's easy to transition to reading a book.

Permalink: /feed/patching-open-web/

Tags: #untagged

lqdev👽10/03/2022

https://www.thisdaysportion.com/posts/minimalism-as-narcissim/

Permalink: /feed/minimalism-narcissism/

Tags: #untagged

lqdev👽10/03/2022

https://stellarium.org/release/2022/10/01/stellarium-1.0.html

Can't believe this is the first time I'd heard of this desktop app. Usually using mobile apps like Sky Guide is convenient when on the go, but Stellarium not only seems to have lots of information, but also cross-platform and web-based.

Permalink: /feed/stellarium-1-0/

Tags: #untagged

lqdev👽10/02/2022

https://wilw.dev/blog/2022/10/02/bear-joplin/

Permalink: /feed/bear-joplin/

Tags: #untagged

lqdev👽09/29/2022

https://twitter.com/flyinglotus/status/1575517121122996225

This ☝️. If you know, you know. 🔥

Permalink: /feed/flylo-terrifier-2/

Tags: #untagged

lqdev👽09/29/2022

https://www.theverge.com/2022/9/29/23378713/google-stadia-shutting-down-game-streaming-january-2023

DJ Khaled Another One GIF

Permalink: /feed/google-stadia-shutdown/

Tags: #untagged

lqdev👽09/29/2022

https://rutar.org/writing/how-to-build-a-personal-webpage-from-scratch/

Permalink: /feed/how-to-build-personal-webpage-scratch/

Tags: #untagged

lqdev👽09/28/2022

https://andregarzia.com/2022/09/The-appeal-of-small-computers.html

Permalink: /feed/appeal-small-pc/

Tags: #untagged

lqdev👽09/28/2022

https://spectrum.ieee.org/meet-an-open-source-pc-that-can-fit-in-your-pocket

Take My Money GIF

Permalink: /feed/mnt-pocket-reform/

Tags: #untagged

lqdev👽09/28/2022

https://openai.com/blog/dall-e-now-available-without-waitlist/

Permalink: /feed/dalle-no-waitlist/

Tags: #untagged

lqdev👽09/28/2022

https://blog.jonudell.net/tag/fedwiki/

Permalink: /feed/fed-wiki-udell/

Tags: #untagged

lqdev👽09/28/2022

https://www.infoworld.com/article/3674309/github-for-english-teachers.html

Permalink: /feed/github-english-teachers/

Tags: #untagged

lqdev👽09/26/2022

https://fosdem.org/2023/news/2022-09-14-fosdem-2023-dates/

It feels like the 2022 conference was just yesterday. In any case, save the date February 4-5,2023.

Permalink: /feed/fosdem-2023-dates/

Tags: #untagged

lqdev👽09/26/2022

https://opensource.com/article/22/9/joplin-interview

Good read. Other than Emacs, Joplin is my go-to notetaking application.

Permalink: /feed/joplin-interview/

Tags: #untagged

lqdev👽09/26/2022

https://proton.me/blog/proton-drive-launch

Permalink: /feed/proton-drive-launch/

Tags: #untagged

lqdev👽09/25/2022

https://github.com/Rezmason/matrix

Matrix Rain sample with different variations.

Permalink: /feed/web-matrix-rain/

Tags: #untagged

lqdev👽09/23/2022

https://github.com/mlcommons/ck/blob/master/docs/mlperf-education-workgroup.md

Permalink: /feed/mlperf-modular-ml-workgroup/

Tags: #untagged

lqdev👽09/23/2022

https://mariachec.medium.com/on-objectives-okrs-ncts-lessons-from-reinventing-the-company-goal-setting-10eb969e4ca5

Alternative to OKRs

Permalink: /feed/nct-goal-setting/

Tags: #untagged

lqdev👽09/22/2022

https://blog.jim-nielsen.com/2022/moving-with-prototypes/

Permalink: /feed/moving-prototypes/

Tags: #untagged

lqdev👽09/22/2022

https://mutant.tech/

an alternative...emoji set.

Permalink: /feed/mutant-standard-emojis/

Tags: #untagged

lqdev👽09/22/2022

https://web.dev/testing-web-design-color-contrast/

W3C’s Web Accessibility Initiative provides strategies, standards, and resources to ensure that the internet is accessible for as many people as possible. The guidelines that underpin these standards are called the Web Content Accessibility Guidelines, or WCAG.

Color contrast is an important piece of the puzzle for accessibility on the web, and adhering to it makes the web more usable for the greatest number of people in the most varied situations.

Apps to test contrast:

Pika
VisBug
Chrome Dev Tools

Permalink: /feed/testing-web-design-color-contrast/

Tags: #untagged

lqdev👽09/22/2022

https://devblogs.microsoft.com/visualstudio/now-introducing-arm64-support-for-vs-extensions/

Permalink: /feed/vs-extension-arm-support/

Tags: #untagged

lqdev👽09/22/2022

https://ai.googleblog.com/2022/09/tensorstore-for-high-performance.html

High-perf, scalable array storage that can be used for scenarios like language models.

Permalink: /feed/tensorstore/

Tags: #untagged

lqdev👽09/22/2022

https://openai.com/blog/whisper/

Permalink: /feed/open-ai-whisper/

Tags: #untagged

lqdev👽09/22/2022

https://opensource.com/article/22/9/git-configuration-linux

Create global configuration
Set default name
Set default email address
Set default branch name
Set default editor

Permalink: /feed/five-git-configurations-linux/

Tags: #untagged

lqdev👽09/21/2022

https://www.robinsloan.com/notes/home-cooked-app/

When you liberate programming from the requirement to be general and professional and scalable, it becomes a different activity altogether, just as cooking at home is really nothing like cooking in a commercial kitchen.

Same goes for websites and self-hosting.

Permalink: /feed/app-home-cooked/

Tags: #untagged

lqdev👽09/21/2022

https://xibbon.com/terminal/2022/09/21/welcome-to-la-terminal.html

I'm not an Apple user, but this is cool.

Permalink: /feed/welcome-to-la-terminal/

Tags: #untagged

lqdev👽09/21/2022

https://larahogan.me/blog/management-resource-library/

Resource on:

Influence & managing up: Enact positive change for yourself, your team, or the whole organization.

Leading through crises: Strengthen your support network, meet your team where they’re at, and weather the tough times.

Cross-functional relationships: Strengthen relationships by creating role clarity and creatively supporting one another.

One-on-ones: Set your teammates up for success during your one-on-one meetings!

Hiring: Build consistent, repeatable, and equitable interviews and onboarding plans.

Meetings: Support participants, hone the content, and nail the meeting goal.

Feedback & performance reviews: Everyone deserves clear, actionable feedback!

Communication & team dynamics: Plan ahead, facilitate well, and create clarity.

Adapting your approach: As your work context and team evolves, your leadership approach will need to evolve, too.

Permalink: /feed/resources-navigating-complex-leadership-work/

Tags: #untagged

lqdev👽09/21/2022

https://weaviate.io/blog/2022/09/Distance-Metrics-in-Vector-Search.html

Metrics covered:

Cosine
Dot Product
L2-Squared
Manhattan
Hamming

Permalink: /feed/distance-metrics-vector-search/

Tags: #untagged

lqdev👽09/21/2022

https://en.wikipedia.org/wiki/1%25_rule

...about 1% of Internet users create content, while 99% are just consumers of that content

Permalink: /feed/1-percent-rule/

Tags: #untagged

lqdev👽09/20/2022

http://paulgraham.com/users.html

What have I learned from YC's users, the startups we've funded?

...most startups have the same problems.

...the batch that broke YC was a powerful demonstration of how individualized the process of advising startups has to be.

...founders can be [bad] at realizing what their problems are. Founders will sometimes come in to talk about some problem, and we'll discover another much bigger one in the course of the conversation.

Often founders know what their problems are, but not their relative importance.

Focus is doubly important for early stage startups, because not only do they have a hundred different problems, they don't have anyone to work on them except the founders. If the founders focus on things that don't matter, there's no one focusing on the things that do.

Speed defines startups. Focus enables speed. YC improves focus.

Why are founders uncertain about what to do? Partly because startups almost by definition are doing something new, which means no one knows how to do it yet, or in most cases even what "it" is.

Permalink: /feed/what-ive-learned-from-users/

Tags: #untagged

lqdev👽09/20/2022

https://www.eff.org/deeplinks/2022/09/how-ditch-facebook-without-losing-your-friends-or-family-customers-or-communities

disgruntled Facebook users keep using the service because they don’t want to leave behind their friends, family, communities and customers.

“How to Ditch Facebook Without Losing Your Friends” explains the rationale behind these proposals - and offers a tour of what it would be like to use a federated, interoperable Facebook, from setting up your account to protecting your privacy and taking control of your own community’s moderation policies, overriding the limits and permissions that Facebook has unilaterally imposed on its users.

Permalink: /feed/ditch-facebook-without-losing-friends/

Tags: #untagged

lqdev👽09/20/2022

https://ffprofile.com/

This tool will help you to create a Firefox profile with the defaults you like.

You select which features you want to enable and disable and in the end you get a download link for a zip-file with your profile template.

Permalink: /feed/firefox-profile-maker/

Tags: #untagged

lqdev👽09/18/2022

https://www.zylstra.org/blog/2022/09/rss2-0-at-20/

20 years and still going strong.

Permalink: /feed/rss-20-at-20/

Tags: #untagged

lqdev👽09/16/2022

https://webmention.rocks/test/1

This is my comment 2

Permalink: /feed/webmention-test-1/

Tags: #untagged

lqdev👽09/16/2022

https://www.zylstra.org/blog/2022/09/wordpressindieweb-as-the-os-of-the-open-social-web/

...2022 Netherlands WordCamp edition in Arnhem [presentation] on turning all WordPress sites into fully IndieWeb enabled sites. Meaning turning well over a third of the web into the open social web. Outside all the silos.

Permalink: /feed/wordpress-indieweb-os-open-social-web/

Tags: #untagged

lqdev👽09/14/2022

https://ryanholiday.net/the-notecard-system-the-key-for-remembering-organizing-and-using-everything-you-read/

Permalink: /feed/rh-notecard-system/

Tags: #untagged

lqdev👽09/14/2022

http://www.paulgraham.com/chameleon.html

When Lisp adopts a new paradigm it not only replicates existing practice, but goes beyond it to become a testbed for advancing the state of the art. Why has Lisp been able to adapt so easily when other languages have not? One reason is that Lisp is a programmable programming language.

Permalink: /feed/lisp-is-a-chameleon/

Tags: #untagged

lqdev👽09/14/2022

https://www.freecodecamp.org/news/new-online-courses/

Subjects:

Science
Programming
Education & Teaching
Health & Medicine
Social Sciences
Business
Information Security (InfoSec)
Mathematics
Computer Science
Data Science
Personal Development
Humanities
Engineering
Art & Design

Permalink: /feed/850-free-mooc-courses/

Tags: #untagged

lqdev👽09/11/2022

https://www.openbuildinginstitute.org/

The OBI system is open source, collaborative and distributed.

Our focus is on low cost and rapidly-built structures that are modular, ecological, and energy efficient.

Permalink: /feed/open-building-institute/

Tags: #untagged

lqdev👽09/09/2022

https://ideaspace.substack.com/p/metablogging

...internally public blogs written by members of the...squad detailing what they’re working on and thinking about.

Permalink: /feed/metablogging-ideaspace/

Tags: #untagged

lqdev👽09/08/2022

https://aeturrell.github.io/coding-for-economists/intro.html

...a guide for economists on what programming is, why it’s useful, and how to do it.

Permalink: /feed/coding-for-economists/

Tags: #untagged

lqdev👽09/06/2022

https://github.com/google/haskell-trainings

This repository contains the source for the slides and the exercises used in the Haskell trainings at Google.

Permalink: /feed/google-haskell-trainings/

Tags: #untagged

lqdev👽09/06/2022

https://www.opensourcealternative.to/

Discover 350+ popular open source alternatives to your proprietary SaaS.

Permalink: /feed/open-source-alternativeto/

Tags: #untagged

lqdev👽09/05/2022

https://openlibrary.org/

Open Library is an initiative of the Internet Archive, a 501(c)(3) non-profit, building a digital library of Internet sites and other cultural artifacts in digital form.

Permalink: /feed/open-library/

Tags: #untagged

lqdev👽09/05/2022

https://pyimagesearch.com/2022/09/05/a-deep-dive-into-transformers-with-tensorflow-and-keras-part-1/

Permalink: /feed/deep-dive-transformers-tf-keras-pt1/

Tags: #untagged

lqdev👽09/04/2022

https://pine.blog/search

Find feeds for all of your favorite sites and keep up with everything they post!

Permalink: /feed/pine-blog-feed-directory/

Tags: #untagged

lqdev👽09/04/2022

https://evernote.com/blog/how-to-create-commonplace-with-evernote/

Why commonplace?

Commonplace is an ideal medium for the curation and cultivation of intellectual ideas, thoughts, and knowledge. It’s also a proven and timeless system, tested by folks from a spectrum of backgrounds including authors, professors, and scientists.

Remember things
Write to recall
Understand reading
Personal reference system
Filter ideas
Unleash creativity

Reflecting on your commonplace

The most profound power of your commonplace is being able to thumb through reading and review material. Your commonplace book isn’t just a filing cabinet—it’s an evolving record of your life and observations.

Permalink: /feed/how-to-create-commonplace-book-evernote/

Tags: #untagged

lqdev👽09/04/2022

https://www.linuxjournal.com/content/every-user-neo

Permalink: /feed/every-user-neo/

Tags: #untagged

lqdev👽09/04/2022

https://buckrogers.neocities.org/

Permalink: /feed/buck-rogers-25-century-ad/

Tags: #untagged

lqdev👽09/02/2022

https://www.jayeless.net/wiki/small-web.html

...the kind of web they define themselves against; that kind of bloated, corporate, algorithm-ruled and ad-ridden mess that constitutes the majority of highly-trafficked websites these day.

...for me, the term “Small Web” refers to a couple of main things: independence from tech giants, and websites that are lightweight and high-performance.

Related concepts:

A late 90s-style, hand-crafted web
Alternative protocols, like Gemini
An independent web

Permalink: /feed/small-web-jayless-wiki/

Tags: #untagged

lqdev👽08/30/2022

https://blot.im/

Blot is a blogging platform with no interface. It turns a folder into a website.

Permalink: /feed/blot-blogging-platform/

Tags: #untagged

lqdev👽08/12/2022

https://github.com/osmlab/awesome-openstreetmap

Curated list of awesome OpenSteetMap-projects

Permalink: /feed/osmlab-awesome-osm/

Tags: #untagged

lqdev👽08/03/2022

https://yesterweb.org/

This site is dedicated to a community (and a larger movement) about the internet how it's changed. We are creating, discovering and enjoying websites and digital spaces.

Permalink: /feed/yesterweb/

Tags: #untagged

lqdev👽08/03/2022

https://knightcolumbia.org/events/reimagine-the-internet

A virtual conference exploring what the internet could become over the next decade

Sessions:

Pioneering Alternative Models for Community on the Internet
Misinformation, Disinformation, and Media Literacy in a Less-Centralized Social Media Universe
Interoperability and Alternative Social Media
Lessons from Experiments in Local Community-Building
Deplatforming and Innovation
New Directions in Social Media Research

Permalink: /feed/columbia-reimagine-internet/

Tags: #untagged

lqdev👽08/03/2022

https://masterwiki.how/

...masterWiki is the direct adaptation of MasterClass' video courses translated into wikiHow-style how-to guides...

Permalink: /feed/masterwiki/

Tags: #untagged

lqdev👽08/01/2022

https://betterhumans.pub/how-to-set-up-your-iphone-for-productivity-focus-and-your-own-longevity-bb27a68cc3d8

Here’s how to read the post:

Level 1 — Casual. Read the headlines — figure out the details yourself. Most of this isn’t rocket science.

Level 2 — Tutorial. Read the steps underneath the headline. I’ve spelled out every step so that you can save your brain power for something else.

Level 3 — Productivity Nerd. Below the tutorial steps, I’ve included discussion of the behavior design implications. This is for true productivity nerds, i.e. the readers of Better Humans.

Optimize First for Single Tasking

#1. Turn OFF (almost) all notifications
#2. Hide social media slot machines
#3. Hide messaging slot machines
#4. Disable app review requests
#5. Turn on Do Not Disturb
#6. Be strategic about your wallpaper
#7. Turn off Raise to Wake
#8. Add the Screen Time widget
#9. Add Content Restrictions
#10. (Optional) Use Restrictions to turn off Safari
#11. Organize your Apps and Folders alphabetically

Switch to Google Cloud to Work Faster

#12. Choose GMail
#13. Choose Google Calendar
#14. Replace Apple Maps with Google Maps
#15. Install the GBoard keyboard for faster typing
#16. Switch to Google Photos

Install These Apps for Productivity

#17. Use Evernote for all note taking, to-do lists, everything
#18. The Case for Calm as your go-to meditation app
#19. Install the right goal tracker for you
#20. Store all your passwords in a password manager, probably LastPass
#21. Use Numerical as your default calculator
#22. Put the Camera app in your toolbar
#23. Use this Doppler Radar app
#24. Use this Pomodoro app
#25. Use Brain.fm for background noise

Use These Apps and Configurations for Deep Learning

#26. Subscribe to these podcasts
#27. Install the Kindle app but never read it in bed
#28. Use Safari this way
#29. Organize your home screen for deep learning over shallow learning

Use These Apps and Configurations for Longevity

#30. Track steps this way
#31. Prefer Time Restricted Eating Over Calorie Counting
#32. Schedule Night Shift
#33. Set up Medical ID

Make The Finishing Touches with These Configurations

#34. Change Siri to a man
#35. Change your phone’s name
#36. Turn off advertising tracking
#37. Set auto-lock to the maximum time
#38. Set your personal hotspot password to a random three word phrase
#39. Turn on control center everywhere
#40. Turn on Background App Refresh
#41. Delete Garage Band
#42. Develop verbal memory for talking to Siri
#43. Set up these text replacement shortcuts
#44. Set your address
#45. Backup this way

Permalink: /feed/iphone-settings-focus-productivity-health/

Tags: #untagged

lqdev👽07/27/2022

https://klemet.github.io/Workshop-Organization-EN/index.html

Workshop objectives

Discover and learn to use the basic functions of the following software:
- Joplin
- Zotero
- Nextcloud
Learn how to use these different software programs together to manage information using the following methods:
The Zettelkasten method
The P.A.R.A method
The inbox method

Permalink: /feed/open-source-organization-workshop/

Tags: #untagged

lqdev👽07/26/2022

https://www.fast.ai/2022/07/21/dl-coders-22/

A free course designed for people with some coding experience, who want to learn how to apply deep learning and machine learning to practical problems.

Lessons

Getting started
Deployment
Neural net foundations
Natural Language (NLP)
From-scratch Model
Random forests
Collaborative filtering and embeddings
Convolutions (CNNs)

Permalink: /feed/practical-deep-learning-coders-2022/

Tags: #untagged

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Benefits of a multi-agent system

Architecture overview

Prompt engineering and evaluations for research agents

Effective evaluation of agents

Production reliability and engineering challenges

Conclusion

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

How it works

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention

Send me a message or webmention