AI

12 of the best books on computer vision

12 of the best books on computer vision

Computer vision is expanding quickly and has the potential to completely change how we interact with technology, being at the forefront of many cutting-edge advancements, from self-driving automobiles to augmented reality. Reading a computer vision book can be an excellent approach to learning and acquiring insight into this field and its applications. From the principles of computer vision to more advanced technologies, these books will provide you with a thorough overview of the area and its applications – whether you’re a student, researcher, or professional.In this article, you’ll find 12 of the best books on computer vision:Computer Vision: Algorithms and…
Read More
Research Papers in February 2024: A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM research

Research Papers in February 2024: A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM research

Once again, this has been an exciting month in AI research. This month, I'm covering two new openly available LLMs, insights into small finetuned LLMs, and a new parameter-efficient LLM finetuning technique.The two LLMs mentioned above stand out for several reasons. One LLM (OLMo) is completely open source, meaning that everything from the training code to the dataset to the log files is openly shared.The other LLM (Gemma) also comes with openly available weights but achieves state-of-the-art performance on several benchmarks and outperforms popular LLMs of similar size, such as Llama 2 7B and Mistral 7B, by a large margin.However,…
Read More
AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

Last year, the team began experimenting with a tiny model that uses only a single layer of neurons. (Sophisticated LLMs have dozens of layers.) The hope was that in the simplest possible setting they could discover patterns that designate features. They ran countless experiments with no success. “We tried a whole bunch of stuff, and nothing was working. It looked like a bunch of random garbage,” says Tom Henighan, a member of Anthropic’s technical staff. Then a run dubbed “Johnny”—each experiment was assigned a random name—began associating neural patterns with concepts that appeared in its outputs.“Chris looked at it, and…
Read More
Generating fashion product descriptions by fine-tuning a vision-language model with SageMaker and Amazon Bedrock | Amazon Web Services

Generating fashion product descriptions by fine-tuning a vision-language model with SageMaker and Amazon Bedrock | Amazon Web Services

In the world of online retail, creating high-quality product descriptions for millions of products is a crucial, but time-consuming task. Using machine learning (ML) and natural language processing (NLP) to automate product description generation has the potential to save manual effort and transform the way ecommerce platforms operate. One of the main advantages of high-quality product descriptions is the improvement in searchability. Customers can more easily locate products that have correct descriptions, because it allows the search engine to identify products that match not just the general category but also the specific attributes mentioned in the product description. For example,…
Read More
Poking parts of Sonnet’s brain to make it less annoying

Poking parts of Sonnet’s brain to make it less annoying

In a groundbreaking new paper (actually groundbreaking, IMO), researchers at Anthropic have scaled up an interpretability technique called "dictionary learning" to one of their deployed models, Claude 3 Sonnet. The results provide an unprecedented look inside the mind of a large language model, revealing millions of interpretable features that correspond to specific concepts and behaviors (like sycophancy) and shedding light on the model's inner workings. In this post, we'll explore the key findings of this research, including the discovery of interpretable features, the role of scaling laws, the abstractness and versatility of these features, and their implications for model steering…
Read More
Numenta Unveils NuPIC 2.0

Numenta Unveils NuPIC 2.0

A breakthrough for AI, NuPIC empowers enterprises to easily deploy robust generative AI applications on commodity CPUs Numenta Inc., the world leader in deploying large AI models on CPUs, announced version 2.0 of its flagship product, the Numenta Platform for Intelligent Computing (NuPIC). NuPIC empowers companies to deploy large language models (LLMs) on CPUs, offering an efficient, scalable, and secure solution. With a focus on flexibility and real-world applications, NuPIC makes it easy for businesses to choose and deploy the right model for the right task. Whether customers want to run an existing model, fine-tune models with their proprietary data,…
Read More
Adobe introduces AI-powered eraser to Lightroom

Adobe introduces AI-powered eraser to Lightroom

Say goodbye to photobombs. Adobe is introducing an AI-driven Generative Remove feature to its Lightroom photo editor. This feature simplifies the removal of unwanted elements like that annoying person in the background. Currently in public beta, it works seamlessly across the Lightroom ecosystem on mobile, desktop, and web platforms.Streamlined editing with Firefly AILightroom's Generative Remove effortlessly replaces unwanted elements using Adobe's Firefly AI engine. Paint over the area you want to remove, and Lightroom sends this information to Adobe's Firefly servers, which process the data and return the edited image. In contrast to Adobe Photoshop's Reference Image feature, which allows…
Read More
Tips for LLM Pretraining and Evaluating Reward Models

Tips for LLM Pretraining and Evaluating Reward Models

It's another month in AI research, and it's hard to pick favorites.Besides new research, there have also been many other significant announcements. Among them, xAI has open-sourced its Grok-1 model, which, at 314 billion parameters, is the largest open-source model yet. Additionally, reports suggest that Claude-3 is approaching or even exceeding the performance of GPT-4. Then there’s also Open-Sora 1.0 (a fully open-source project for video generation), Eagle 7B (a new RWKV-based model), Mosaic’s 132 billion parameter DBRX (a mixture-of-experts model), and AI21's Jamba (a Mamba-based SSM-transformer model).However, since detailed information about these models is quite scarce, I'll focus on…
Read More
The Low-Paid Humans Behind AI’s Smarts Ask Biden to Free Them From ‘Modern Day Slavery’

The Low-Paid Humans Behind AI’s Smarts Ask Biden to Free Them From ‘Modern Day Slavery’

AI projects like OpenAI’s ChatGPT get part of their savvy from some of the lowest-paid workers in the tech industry—contractors often in poor countries paid small sums to correct chatbots and label images. On Wednesday, 97 African workers who do AI training work or online content moderation for companies like Meta and OpenAI published an open letter to President Biden, demanding that US tech companies stop “systemically abusing and exploiting African workers.”Most of the letter’s signatories are from Kenya, a hub for tech outsourcing, whose president, William Ruto, is visiting the US this week. The workers allege that the practices…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.