GenAI

To optimize guide-dog robots, first listen to the visually impaired

To optimize guide-dog robots, first listen to the visually impaired

What features does a robotic guide dog need? Ask the blind, say the authors of an award-winning paper. Led by researchers at the University of Massachusetts Amherst, a study identifying how to develop robot guide dogs with insights from guide dog users and trainers won a Best Paper Award at CHI 2024: Conference on Human Factors in Computing Systems (CHI). Guide dogs enable remarkable autonomy and mobility for their handlers. However, only a fraction of people with visual impairments have one of these companions. The barriers include the scarcity of trained dogs, cost (which is $40,000 for training alone), allergies…
Read More
Pocket-Sized AI Models Could Unlock a New Era of Computing

Pocket-Sized AI Models Could Unlock a New Era of Computing

When ChatGPT was released in November 2023, it could only be accessed through the cloud because the model behind it was downright enormous.Today I am running a similarly capable AI program on a Macbook Air, and it isn’t even warm. The shrinkage shows how rapidly researchers are refining AI models to make them leaner and more efficient. It also shows how going to ever larger scales isn’t the only way to make machines significantly smarter.The model now infusing my laptop with ChatGPT-like wit and wisdom is called Phi-3-mini. It’s part of a family of smaller AI models recently released by…
Read More
Least-Squares Concept Erasure with Oracle Concept Labels

Least-Squares Concept Erasure with Oracle Concept Labels

This post assumes some familiarity with the idea of concept erasure and our LEACE concept erasure method. We encourage the reader to consult our arXiv paper for background. For a PyTorch implementation of this method, see the OracleFitter class in our GitHub repository. WARNING: Because this erasure transformation depends on the ground truth concept label, it can increase the nonlinearly-extractable information about the target concept inside a representation, even though it eliminates the linearly available information. For this reason, optimizing deep neural networks on top of O-LEACE'd representations is not recommended; for those use cases we recommend vanilla LEACE. In…
Read More

Thoughts on Academia and Industry in Machine Learning Research • David Stutz

Introduction By construction, a PhD has a clear end. Depending on the program, country and field, a PhD is supposed to be done within 3-6 years when it is usually awarded after an official defense of the research work. This is in contrast to most other careers and jobs, especially in industry but also in the public sector. Even though a PhD is often considered as a qualification for independent research and thereby acts as the entry to an academic career, it is commonly assumed that most PhD graduates do not continue in academia. This also matches my impression and…
Read More
Neo4j CTO says new Graph Query Language standard will have ‘massive ripple effects’ – SiliconANGLE

Neo4j CTO says new Graph Query Language standard will have ‘massive ripple effects’ – SiliconANGLE

The International Standards Organization’s publication of a standard for the Graph Query Language earlier this month generated relatively little media interest, but executives at graph database makers were turning virtual cartwheels in the halls. The GQL standard is the first to be ratified by the ISO since Structured Query Language in 1986. The 600-page document defines the rules for “creating, accessing, querying, maintaining, and controlling property graphs and the data they comprise.” A property graph is a type of data structure used primarily in graph databases and graph processing frameworks. It consists of nodes and edges that represent relationships that…
Read More
Microsoft Copilot Cheat Sheet: Benefits, Price and Versions

Microsoft Copilot Cheat Sheet: Benefits, Price and Versions

Microsoft Copilot, with its integration into Windows, Bing, 365, Azure, and Server, is purported to be the AI that unlocks the creative and productive potential of an organization’s people and data. What is Microsoft Copilot? Microsoft Copilot is an AI product that combines the power of large language models with in-house enterprise data generated by the Microsoft Graph and Microsoft 365 applications. Using the power of AI and natural language conversations, users can find better answers to their questions and potentially create content from those answers. Copilot was developed on the ChatGPT platform and announced as an in-development platform at…
Read More
3D printing robot creates extreme shock-absorbing shape, with help of AI

3D printing robot creates extreme shock-absorbing shape, with help of AI

Inside a lab in Boston University's College of Engineering, a robot arm drops small, plastic objects into a box placed perfectly on the floor to catch them as they fall. One by one, these tiny structures -- feather-light, cylindrical pieces, no bigger than an inch tall -- fill the box. Some are red, others blue, purple, green, or black. Each object is the result of an experiment in robot autonomy. On its own, learning as it goes, the robot is searching for, and trying to make, an object with the most efficient energy-absorbing shape to ever exist. To do this,…
Read More
Adobe Adds an AI-Powered Eraser to Lightroom

Adobe Adds an AI-Powered Eraser to Lightroom

Photo bombing is dead. Adobe is adding an artificial-intelligence-powered Generative Remove feature to its Lightroom photo editor that makes it dead simple to zap out unwanted elements, like that annoying guy in the background. The new feature is in a public beta-testing phase, but it will work across the Lightroom ecosystem whether you're using the app on mobile, desktop, or web.Lightroom's Generative Remove uses Adobe's Firefly AI engine to smoothly replace unwanted elements. Simply paint over the area you want to remove and Lightroom will send that information to Adobe's Firefly servers, which then crunch the data and send it…
Read More
The Foundation Model Development Cheatsheet

The Foundation Model Development Cheatsheet

The pace of foundation model releases and progress has continued to grow rapidly over the past few years, with many new models released from organizations of all kinds worldwide. In addition to releasing models themselves, it's also important to make the tools to create these models - large-scale training libraries, data processing and creation tooling, and more - widely available. In April 2023 we released the Pythia model suite, the first LLMs with a fully released and reproducible technical pipeline from start to finish. We are excited to see other organizations following suit, with the LLM360 project releasing Amber later…
Read More
How to Optimize Hyperparameter Search Using Bayesian Optimization and Optuna

How to Optimize Hyperparameter Search Using Bayesian Optimization and Optuna

Hyperparameter optimization is an integral part of machine learning. It aims to find the best set of hyperparameter values to achieve the best model performance. Grid search and random search are popular hyperparameter tuning methods. They roam around the entire search space to get the best set of hyperparameters, which makes them time-consuming and inefficient for larger datasets. Based on Bayesian logic, Bayesian optimization considers the model performance for previous hyperparameter combinations while determining the next set of hyperparameters to evaluate. Optuna is a popular tool for Bayesian hyperparameter optimization. It provides easy-to-use algorithms, automatic algorithm selection, integrations with a…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.