GenAI

ChatGPT Gets an Upgrade With ‘Natively Multimodal’ GPT-4o

OpenAI’s Spring Update livestream on May 13 brought three major announcements from the AI company: a new flagship AI model called GPT-4o, a desktop ChatGPT app for macOS, and access to more features for ChatGPT users who don’t pay for a subscription. The coming update to ChatGPT “brings GPT-4 level intelligence to everyone, including our free users,” said OpenAI Chief Technology Officer Mira Murati during the livestream.

GPT-4o improves on GPT-4 Turbo’s voice and video capabilities

Murati said OpenAI’s next flagship model GPT-4o is…

Animal brain inspired AI game changer for autonomous robots

A team of researchers at Delft University of Technology has developed a drone that flies autonomously using neuromorphic image processing and control based on the workings of animal brains. Animal brains use less data and energy than current deep neural networks running on GPUs (graphics chips). Neuromorphic processors are therefore very suitable for small drones because they don't need heavy and large hardware and batteries. The results are extraordinary: during flight the drone's deep neural network processes data up to 64 times faster and consumes three times less energy than when running on a GPU. Further developments of this…

VINC-S: Closed-form Optionally-supervised Knowledge Elicitation with Paraphrase Invariance

In Spring 2023, a team at EleutherAI and elsewhere worked on a follow-up to CCS that aimed to improve its robustness, among other goals. We think the empirical side of the project was largely unsuccessful, failing to provide evidence that any method had predictably better generalization properties. In the spirit of transparency, we are sharing our proposed method and some results on the Quirky Models benchmark.

Introduction

As we rely more and more on large language models (LLMs) to automate cognitive labor, it's increasingly important that we can trust them to be truthful. Unfortunately, LLMs often reproduce human…

On the Utility of Conformal Prediction Intervals • David Stutz

This article is meant as an ad-hoc response to Ben Recht’s recent blog series on whether we need conformal prediction intervals. I have been thinking a lot about the use of conformal prediction myself and this seems like a good opportunity to share some thoughts and learnings from working on conformal prediction the past few years. Ben Recht recently published some blog articles questioning the utility of prediction intervals and sets, especially as obtained using distribution-free, conformal methods. In this article, I want to add some color to the discussion given my experience with applying these methods in various settings.…

Strategic growth: commercetools surges towards IPO – SiliconANGLE

Recently, commercetools GmbH achieved significant momentum, setting the company up for ongoing success and continuous innovative developments in the e-commerce market. This news highlights the value consumers find in commercetools as it gains market share over its competition, helping pave the way to an initial public offering and even more considerable strategic growth. “That tightening of the belt I actually think is good for business; I think it’s a smart thing to do, and you move from that growth at all costs to efficient growth and smart growth,” said Dan Murphy, chief financial officer of commercetools. “Commercetools has been able…

Snowflake Arctic, a New AI LLM for Enterprise Tasks, is Coming to APAC

Data cloud provider Snowflake has launched an open source large language model, Arctic LLM, as part of a growing portfolio of AI offerings helping enterprises leverage their data. Typical use cases include data analysis, such as sentiment analysis of reviews; chatbots for customer service or sales; and business intelligence queries, like the extraction of revenue information. Snowflake’s Arctic is being offered alongside other LLMs from Meta, Mistral AI, Google and Reka in its Cortex product, which is only available in select regions. Snowflake said Cortex will be available in APAC in Japan in June via the AWS Asia Pacific (Tokyo)…

To optimize guide-dog robots, first listen to the visually impaired

What features does a robotic guide dog need? Ask the blind, say the authors of an award-winning paper. Led by researchers at the University of Massachusetts Amherst, a study identifying how to develop robot guide dogs with insights from guide dog users and trainers won a Best Paper Award at CHI 2024: Conference on Human Factors in Computing Systems (CHI). Guide dogs enable remarkable autonomy and mobility for their handlers. However, only a fraction of people with visual impairments have one of these companions. The barriers include the scarcity of trained dogs, cost (which is $40,000 for training alone), allergies…

Pocket-Sized AI Models Could Unlock a New Era of Computing

When ChatGPT was released in November 2022, it could only be accessed through the cloud because the model behind it was downright enormous. Today I am running a similarly capable AI program on a MacBook Air, and it isn’t even warm. The shrinkage shows how rapidly researchers are refining AI models to make them leaner and more efficient. It also shows how going to ever larger scales isn’t the only way to make machines significantly smarter. The model now infusing my laptop with ChatGPT-like wit and wisdom is called Phi-3-mini. It’s part of a family of smaller AI models recently released by…

Least-Squares Concept Erasure with Oracle Concept Labels

This post assumes some familiarity with the idea of concept erasure and our LEACE concept erasure method. We encourage the reader to consult our arXiv paper for background. For a PyTorch implementation of this method, see the OracleFitter class in our GitHub repository. WARNING: Because this erasure transformation depends on the ground truth concept label, it can increase the nonlinearly-extractable information about the target concept inside a representation, even though it eliminates the linearly available information. For this reason, optimizing deep neural networks on top of O-LEACE'd representations is not recommended; for those use cases we recommend vanilla LEACE. In…

Thoughts on Academia and Industry in Machine Learning Research • David Stutz

Introduction

By construction, a PhD has a clear end. Depending on the program, country and field, a PhD is supposed to be done within 3-6 years when it is usually awarded after an official defense of the research work. This is in contrast to most other careers and jobs, especially in industry but also in the public sector. Even though a PhD is often considered as a qualification for independent research and thereby acts as the entry to an academic career, it is commonly assumed that most PhD graduates do not continue in academia. This also matches my impression and…