stp2y

33065 Posts
FlashAttention-3 unleashes the power of H100 GPUs for LLMs

FlashAttention-3 unleashes the power of H100 GPUs for LLMs

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Attention is a core component of the transformer architecture used in large language models (LLMs). But as LLMs grow larger and handle longer input sequences, the computational cost of attention becomes a bottleneck.  To address this challenge, researchers from Colfax Research, Meta, Nvidia, Georgia Tech, Princeton University, and Together AI have introduced FlashAttention-3, a new technique that significantly speeds up attention computation on Nvidia Hopper GPUs (H100 and H800). FlashAttention-3 builds upon previous work on FlashAttention and FlashAttention-2 and further optimizes…
Read More
AI’s Outrageous Environmental Toll Is Probably Worse Than You Think

AI’s Outrageous Environmental Toll Is Probably Worse Than You Think

Wow, that's *bad.*Up in the AirBy now, you're probably well aware of the staggering energy and resource costs of generative AI. But even if the whole industry is a bubble ready to burst, chances are that the environmental toll we're hearing about now is only going to get worse — because AI's appetite is absolutely insatiable.Consider the obscene amounts of water that's needed just to cool the data centers that train and host generative AI models, which is somewhere in the millions of gallons per year. Internal estimates from Microsoft about its data facility in Goodyear, Arizona, for example, show…
Read More
Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures?

Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures?

arXiv:2407.08922v1 Announce Type: new Abstract: With the rapid development of artificial intelligence (AI), large language models (LLMs) such as GPT-4 have garnered significant attention in the scientific community, demonstrating great potential in advancing scientific discovery. This progress raises a critical question: are these LLMs well-aligned with real-world physicochemical principles? Current evaluation strategies largely emphasize fact-based knowledge, such as material property prediction or name recognition, but they often lack an understanding of fundamental physicochemical mechanisms that require logical reasoning. To bridge this gap, our study developed a benchmark consisting of 775 multiple-choice questions focusing on the mechanisms of gold nanoparticle synthesis.…
Read More
Thomson Reuters’ Future of Professionals Report Shows Cautious Optimism Toward AI in Law

Thomson Reuters’ Future of Professionals Report Shows Cautious Optimism Toward AI in Law

It has been a common refrain today that generative AI can take over simpler tasks but struggles with more difficult ones. In that case, how much does generative AI actually save time or improve performance at work? Thomson Reuters, a professional services and technology company in the fields of law, tax, compliance and more, explored how professionals are using AI in its 2024 Future of Professionals report. We spoke to Thomson Reuters Chief Product Officer David Wong about generative AI in the workplace in an exclusive interview about the release of the report. Thomson Reuters surveyed 2,205 professionals in legal,…
Read More
Greenland sharks can live for over 250 years, and scientists want to use their anti-aging secrets to help humans live longer

Greenland sharks can live for over 250 years, and scientists want to use their anti-aging secrets to help humans live longer

Abigail Adams, wife of the second US president, was born in 1744. It's entirely possible that there are Greenland sharks still living today that were swimming in the North Atlantic Ocean at the time.There's no doubt that these large, carnivorous sharks can live hundreds of years. In 2016, researchers discovered they can survive for at least 272 years, but they might get as old as 400.However, why these sharks have that kind of longevity is more of a mystery. Some theories include the shark's slow growth rate and low metabolic rate, but research is ongoing.Scientists hope that unlocking the secrets…
Read More
Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval

Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval

arXiv:2407.08908v1 Announce Type: new Abstract: Image retrieval plays a pivotal role in applications from wildlife conservation to healthcare, for finding individual animals or relevant images to aid diagnosis. Although deep learning techniques for image retrieval have advanced significantly, their imperfect real-world performance often necessitates including human expertise. Human-in-the-loop approaches typically rely on humans completing the task independently and then combining their opinions with an AI model in various ways, as these models offer very little interpretability or textit{correctability}. To allow humans to intervene in the AI model instead, thereby saving human time and effort, we adapt the Concept Bottleneck Model…
Read More
Alleged Pixel 9 and Pixel 9 XL leak shows a redesigned camera bar

Alleged Pixel 9 and Pixel 9 XL leak shows a redesigned camera bar

We’re less than a month away from the next Made by Google event, and we may already know what one of the marquee announcements will look like. TikTok user pixo_unpacking (via YTechB) posted videos over the weekend of apparent pre-production samples of the Pixel 9 and Pixel 9 XL.The phones in the video have different backs: a glossy finish on the standard Pixel 9’s rear and a matte one on the larger Pixel XL’s. 9to5Google notes that they appear to include rear-panel etchings Google uses for prototypes, although they’re mostly covered in the clips by labels.Based on the video, the…
Read More
Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification

Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification

arXiv:2407.08959v1 Announce Type: new Abstract: Recently, various pre-trained language models (PLMs) have been proposed to prove their impressive performances on a wide range of few-shot tasks. However, limited by the unstructured prior knowledge in PLMs, it is difficult to maintain consistent performance on complex structured scenarios, such as hierarchical text classification (HTC), especially when the downstream data is extremely scarce. The main challenge is how to transfer the unstructured semantic space in PLMs to the downstream domain hierarchy. Unlike previous work on HTC which directly performs multi-label classification or uses graph neural network (GNN) to inject label hierarchy, in this…
Read More
In response to Rockset-OpenAI: a brief real-time analytics manifesto | Deephaven

In response to Rockset-OpenAI: a brief real-time analytics manifesto | Deephaven

OpenAI’s acquisition of Rockset sent waves through the data infrastructure industry. On the heels of Databricks’ billion-dollar reach for Tabular, the industry is both in play and in flux.Companies like Clickhouse, StarTree, and Imply have published pieces that pontificate about the impact of OpenAI’s acquisition on the landscape and pitch their gear to Rockset customers looking for a Plan B.Even my loved ones aren’t interested in my armchair quarterbacking of OpenAI’s strategy, so I’ll let the All-In, Ben & Mark, and Acquired podcasts weigh in. However, as the CEO of Deephaven Data Labs, a company developing software for today’s most…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.