generative ai

NVIDIA’s new AI model Fugatto can create audio from text prompts

NVIDIA’s new AI model Fugatto can create audio from text prompts

NVIDIA has debuted a new experimental generative AI model, which it describes as "a Swiss Army knife for sound." The model called Foundational Generative Audio Transformer Opus 1, or Fugatto, can take commands from text prompts and use them to create audio or to modify existing music, voice and sound files. It was designed by a team of AI researchers from around the world, and NVIDIA says that made the model's "multi-accent and multilingual capabilities stronger.""We wanted to create a model that understands and generates sound like humans do," said Rafael Valle, one of the researchers behind the project and…
Read More
AI Agent Systems: Modular Engineering for Reliable Enterprise AI Applications

AI Agent Systems: Modular Engineering for Reliable Enterprise AI Applications

Monolithic to ModularThe proof of concept (POC) of any new technology often starts with large, monolithic units that are difficult to characterize. By definition, POCs are designed to show that a technology works without considering issues around extensibility, maintenance, and quality. However, once technologies achieve maturity and are deployed widely, these needs drive product development to be broken down into smaller, more manageable units. This is the fundamental concept behind systems thinking and why we are seeing AI implementation move from models to AI agent systems. The concept of modular design has been applied to:Cars: seats, tires, lights, and engines can…
Read More
Fine-tuning Llama 3.1 with Long Sequences

Fine-tuning Llama 3.1 with Long Sequences

We are excited to announce that Mosaic AI Model Training now supports the full context length of 131K tokens when fine-tuning the Meta Llama 3.1 model family. With this new capability, Databricks customers can build even higher-quality Retrieval Augmented Generation (RAG) or tool use systems by using long context length enterprise data to create specialized models.The size of an LLM’s input prompt is determined by its context length. Our customers are often limited by short context lengths, especially in use cases like RAG and multi-document analysis. Meta Llama 3.1 models have a long context length of 131K tokens. For comparison,…
Read More
Adobe previews AI video tools that arrive later this year

Adobe previews AI video tools that arrive later this year

On Wednesday, Adobe unveiled Firefly AI video generation tools that will arrive in beta later this year. Like many things related to AI, the examples are equal parts mesmerizing and terrifying as the company slowly integrates tools built to automate much of the creative work its prized user base is paid for today. Echoing AI salesmanship found elsewhere in the tech industry, Adobe frames it all as supplementary tech that “helps take the tedium out of post-production.”Adobe describes its new Firefly-powered text-to-video, Generative Extend (which will be available in Premiere Pro) and image-to-video AI tools as helping editors with tasks…
Read More
Google brings the AI feature that told Americans to eat rocks to six more countries

Google brings the AI feature that told Americans to eat rocks to six more countries

Google is expanding AI Overviews, the feature that summarizes answers to complex questions from the web and presents them at the top of traditional search results, to six more countries — India, Japan, Mexico, Indonesia, Brazil and the United Kingdom — from Thursday with support for local languages as well as English.That’s less than three months after AI Overviews launched in the United States and promptly told people to eat rocks and put glue on their pizzas. Bringing them to millions more people begs the question: How do you prevent another glue pizza fiasco in a foreign country?“It’s a challenging…
Read More
Opera’s AI-focused web browser One is now on iOS

Opera’s AI-focused web browser One is now on iOS

Opera One, the browser with a focus on generative AI features that Opera launched for desktop last year, is now available for iOS devices. It retains its desktop counterpart's cleaner look, but it comes with a full screen interface and features specifically designed for mobile use. The company said it experienced a 63 percent growth in new users across the European Union after the Digital Markets Act was implemented, and now it has "embraced the opportunities presented by the new regulatory landscape."Users will be able to move their search bar to the bottom of the screen if that will make…
Read More
Meta is reportedly offering millions to use Hollywood voices in AI projects

Meta is reportedly offering millions to use Hollywood voices in AI projects

A future artificial intelligence product by Meta could have you chatting with celebrities. According to Bloomberg and The New York Times, the company is in talks with Awkwafina, Judi Dench and Keegan-Michael Key, among other celebrities from various Hollywood agencies for its AI projects. The company apparently intends to incorporate their voices into a conversational generative AI-slash-digital assistant called MetaAI, which is similar to Siri and Google Assistant.Meta plans to record their voices and to secure the right to use them for as many situations as possible across Facebook, Messenger, Instagram, WhatsApp and even the Ray-Ban Meta glasses. Bloomberg says…
Read More
Websites accuse AI startup Anthropic of bypassing their anti-scraping rules and protocol

Websites accuse AI startup Anthropic of bypassing their anti-scraping rules and protocol

Freelancer has accused Anthropic, the AI startup behind the Claude large language models, of ignoring its "do not crawl" robots.txt protocol to scrape its websites' data. Meanwhile, iFixit CEO Kyle Wiens said Anthropic has ignored the website's policy prohibiting the use of its content for AI model training. Matt Barrie, the chief executive of Freelancer, told The Information that Anthropic's ClaudeBot is "the most aggressive scraper by far." His website allegedly got 3.5 million visits from the company's crawler within a span of four hours, which is "probably about five times the volume of the number two" AI crawler. Similarly,…
Read More
AI video startup Runway reportedly trained on ‘thousands’ of YouTube videos without permission

AI video startup Runway reportedly trained on ‘thousands’ of YouTube videos without permission

AI company Runway reportedly scraped “thousands” of YouTube videos and pirated versions of copyrighted movies without permission. 404 Media obtained alleged internal spreadsheets suggesting the AI video-generating startup trained its Gen-3 model using YouTube content from channels like Disney, Netflix, Pixar and popular media outlets.An alleged former Runway employee told the publication the company used the spreadsheet to flag lists of videos it wanted in its database. It would then download them without detection using open-source proxy software to cover its tracks. One sheet lists simple keywords like astronaut, fairy and rainbow, with footnotes indicating whether the company had found…
Read More
Google gives free Gemini users access to its faster, lighter 1.5 Flash AI model

Google gives free Gemini users access to its faster, lighter 1.5 Flash AI model

Google is making its Gemini AI faster and more efficient across the board. You now have access to 1.5 Flash, its generative AI model designed to be able to generate responses more quickly and efficiently, even if you're not paying for Gemini Advanced. The company says you'll notice improvements in latency, as well as the tool's reasoning and image understanding, on both the web and mobile.In addition, it's expanding the AI assistant's context window, so that you can have longer conversations with it and ask it more complex questions. In the near future, Google will also give you the ability…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.