Tips for LLM Pretraining and Evaluating Reward Models

Tips for LLM Pretraining and Evaluating Reward Models

It's another month in AI research, and it's hard to pick favorites.Besides new research, there have also been many other significant announcements. Among them, xAI has open-sourced its Grok-1 model, which, at 314 billion parameters, is the largest open-source model yet. Additionally, reports suggest that Claude-3 is approaching or even exceeding the performance of GPT-4. Then there’s also Open-Sora 1.0 (a fully open-source project for video generation), Eagle 7B (a new RWKV-based model), Mosaic’s 132 billion parameter DBRX (a mixture-of-experts model), and AI21's Jamba (a Mamba-based SSM-transformer model).However, since detailed information about these models is quite scarce, I'll focus on…
Read More
Microsoft CEO Bashes Human-Like AI After OpenAI’s Scarlett Johansson Scandal

Microsoft CEO Bashes Human-Like AI After OpenAI’s Scarlett Johansson Scandal

"I don’t need any artificial intelligence."I, RobotAfter OpenAI got in trouble for copying actor Scarlett Johansson's voice for a new ChatGPT voice assistant, the head honcho at Microsoft — a major investor and close partner of OpenAI — bashed human-like AIs in a surprising interview on Monday."I don't like anthropomorphizing AI," Microsoft CEO Satya Nadella told Bloomberg Television. "I sort of believe it's a tool.""It has got intelligence, if you want to give it that moniker, but it’s not the same intelligence that I have," he added, while also dinging the term "artificial intelligence.""I think one of the most unfortunate…
Read More
Yi-34B, Llama 2, and common practices in LLM training: a fact check of the New York Times

Yi-34B, Llama 2, and common practices in LLM training: a fact check of the New York Times

On February 21 2024, the New York Times published “China’s Rush to Dominate A.I. Comes With a Twist: It Depends on U.S. Technology.” The authors claim that Yi-34B, a recent large language model by the Chinese startup 01.AI, is fundamentally indebted to Meta’s Llama 2: There was just one twist: Some of the technology in 01.AI’s system came from Llama. Mr. Lee’s start-up then built on Meta’s technology, training its system with new data to make it more powerful. This assessment is based on a misreading of the cited Hugging Face issue. While we make no claims about the overall…
Read More
The Role of Synthetic Data in Cybersecurity

The Role of Synthetic Data in Cybersecurity

Data's value is something of a double-edged sword. On one hand, digital data lays the groundwork for powerful AI applications, many of which could change the world for the better. Conversely, storing so many details on people creates huge privacy risks. Synthetic data provides a possible solution. What Is Synthetic Data? Synthetic data is a subset of anonymized data – data that doesn't reveal any real-world details. More specifically, it refers to information that looks and acts like real-world data but has no ties to actual people, places or events. In short, it's fake data that can produce real results. In…
Read More
OpenAI will reportedly pay $250 million to put News Corp’s journalism in ChatGPT

OpenAI will reportedly pay $250 million to put News Corp’s journalism in ChatGPT

OpenAI and News Corp, the owner of The Wall Street Journal, MarketWatch, The Sun, and more than a dozen other publishing brands, have struck a multi-year deal to display news from these publications in ChatGPT, News Corp announced on Wednesday. OpenAI will be able to access both current and well as archived content from News Corp’s publications and use the data to further train its AI models. Neither company disclosed the terms of the deal, but a report in The Wall Street Journal estimated that News Corp would get $250 million over five years in cash and credits.“The pact acknowledges…
Read More
The Low-Paid Humans Behind AI’s Smarts Ask Biden to Free Them From ‘Modern Day Slavery’

The Low-Paid Humans Behind AI’s Smarts Ask Biden to Free Them From ‘Modern Day Slavery’

AI projects like OpenAI’s ChatGPT get part of their savvy from some of the lowest-paid workers in the tech industry—contractors often in poor countries paid small sums to correct chatbots and label images. On Wednesday, 97 African workers who do AI training work or online content moderation for companies like Meta and OpenAI published an open letter to President Biden, demanding that US tech companies stop “systemically abusing and exploiting African workers.”Most of the letter’s signatories are from Kenya, a hub for tech outsourcing, whose president, William Ruto, is visiting the US this week. The workers allege that the practices…
Read More
Webinar Replay – Space Loves AI: How AI promises to transform space operations

Webinar Replay – Space Loves AI: How AI promises to transform space operations

For satellite operators, AI’s potential benefits are impossible to ignore. As Earth observation and communications constellations expand, AI tools promise to streamline operations, reduce on-orbit collisions and speed up analysis of remote sensing data. Opportunities and challenges for space-based AI with experts from the Aerospace Corporation, Stanford University’s Center for AEroSpace Autonomy Research (CAESAR), Magnestar, and Redwire Space were discussed. Panelists Mike Nemerouf, DirectorInnovation, Science and Technology Aerospace Corporation Al Tadros, CTORedwire Space Simone D’Amico, Co-founderStanford Center for AEroSpace Autonomy Research Moderator Debra WernerSenior Staff WriterSpaceNews Sponsored By Whether your mission requires persistent, high-resolution imagery or resilient communications, Redwire’s highly…
Read More
Building MLOps Capabilities at GitLab As a One-Person ML Platform Team

Building MLOps Capabilities at GitLab As a One-Person ML Platform Team

Eduardo Bonet is an incubation engineer at GitLab, building out their MLOps capabilities. One of the first features Eduardo implemented in this role was a diff for Jupyter Notebooks, bringing code reviews into the data science process. Eduardo believes in an iterative, feedback-driven product development process, although he emphasizes that “minimum viable change” does not necessarily mean that there is an immediately visible value-add from the user’s point of view. While LLMs are quickly gaining traction, Eduardo thinks they’ll not replace ML or traditional software engineering but add to the capabilities. Thus, he believes that GitLab’s current focus on MLOps…
Read More
Generative AI Translation Startup DeepL Locks Up $300M

Generative AI Translation Startup DeepL Locks Up $300M

Investors can’t get enough of different ways to use generative AI. Translation and language startup DeepL became the latest startup using generative AI to raise big, nabbing $300 million at a $2 billion post-money valuation in a round led by Index Ventures. The valuation is about double its previous $1 billion-plus valuation from January 2023. The round included participation from ICONIQ Growth, Teachers’ Venture Growth, IVP, Atomico and WiL (World Innovation Lab). The German startup language AI platform offers writing, editing and translation services for 63 markets and 32 languages for business use cases. The company said it has already…
Read More
Announcing General Availability of Liquid Clustering

Announcing General Availability of Liquid Clustering

We’re excited to announce the General Availability of Delta Lake Liquid Clustering in the Databricks Data Intelligence Platform. Liquid Clustering is an innovative data management technique that replaces table partitioning and ZORDER so you no longer have to fine-tune your data layout to achieve optimal query performance.  Liquid clustering significantly simplifies data layout-related decisions and provides the flexibility to redefine clustering keys without data rewrites. It allows data layout to evolve alongside analytic needs over time – something you could never do with partitioning on Delta.  Since the Public Preview of Liquid Clustering at the Data and AI Summit last year, we’ve…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.