Viral News

Data Machina #242

Data Machina #242

AI and Causality. The introduction of OpenAI Sora (simulate real worlds from video understanding) has sparked a bit of a debate among some prominent AI researchers. First, What do AI researchers mean by “causal”?Secondly: Do LLMs have causal reasoning capabilities? Can LLMs learn causality from just real world training data? Can LLMs learn, represent, and understand world models and physics? Judea Pearl - a world’s top researchers in Probabilistic AI, Bayesian Networks, and Causal Inference- once famously said in an interview:Deep Learning -albeit complex and non-trivial- it’s a curve fitting exercise. To build truly Intelligent Machines, teach them cause and…
Read More
Databricks Is a Glassdoor Best-Led Company in 2024

Databricks Is a Glassdoor Best-Led Company in 2024

Databricks is pleased to announce we are ranked #2 in the inaugural annual Glassdoor Award List of  Best-Led Companies in 2024! At Databricks, we're not just building cutting-edge technology; we're cultivating a culture of transparency. Our leadership mirrors our internal commitment to collaborative and transparent work practices. Databricks, a Glassdoor Best-Led CompanyOur CEO and Co-founder Ali Ghodsi’s leadership is rooted in truth-seeking and first principles thinking, two of Databricks’ core values that stay true to our origins in academia. As Databricks has grown, we’ve strived to maintain the open and transparent culture we had in our early days at the AMP research lab in UC Berkeley.…
Read More
Data Machina #254

Data Machina #254

On the State of AI Coding Agents. “How could we start using AI to migrate years of messy, flimsy legacy code to a modern stack? ... Perhaps an AI Code Migration Agent ???” We’re doing AI chat & espresso at Level 39, One Canada Square. James -a veteran CTO with all the scars- is asking these rather funny, rhetorical questions. There is a deep silence in the room, pensive faces around. Everyone is staring through the massive windows overlooking The City skyline as the sunset strikes. We wonder in perplexity -in the very philosophical and information theory sense- whether AI…
Read More
Semiconductors on the Data Intelligence Platform

Semiconductors on the Data Intelligence Platform

In the semiconductor industry, research and development tasks, manufacturing processes, and enterprise planning systems produce an array of data artifacts that can be fused to create an intelligent semiconductor enterprise. Through intelligent data use, an intelligent semiconductor enterprise accelerates time to market, increases manufacturing yield, and enhances product reliability.The Databricks Intelligence Platform suits semiconductor enterprises’ unique needs for performance, collaboration, and self-service access. Built on a lakehouse architecture with leading technologies of Delta Lake, Apache Spark™, MLflow, Mosaic AI, and Unity Catalog, the Data Intelligence Platform is the substrate for semiconductor companies to connect engineering technology (ET), operational technology (OT),…
Read More
Data Machina #243

Data Machina #243

Beyond GenAI & LLMs. GenAI & LLMs have literally kidnapped the DL/ML space, and pretty much sucked all the investment and top AI minds, as if there is nothing else under the sun. There are literally hundreds of GenAI models out there. Many of these GenAI models have questionable value or are a repeat-rinse-recycle of the same. Checkout this mega spreadsheet: An Annotated and Updated Directory of GenAI Models. So: Many DL/ML researchers are starting to question the LLM status quo: Shouldn’t we focus $ and brains in new, alt AI/DL paradigms beyond LLMs? And: Do we need so many…
Read More
Unlocking the Potential of Private Data Sharing using Databricks Private Exchanges

Unlocking the Potential of Private Data Sharing using Databricks Private Exchanges

We are thrilled to announce an exciting new feature on the Databricks Marketplace that simplifies the process of setting up private exchanges for all Databricks customers. With this feature, becoming a Private Exchange provider is easier than ever.In this blog post, we will delve deeply into the private exchange features of the Databricks Marketplace. We'll compare various exchange mechanisms—public marketplace and private exchanges—and examine the newly introduced feature that simplifies becoming a private exchange provider.Comparing Private Exchange and Public MarketplaceIn the evolving data sharing and monetization landscape, companies have multiple avenues to distribute their data and AI models. Each method…
Read More
Data Machina #244

Data Machina #244

AI Reasoning Like Humans. The storm has been battering the airport viciously. Three hours later we departed enduring some massive turbulences. Then this: “Captain speaking. This is to inform you that we’ll be performing an auto-pilot landing [watch this] upon arriving to Heathrow.” We should trust the AI-copilot reasoning in harsh situations. Shouldn’t we?… Five days ago, Anthropic introduced next-gen Claude 3 model family. I’ve tried Claude 3: It’s very good at certain language tasks, it pars or beats GPT-4 Turbo in several areas, has a huge context window, and it’s quite cheaper. Funnily enough, it miserably failed at a…
Read More
Announcing Mosaic AI Vector Search General Availability in Databricks

Announcing Mosaic AI Vector Search General Availability in Databricks

Following the announcement we made around a suite of tools for Retrieval Augmented Generation, today we are thrilled to announce the general availability of Mosaic AI Vector Search in Databricks.What is Mosaic AI Vector Search?Vector Search enables developers to improve the accuracy of their Retrieval Augmented Generation (RAG) and generative AI applications through similarity search over unstructured documents such as PDFs, Office Documents, Wikis, and more. This enriches the LLM queries with context and domain knowledge, improving accuracy, and quality of results.Vector Search is part of the Databricks Data Intelligence Platform, making it easy for your RAG and Generative AI applications…
Read More
Introduction to GPT-1 and GPT-2

Introduction to GPT-1 and GPT-2

GPT (Generative Pretrained Transformer) models have changed the landscape of the NLP in the last few years. In 2024, we are seeing complex and close-sourced models like OpenAI’s ChatGPT-3.5 & ChatGPT-4, Anthropic’s Claude, and Google’s Bard. However, the beginnings of these Transformer models were simpler and open-source. Vaswani et al. introduced the Transformer architecture. After that, two more seminal papers by OpenAI, GPT-1 and GPT-2 laid the foundation of almost all the NLP innovations to date. In this article, we will discuss the GPT-1 and GPT-2 models in detail along with their contributions and results. Figure 1. Architecture of the…
Read More
Has Codeium Cracked the Code for AI Assistants?

Has Codeium Cracked the Code for AI Assistants?

(AI generated/Shutterstock) When it comes to AI-powered coding assistants, Microsoft’s Copilot has the name and the numbers. But a competitor called Codeium is growing quickly, and according to its co-founder and CEO Varun Mohan, the sky is the limit for AI assistants. Codeium started life in 2021 as Exafunction, an infrastructure startup that provided big compute for other companies developing deep learning systems. Mohan and his business partner, Douglas Chen, managed 10,000 GPUs on behalf of autonomous vehicle development companies, an industry they previously worked in. But by late 2022, ChatGPT had exploded onto the scene, and Mohan and Chen…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.