FasterViT for Image Classification

FasterViT for Image Classification

FasterViT is a family of Vision Transformer models that is both fast and provides better accuracy than other ViT models. It combines the local representation learning of CNNs and the global learning properties of ViTs. In this article, we will cover the FasterViT model for image classification. Figure 1. FasterViT architecture, throughput, and benchmark on ImageNet1K. We will go through image inference using the pretrained network along with a brief of its architectural components. Furthermore, we will also fine-tune a FasterViT model for image classification. We will cover the following topics in this article We will start with a discussion…
Read More
Language Modeling Reading List (to Start Your Paper Club)

Language Modeling Reading List (to Start Your Paper Club)

Some friends and I started a weekly paper club to read and discuss fundamental papers in language modeling. By pooling together our shared knowledge, experience, and questions, we learned more as a group than we could have individually. To encourage others to do the same, here’s the list of papers we covered, and a one-sentence summary for each. I’ll update this list with new papers as we discuss them. (Also, why and how to read papers .) Attention Is All You Need: Query, Key, and Value are all you need* (*Also position embeddings, multiple heads, feed-forward layers, skip-connections, etc.) GPT:…
Read More
Scarlett Johansson’s OpenAI Feud Makes Her an Uncanny Folk Hero

Scarlett Johansson’s OpenAI Feud Makes Her an Uncanny Folk Hero

There is a distinct moment in the Marvel Cinematic Universe when Black Widow became a hero for the everyfan. It happens early in 2012’s The Avengers: She’s tied to a chair. Agent Coulson calls. A nondescript military leader who has been interrogating her hands her the phone. Coulson explains that S.H.I.E.L.D. needs to pull her out of the field. She kicks her questioner in the shin, smashes the chair she’s tied to, takes out three dudes, grabs her heels, and leaves.The Avengers went on to make $1.5 billion globally and catapulted nearly everyone in it to superstardom, even the actors…
Read More
AI at the crossroads of cybersecurity, space and national security in the digital age

AI at the crossroads of cybersecurity, space and national security in the digital age

Technological prowess, especially regarding humanity’s increased presence in space, is increasingly becoming the linchpin of global competitiveness and national security. There, new opportunities to integrate AI are accompanied by a new generation of risks. Artificial intelligence in particular plays a crucial role in democratizing access to space exploration and research, opening it to many beyond just governmental space agencies, as evidenced by the large number of commercially financed and operated space launches over the last five years. As launch companies adopt AI-enabled autonomous flight safety systems, Space Launch Delta 45 is saving on mission control chairs and looping out about…
Read More
Enterprises Have Just Two Years to Harness the Full Potential of GenAI: Genpact and HFS Report

Enterprises Have Just Two Years to Harness the Full Potential of GenAI: Genpact and HFS Report

(Berit Kessler/Shutterstock) The advent of GenAI has proven to be the first real innovation to disrupt industry since the advent of the internet. While GenAI is only over a year old, it has left enterprises scrambling to gain a competitive advantage. However, the window of opportunity for these enterprises may be shorter than anticipated. Enterprises have only two years to adopt GenAI before competitive disadvantages emerge, according to a new report by Genpact and HFS Research. The report also highlights that only 5% of enterprises have mature GenAI initiatives, signaling an urgent need for acceleration of GenAI adoption.  Genpact is…
Read More
Training Diffusion Models with  Reinforcement Learning

Training Diffusion Models with Reinforcement Learning

Training Diffusion Models with Reinforcement Learning replay Diffusion models have recently emerged as the de facto standard for generating complex, high-dimensional outputs. You may know them for their ability to produce stunning AI art and hyper-realistic synthetic images, but they have also found success in other applications such as drug design and continuous control. The key idea behind diffusion models is to iteratively transform random noise into a sample, such as an image or protein structure. This is typically motivated as a maximum likelihood estimation problem, where the model is trained to generate samples that match the training data as closely as…
Read More

PRECISE Seminar Talk “Evaluating and Calibrating AI Models with Uncertain Ground Truth” • David Stutz

PRECISE Seminar Talk “Evaluating and Calibrating AI Models with Uncertain Ground Truth” I had the pleasure to present our work on evaluating and calibrating with uncertain ground truth at the seminar series of the PRECISE center at the University of Pennsylvania. Besides talking about our recent papers on evaluating AI models in health with uncertain ground truth and conformal prediction with uncertain ground truth, I also got to learn more about the research at PRECISE through post-doc and student presentations. In this article, I want to share the corresponding slides. Abstract For safety, AI systems in health undergo thorough evaluations…
Read More
We Stood on Both Sides of the New York–Dublin Portal and It Was Glorious

We Stood on Both Sides of the New York–Dublin Portal and It Was Glorious

Amanda: I got to the Portal in Manhattan’s Flatiron District a little before 11 am New York time, and found that there’s now a fence keeping people several feet away from it (but the same isn’t happening in Dublin). This is part of the new security the organizers have implemented: If someone steps on the Portal or blocks the camera, the livestream will blur for both sides, organizers say. For the next hour, a steady stream of people stopped by the Portal, with usually about 30 there at any time. They waved, they smiled, they danced YMCA and the Macarena…
Read More
Data Machina #248

Data Machina #248

Jailbreaking AI Models: It’s easy. Hundreds of millions of dollars have been thrown at AI Safety & Alignment over the years. Despite that, jailbreaking LLMs in April 2024 is easy. Oddly enough, as the LLM models become more capable and sophisticated, the jailbreaking attacks are becoming easier to perform, more effective, and frequent. Gary Marcus - who is hypercritical about LLMs and current AI trends- just published this very opinionated post: An unending array of jailbreaking attacks could be the death of LLMs.I often speak to colleagues and clients about the “LLM jailbreaking elephant in the room.” And they all…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.