stp2y

29877 Posts
Onehouse Breaks Data Catalog Lock-In with More Openness

Onehouse, the Apache Hudi-backer that bills itself as the most open data platform in the world, further opened up its platform today with the launch of a data catalog synchronization feature that streamlines user access to data residing in major cloud platforms. The feature complements the company’s investment in developing XTable, an open-source offering that delivers read-write interoperability among Hudi, Delta, and Apache Iceberg table formats. The advent of open table formats like Hudi, Delta, and Iceberg revolutionized data openness by enabling multiple query engines to access the same piece of data without fear of data corruption. As the key…

Revolutionary weight-loss drugs like Wegovy come with a catch

This article is an installment of Future Explored, a weekly guide to world-changing technology. Anti-obesity drugs cause people to lose more than just fat. More than 73% of American adults are overweight, according to the CDC. This puts them at increased risk of death and many serious health issues, but losing weight and keeping it off through diet changes and exercise, the standard approach, is notoriously difficult. That made the FDA’s 2021 approval of Novo Nordisk’s semaglutide (Wegovy) as an obesity treatment seem like something of…

Research Papers in Oct 2023: A Potential Successor to RLHF for Efficient LLM Alignment and the Resurgence of CNNs

From Vision Transformers to innovative large language model finetuning techniques, the AI community has been very active with lots of interesting research this past month. Here's a snapshot of the highlights I am covering in this article: In the paper ConvNets Match Vision Transformers at Scale, Smith et al. invest significant computational resources to conduct a thorough comparison between Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), challenging the prevailing notion that ViTs outperform CNNs in image classification tasks. The Mistral 7B paper introduces a compact yet powerful language model that, despite its relatively modest size of 7 billion parameters, outperforms its larger…

If Scarlett Johansson can’t bring the AI firms to heel, what hope for the rest of us? | John Naughton

On Monday 13 May, OpenAI livestreamed an event to launch a fancy new product – a large language model (LLM) dubbed GPT-4o – that the company’s chief technology officer, Mira Murati, claimed to be more user-friendly and faster than boring ol’ ChatGPT. It was also more versatile, and multimodal, which is tech-speak for being able to interact in voice, text and vision. Key features of the new model, we were told, were that you could interrupt it in mid-sentence, that it had very low latency (delay in responding) and that it was sensitive to the user’s emotions. Viewers were then treated…

ArXiv Pre-Print “Evaluating AI Systems under Uncertain Ground Truth: a Case Study in Dermatology” • David Stutz

In supervised machine learning, we usually assume access to ground truth labels for evaluation. In many applications, however, these ground truth labels are derived from expert opinions. Disagreement among these experts is typically ignored by using simple majority voting or averaging. Unfortunately, this can have severe consequences by overestimating performance or misguiding model selection. In our work presented in this article, we tackle this problem by introducing a statistical framework for aggregating expert opinions. Abstract: For safety, AI systems in health undergo thorough evaluations before deployment, validating…

Data Machina #246

New Trends in Vision-Language Models (VLMs). The evolution of VLMs in recent months has been pretty impressive. Today VLMs exhibit some amazing capabilities. See the two links below on what VLMs can do and how they work. But VLMs still face some challenges, for example in terms of multimodal training datasets, resolution, long-form modality, vision-language integration, and concept understanding. Somewhat along those lines, I see 5 trends happening in VLMs: 1) VLMs running in local environments 2) Emerging VLM video agents 3) Unified structure learning for VLMs 4) Personalisation of VLMs and 5) Fixing the VLM resolution curse. Let’s see… VLMs on…

The AI Revolution Will Not Be Monopolized: Behind the scenes

[Figure: FabNER, Claude 2 accuracy versus number of examples; marked point: 20 examples]

CoNLL 2003: Named Entity Recognition

System          F-Score   Speed (words/s)
GPT-3.5 (1)     78.6      < 100
GPT-4 (1)       83.5      < 100
spaCy           91.6      4,000
Flair           93.1      1,000
SOTA 2023 (2)   94.6      1,000
SOTA 2003 (3)   88.8      > 20,000

1. Ashok and Lipton (2023), 2. Wang et al. (2021), 3. Florian et al. (2003)

Slide annotations: SOTA on few-shot prompting; RoBERTa-base

Viruses are doing mysterious things everywhere – AI can help researchers understand what they’re up to in the oceans and in your gut

Viruses are a mysterious and poorly understood force in microbial ecosystems. Researchers know they can infect, kill and manipulate human and bacterial cells in nearly every environment, from the oceans to your gut. But scientists don’t yet have a full picture of how viruses affect their surrounding environments in large part because of their extraordinary diversity and ability to rapidly evolve. Communities of microbes are difficult to study in a laboratory setting. Many microbes are challenging to cultivate, and their natural environment has many more features influencing their success or failure than scientists can replicate in a lab. So systems…

PeerDB raises $3.6M to accelerate PostgreSQL data movement – SiliconANGLE

Startup PeerDB Inc., which has built a data movement platform specifically for open-source PostgreSQL database management systems, said today it has closed on a seed funding round worth $3.6 million. Leading the investment was 8VC, and it was joined by Y Combinator, Wayfinder Ventures, Webb Investment Network, Flex Capital, Rogue Capital, Pioneer Fund, Orange Collective and several angel investors. PeerDB says its data movement platform is designed to improve on the capabilities of existing data pipelines that were never designed for Postgres databases. Existing data movement and extract, transform and load (ETL) tools often prioritize the sheer number of connectors…

Coding With Devin: My New AI Programming Agent

We onboarded four new engineers at Every this week. The onboarding process was what you'd expect. I gave them access to our GitHub account so they could download our main code repos. I was peppered with the usual…