Viral News

Onehouse Breaks Data Catalog Lock-In with More Openness

Onehouse Breaks Data Catalog Lock-In with More Openness

(Majcot/Shutterstock) Onehouse, the Apache Hudi-backer that bills itself as the most open data platform in the world, further opened up its platform today with the launch of a data catalog synchronization feature that streamlines user access to data residing in major cloud platforms. The feature complements the company’s investment in developing XTable, an open-source offering that delivers read-write interoperability among Hudi, Delta, and Apache Iceberg table formats. The advent of open table formats like Hudi, Delta, and Iceberg revolutionized data openness by enabling multiple query engines access the same piece of data without fear of data corruption. As the key…
Read More
Data Machina #246

Data Machina #246

New Trends in Vision-Language Models (VLMs.) The evolution of VLMs in recent months has been pretty impressive. Today VLMs exhibit some amazing capabilities. See the two links below on what VLMs can do and how they work:But still VLMs are facing some challenges for example in terms of: multimodal training datasets, resolution, long-form modality, vision-language integration, and concept understanding. Somewhat along those lines, I see 5 trends happening in VLMs: 1) VLMs run on local environment 2) Emerging VLM videoagents 3) Unified structure learning for VLMs 4) Personalisation of VLMs and 5) Fixing the VLM resolution curse. Let’s see…VLMs on…
Read More
AI vs. Humanity: Who Will Come Out on Top?

AI vs. Humanity: Who Will Come Out on Top?

The below is a summary of my recent article on superintelligence. Elon Musk predicts that Artificial Superintelligence (ASI) will emerge by 2025, much earlier than his previous estimates. While Musk's track record with predictions is mixed, this one sparks serious contemplation about the future. The moment AI surpasses human cognitive abilities, known as the singularity, will usher in a new era with both unprecedented possibilities and profound perils. As we edge closer to this event horizon, it's essential to ask if we are prepared to navigate the uncertainties and harness the potential of AI responsibly. The journey towards ASI has…
Read More
Delta Sharing: Secure End-to-End Data Sharing Solution

Delta Sharing: Secure End-to-End Data Sharing Solution

In today's digital landscape, secure data sharing is critical to operational efficiency and innovation. Databricks and the Linux Foundation developed Delta Sharing as the first open source approach to data sharing across data, analytics and AI.  Databricks provides secure data exchange, facilitating seamless sharing across platforms, clouds and regions. Enterprises of all sizes trust Delta Sharing, which supports a broad spectrum of applications and diverse data formats. This flexibility makes it a reliable tool for organizations seeking to harness the full potential of their data assets.In this blog, we will review Delta Sharing's security architecture through three different sharing scenarios—…
Read More
Multiscale Vision Transformer for Video Recognition

Multiscale Vision Transformer for Video Recognition

Vision transformers are already good at multiple tasks like image recognition, object detection, and semantic segmentation. However, we can also apply them to data with temporal information like videos. One such use case is using Vision Transformers for video classification. To this end, in this article, we will go over the important parts of the Multiscale Vision Transformer (MViT) paper and also carry out inference using the pretraining model. Figure 1. An example output after passing a bowling video through the Multiscale Vision Transformer model. Although there are several models for this, the Multiscale Vision Transformer model stands out for…
Read More
Snowflake Looks to AI to Bolster Growth

Snowflake Looks to AI to Bolster Growth

(Michael Vi/Shutterstock) Investors in Snowflake breathed a sigh of relief this week when the cloud data warehouser reported solid revenue growth for its first quarter and raised its guidance for the rest of the year. But questions still remain over its long-term growth, which the company is hoping that artificial intelligence will power. The company’s acquisition this week of assets of TruEra fits that mold. Snowflake on Wednesday reported $829 million in total GAAP revenues for the quarter ended April 30, 2024, representing a 33% increase over the same period last year. It reported 14 cents per share, which was…
Read More
Data Machina #247

Data Machina #247

The New Breed of Open Mixture-of-Experts (MoE) Models. In a push to beat the closed-box AI models from the AI Titans, many startups and research orgs have embarked in releasing open MoE-based models. These new breed of MoE-based models introduce many clever architectural tricks, and seek to balance training cost efficiency, output quality, inference performance and much more. For an excellent introduction to MoEs, checkout this long post by the Hugging Face team: Mixture of Experts ExplainedWe’re starting to see several open MoE-based models achieving near-SOTA or SOTA performance as compared to e.g. OpenAI GPT-4 and Google Gemini 1.5 Pro.…
Read More
The Role of AI in Big Data Quality Management

The Role of AI in Big Data Quality Management

In the realm of big data quality management, the convergence of AI technologies has opened up avenues for unparalleled levels of data accuracy and reliability. By harnessing the power of artificial intelligence, organizations can now automate the process of detecting and correcting errors in massive datasets with unprecedented speed and efficiency. Through advanced machine learning algorithms, AI systems can continuously learn from data patterns, enhancing their ability to identify inconsistencies and anomalies that might have otherwise gone unnoticed by human analysts. AI-driven big data quality management solutions offer a proactive approach to maintaining data integrity by predicting potential issues before…
Read More
Shaping the Future With Data and AI: Announcing the 2024 Databricks GenAI Innovation Award Finalists

Shaping the Future With Data and AI: Announcing the 2024 Databricks GenAI Innovation Award Finalists

The annual Data Team Awards showcase the remarkable efforts of top global enterprise data teams committed to tackling some of today's toughest business challenges.This year, we received more than 200 nominations across six categories, from companies representing a diverse array of industries and regions. In the lead-up to the Data + AI Summit, we'll showcase the finalists from each category, highlighting those pioneering the advances in data and AI.New this year, the GenAI Award represents the widespread enterprise adoption of large language models (LLMs). As LLMs transform industries by enhancing productivity, personalizing user experiences, and opening up new possibilities in…
Read More
Exploring Dark Knowledge under Various Teacher Capacities and Addressing Capacity Mismatch

Exploring Dark Knowledge under Various Teacher Capacities and Addressing Capacity Mismatch

arXiv:2405.13078v1 Announce Type: new Abstract: Knowledge Distillation (KD) could transfer the ``dark knowledge" of a well-performed yet large neural network to a weaker but lightweight one. From the view of output logits and softened probabilities, this paper goes deeper into the dark knowledge provided by teachers with different capacities. Two fundamental observations are: (1) a larger teacher tends to produce probability vectors that are less distinct between non-ground-truth classes; (2) teachers with different capacities are basically consistent in their cognition of relative class affinity. Abundant experimental studies verify these observations and in-depth empirical explanations are provided. The difference in dark…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.