GenAI

Pile-T5

Pile-T5

The T5 model (Raffel et al, 2019) is widely used in the NLP community. Its base model has been downloaded from Hugging Face millions of times, leaving no doubt that these models are a favorite of the community. However, T5's tokenizer omits important code-related tokens and subsequent pretraining datasets have been released with higher quality filtering and more diverse domains. In this blog post, we introduce a new version of T5 intended to address those weaknesses: Pile-T5, trained on the Pile (Gao et al, 2020) and using the LLaMA tokenizer (Touvron et al, 2023). Model Description# Our alternative version replaces…
Read More
Scaling ML Experiments With neptune.ai and Kubernetes

Scaling ML Experiments With neptune.ai and Kubernetes

Scaling machine learning (ML) experiments is a challenging process that requires efficient resource management, experiment tracking, and infrastructure scalability. neptune.ai offers a centralized platform to manage ML experiments, track real-time model performance, and store metadata. Kubernetes automates container orchestration, improves resource utilization, and enables horizontal and vertical scalability. Combining neptune.ai and Kubernetes provides a robust solution for scaling ML experiments, making it easier to manage and scale experiments across multiple environments and team members. Scaling machine-learning experiments efficiently is a challenge for ML teams. The complexity lies in managing configurations, launching experiment runs, tracking their outcomes, and optimizing resource allocation.…
Read More

FAQ for our Monte Carlo Conformal Prediction • David Stutz

Over the past months, I have given several talks about Monte Carlo conformal prediction and the problem of calibrating with uncertain ground truth, for example, stemming from annotator disagreement. Each time, the audience had great questions and ideas for extensions and interesting applications. In this article, I want to provide a sort of FAQ for our work. Are code and data available? Yes, code and data are on GitHub. Code includes both Monte Carlo conformal prediction as well as the plausibility regions from v1 of the paper. Can you derive the conformal $p$-values used in the paper? The connection of…
Read More
Brazilian data and AI consultancy Indicium raises $40M for US expansion – SiliconANGLE

Brazilian data and AI consultancy Indicium raises $40M for US expansion – SiliconANGLE

Brazilian data and artificial intelligence consultancy startup Indicium today announced that it had raised $40 million in new funding to expand its operations in the U.S. Founded in 2017, Indicium helps enterprises become “data-driven” using modern data stacks such as those provided by Amazon Web Services Inc., Databricks Inc., Google Cloud and Snowflake Inc. The company can support organizations at every stage of their journey, from strategy and execution to training, so they can become data-driven and AI-enabled. Indicium has successfully implemented its framework in more than 120 projects and deployed more than 200 AI and machine learning models in…
Read More
Reviving the Classics: Why Convolutional Models Still Shine in the Age of Transformers.

Reviving the Classics: Why Convolutional Models Still Shine in the Age of Transformers.

The rise of Transformer models has taken the machine learning world by storm, overshadowing many other techniques that have proven their worth over time.While transformers are powerful, it’s important to recognize that they are not the one-size-fits-all solution for every problem.Convolutional Neural Networks (CNNs), for instance, remain highly effective for those involving image and spatial data.We have a scenario where we need to classify medical X-ray images to determine whether a bone is broken or not. In such tasks, convolutional neural networks shine due to their ability to learn spatial hierarchies from images.TensorFlow Hub provides access to a wide range…
Read More
Dell AI Laptops Will Be Powered By Next-Gen Qualcomm Chips

Dell AI Laptops Will Be Powered By Next-Gen Qualcomm Chips

AI partnerships took top billing at Dell Technologies World 2024, held in Las Vegas from May 20 to May 23. Major news from the conference so far included: Five new AI-capable laptops. More integrations between NVIDIA and Dell’s AI Factory, Dell’s AI enablement program. New partnerships with Hugging Face, Meta and Microsoft. Dell reveals AI capabilities on XPS, Latitude and Inspiron Laptops New Dell PCs are getting in on the generative AI boom, with the Qualcomm Snapdragon X Series processor coming to five new models: XPS 13, coming later this year with preorders beginning May 20 in the U.S.. The…
Read More
Large language models can’t effectively recognize users’ motivation, but can support behavior change for those ready to act

Large language models can’t effectively recognize users’ motivation, but can support behavior change for those ready to act

Large language model-based chatbots have the potential to promote healthy changes in behavior. But researchers from the ACTION Lab at the University of Illinois Urbana-Champaign have found that the artificial intelligence tools don't effectively recognize certain motivational states of users and therefore don't provide them with appropriate information. Michelle Bak, a doctoral student in information sciences, and information sciences professor Jessie Chin reported their research in the Journal of the American Medical Informatics Association. Large language model-based chatbots -- also known as generative conversational agents -- have been used increasingly in healthcare for patient education, assessment and management. Bak and…
Read More
Indian Voters Are Being Bombarded With Millions of Deepfakes. Political Candidates Approve

Indian Voters Are Being Bombarded With Millions of Deepfakes. Political Candidates Approve

On a stifling April afternoon in Ajmer, in the Indian state of Rajasthan, local politician Shakti Singh Rathore sat down in front of a greenscreen to shoot a short video. He looked nervous. It was his first time being cloned.Wearing a crisp white shirt and a ceremonial saffron scarf bearing a lotus flower—the logo of the BJP, the country’s ruling party—Rathore pressed his palms together and greeted his audience in Hindi. “Namashkar,” he began. “To all my brothers—”Before he could continue, the director of the shoot walked into the frame. Divyendra Singh Jadoun, a 31-year-old with a bald head and…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.