stp2y

32846 Posts
Characterizing Prompt Compression Methods for Long Context Inference

Characterizing Prompt Compression Methods for Long Context Inference

arXiv:2407.08892v1 Announce Type: new Abstract: Long context inference presents challenges at the system level with increased compute and memory requirements, as well as from an accuracy perspective in being able to reason over long contexts. Recently, several methods have been proposed to compress the prompt to reduce the context length. However, there has been little work on comparing the different proposed methods across different tasks through a standardized analysis. This has led to conflicting results. To address this, here we perform a comprehensive characterization and evaluation of different prompt compression methods. In particular, we analyze extractive compression, summarization-based abstractive compression,…
Read More
Generalizable Physics-informed Learning for Stochastic Safety-critical Systems

Generalizable Physics-informed Learning for Stochastic Safety-critical Systems

arXiv:2407.08868v1 Announce Type: new Abstract: Accurate estimate of long-term risk is critical for safe decision-making, but sampling from rare risk events and long-term trajectories can be prohibitively costly. Risk gradient can be used in many first-order techniques for learning and control methods, but gradient estimate is difficult to obtain using Monte Carlo (MC) methods because the infinitesimal devisor may significantly amplify sampling noise. Motivated by this gap, we propose an efficient method to evaluate long-term risk probabilities and their gradients using short-term samples without sufficient risk events. We first derive that four types of long-term risk probability are solutions of…
Read More
X is in hot water in the EU over blue checkmarks and ads

X is in hot water in the EU over blue checkmarks and ads

is the latest notable tech company to land in trouble with the . The , the bloc’s executive arm, has of an investigation. It claims that X has violated the in a number of ways.The platform’s approach to paid verification has come into the EU’s crosshairs. Officials say that the practice “does not correspond to industry practice and deceives users.” It added that, since anyone can pay to get a blue checkmark, it’s difficult for folks to determine the authenticity of accounts (a can tell you which accounts are verified because of notability and which paid for a checkmark). The…
Read More
Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets

Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
Use Guardrails to prevent hallucinations in generative AI applications

Use Guardrails to prevent hallucinations in generative AI applications

With Contextual grounding check, you can prevent hallucinations by detecting irrelevant and ungrounded LLM responses. Guardrails for Amazon Bedrock enables you to implement safeguards for your generative AI applications based on your use cases and responsible AI policies. You can create multiple guardrails tailored to different use cases and apply them across multiple foundation models (FM), providing a consistent user experience and standardizing safety and privacy controls across generative AI applications. Until now, Guardrails supported four policies - denied topics, content filters, sensitive information filters, and word filters. The Contextual grounding check policy (the latest one added at the time…
Read More
Disney has a big problem. It’s running out of kids.

Disney has a big problem. It’s running out of kids.

When Bob Iger returned to Disney in late 2022 for his second tour as CEO, the company was in dire straits. It had just reported poor earnings, was scrambling from unpopular business moves, and was left reeling from previous CEO Bob Chapek's bad press.Just a year later, Iger began to put the Mouse House back in order: He delivered a strong earnings report in February, announced partnerships with Epic Games and Taylor Swift, and trumpeted a sports streaming platform. "We have entered a new era," Iger effused to investors during his February earnings call, a nod to the news that…
Read More
Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models

Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models

[Submitted on 11 Jul 2024] View a PDF of the paper titled Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models, by Mohammadreza Tayaranian and 4 other authors View PDF Abstract:Transformer-based language models have shown state-of-the-art performance on a variety of natural language understanding tasks. To achieve this performance, these models are first pre-trained on general corpus and then fine-tuned on downstream tasks. Previous work studied the effect of pruning the training set of the downstream tasks on the performance of the model on its evaluation set. In this work, we propose an automatic dataset pruning method for the training…
Read More
Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based Models

Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based Models

[Submitted on 11 Jul 2024] View a PDF of the paper titled Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based Models, by Daniela de Albuquerque and John Pearson View PDF HTML (experimental) Abstract:Beyond estimating parameters of interest from data, one of the key goals of statistical inference is to properly quantify uncertainty in these estimates. In Bayesian inference, this uncertainty is provided by the posterior distribution, the computation of which typically involves an intractable high-dimensional integral. Among available approximation methods, sampling-based approaches come with strong theoretical guarantees but scale poorly to large problems, while variational approaches scale well but offer few…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.