Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

arXiv:2411.14318v1 Announce Type: new Abstract: It is well-known that a diverse corpus is critical for training large language models, which are typically constructed from a mixture of various domains. In general, previous efforts resort to sampling training data from different domains with static proportions, as well as adjusting data proportions during training. However, few methods have addressed the complexities of domain-adaptive continual pre-training. To fill this gap, we propose Velocitune, a novel framework dynamically assesses learning velocity and adjusts data proportions accordingly, favoring slower-learning domains while shunning faster-learning ones, which is guided by a scaling law to indicate the desired…
Read More
SatFed: A Resource-Efficient LEO Satellite-Assisted Heterogeneous Federated Learning Framework

SatFed: A Resource-Efficient LEO Satellite-Assisted Heterogeneous Federated Learning Framework

[Submitted on 20 Sep 2024 (v1), last revised 21 Nov 2024 (this version, v3)] View a PDF of the paper titled SatFed: A Resource-Efficient LEO Satellite-Assisted Heterogeneous Federated Learning Framework, by Yuxin Zhang and 7 other authors View PDF HTML (experimental) Abstract:Traditional federated learning (FL) frameworks rely heavily on terrestrial networks, where coverage limitations and increasing bandwidth congestion significantly hinder model convergence. Fortunately, the advancement of low-Earth orbit (LEO) satellite networks offers promising new communication avenues to augment traditional terrestrial FL. Despite this potential, the limited satellite-ground communication bandwidth and the heterogeneous operating environments of ground devices-including variations in data,…
Read More
Delta-Influence: Unlearning Poisons via Influence Functions

Delta-Influence: Unlearning Poisons via Influence Functions

arXiv:2411.13731v1 Announce Type: new Abstract: Addressing data integrity challenges, such as unlearning the effects of data poisoning after model training, is necessary for the reliable deployment of machine learning models. State-of-the-art influence functions, such as EK-FAC, often fail to accurately attribute abnormal model behavior to the specific poisoned training data responsible for the data poisoning attack. In addition, traditional unlearning algorithms often struggle to effectively remove the influence of poisoned samples, particularly when only a few affected examples can be identified. To address these challenge, we introduce $Delta$-Influence, a novel approach that leverages influence functions to trace abnormal model behavior…
Read More
Schema-Driven Information Extraction from Heterogeneous Tables

Schema-Driven Information Extraction from Heterogeneous Tables

[Submitted on 23 May 2023 (v1), last revised 20 Nov 2024 (this version, v5)] View a PDF of the paper titled Schema-Driven Information Extraction from Heterogeneous Tables, by Fan Bai and 5 other authors View PDF HTML (experimental) Abstract:In this paper, we explore the question of whether large language models can support cost-efficient information extraction from tables. We introduce schema-driven information extraction, a new task that transforms tabular data into structured records following a human-authored schema. To assess various LLM's capabilities on this task, we present a benchmark comprised of tables from four diverse domains: machine learning papers, chemistry literature,…
Read More
Black Friday headphone deals include the latest Bose QuietComfort model on sale for $199

Black Friday headphone deals include the latest Bose QuietComfort model on sale for $199

The newest version of Bose’s QuietComfort headphones are on sale via Amazon for just $199. This ties a record-low price, as these headphones typically cost $350. All told, the early Black Friday sale represents a discount of 43 percent. Most colorways are included with this deal, so have at it.A version of these cans made our list of the best wireless headphones, so there’s plenty to recommend. The battery life is fantastic, lasting around 24 hours on a single charge. There’s also a quick charge feature, which can squeeze two hours of additional use with just 15 minutes at the…
Read More
Rat in the Maze Problem and Backtracking:

Rat in the Maze Problem and Backtracking:

A Comprehensive OverviewThe Rat in the Maze Problem is a classic combinatorial challenge that demonstrates the elegance and efficiency of backtracking algorithms. It involves finding a path for a rat from the start position to the destination in a maze, represented as a grid. The rat can move in specific directions (e.g., up, down, left, or right), and certain cells may be blocked. The problem emphasizes systematically exploring paths while backtracking when a dead end is encountered. This foundational problem is widely studied in computer science for its simplicity and real-world relevance. Understanding the Algorithm:The Rat in the Maze Problem…
Read More
Predictive Maintenance Study for High-Pressure Industrial Compressors: Hybrid Clustering Models

Predictive Maintenance Study for High-Pressure Industrial Compressors: Hybrid Clustering Models

[Submitted on 21 Nov 2024] View a PDF of the paper titled Predictive Maintenance Study for High-Pressure Industrial Compressors: Hybrid Clustering Models, by Alessandro Costa and 4 other authors View PDF HTML (experimental) Abstract:This study introduces a predictive maintenance strategy for high pressure industrial compressors using sensor data and features derived from unsupervised clustering integrated into classification models. The goal is to enhance model accuracy and efficiency in detecting compressor failures. After data pre processing, sensitive clustering parameters were tuned to identify algorithms that best capture the dataset's temporal and operational characteristics. Clustering algorithms were evaluated using quality metrics like…
Read More
Exosense: A Vision-Based Scene Understanding System For Exoskeletons

Exosense: A Vision-Based Scene Understanding System For Exoskeletons

[Submitted on 21 Mar 2024 (v1), last revised 21 Nov 2024 (this version, v2)] View a PDF of the paper titled Exosense: A Vision-Based Scene Understanding System For Exoskeletons, by Jianeng Wang and 7 other authors View PDF HTML (experimental) Abstract:Self-balancing exoskeletons are a key enabling technology for individuals with mobility impairments. While the current challenges focus on human-compliant hardware and control, unlocking their use for daily activities requires a scene perception system. In this work, we present Exosense, a vision-centric scene understanding system for self-balancing exoskeletons. We introduce a multi-sensor visual-inertial mapping device as well as a navigation stack…
Read More
Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models

Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.