Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement

arXiv:2408.03092v1 Announce Type: new Abstract: Merging Large Language Models (LLMs) aims to amalgamate multiple homologous LLMs into one with all the capabilities. Ideally, any LLMs sharing the same backbone should be mergeable, irrespective of whether they are Fine-Tuned (FT) with minor parameter changes or Pre-Trained (PT) with substantial parameter shifts. However, existing methods often manually assign the model importance, rendering them feasible only for LLMs with similar parameter alterations, such as multiple FT LLMs. The diverse parameter changed ranges between FT and PT LLMs pose challenges for current solutions in empirically determining the optimal combination. In this paper, we make…
Read More
Warner Bros. Takes $9.1 Billion Writedown on TV Networks

Warner Bros. Takes $9.1 Billion Writedown on TV Networks

Warner Bros. Discovery Inc., the parent of CNN and TNT, posted a second-quarter charge of $9.1 billion after writing down the value of its traditional TV networks.The company, created in 2022 when Discovery Inc. acquired WarnerMedia, has concluded that cable channels like CNN and TNT are no longer worth what they were when the $42 billion merger was completed. Source link lol
Read More
OpenAI quietly releases GPT-4o update amid leadership turmoil

OpenAI quietly releases GPT-4o update amid leadership turmoil

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has quietly rolled out an improved version of its GPT-4o language model, slashing costs by half while boosting performance. This stealth update comes as the AI powerhouse faces a exodus of top talent and fierce competition in the rapidly evolving artificial intelligence landscape. The update arrives just days after OpenAI co-founder John Schulman announced his departure to join rival Anthropic, while company president Greg Brockman began an extended sabbatical. OAI quietly released a new model today!!The new GPT-4o is slightly…
Read More
How to eat like an ancient Olympic athlete: figs, feta, and liver

How to eat like an ancient Olympic athlete: figs, feta, and liver

Some athletes competing at the 2024 Paris Olympics have called the food situation "a disaster." The UK team even flew in its own chef to try and improve the protein offerings.Athletes' cuisine has long been a concern at the Olympic Games. After a trip to Greece last year, YouTube star Max Miller wanted to recreate a meal fit for an ancient Olympian for his YouTube series "Tasting History," where he cooks historical recipes.Sometimes the diets of ancient Olympians were vegetarian, but Miller's recreation added a little variety with a recipe for calf liver from around the ancient Olympic era. He…
Read More
Nextdoor Projects Improved Revenue Growth Amid Product Overhaul

Nextdoor Projects Improved Revenue Growth Amid Product Overhaul

Nextdoor Holdings Inc. reported second-quarter revenue that beat analysts’ estimates and projected stronger-than-expected sales growth, pointing to improvements in the company’s advertising technology as a key driver of sales. Chief Executive Officer Nirav Tolia also announced plans for a “complete transformation” of Nextdoor’s core social network to increase usage and turn the neighborhood-focused platform into a service that people are drawn toward daily, not just for key life moments. Source link lol
Read More
Spatial-temporal Graph Convolutional Networks with Diversified Transformation for Dynamic Graph Representation Learning

Spatial-temporal Graph Convolutional Networks with Diversified Transformation for Dynamic Graph Representation Learning

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
AI Is Now A Core Capability Of BI Platforms

AI Is Now A Core Capability Of BI Platforms

Scientists estimate that insects account for 80% of all animal life on Earth. It’s no wonder you can’t walk even one foot into a forest without stepping on something crawling or being overwhelmed with buzzing and biting insects. While I am not ready to say that AI based functionality accounts for 80% of a typical enterprise BI platform, it’s increasingly hard to find a platform that doesn’t have AI capabilities. Specifically, in business intelligence (BI) platforms AI based functionality is now responsible for: One-click advanced analytics (predictions, anomaly detection, top influencers, etc.) Conversational interaction with data – natural language to…
Read More
Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection

Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
Enhancing Complex Causality Extraction via Improved Subtask Interaction and Knowledge Fusion

Enhancing Complex Causality Extraction via Improved Subtask Interaction and Knowledge Fusion

arXiv:2408.03079v1 Announce Type: new Abstract: Event Causality Extraction (ECE) aims at extracting causal event pairs from texts. Despite ChatGPT's recent success, fine-tuning small models remains the best approach for the ECE task. However, existing fine-tuning based ECE methods cannot address all three key challenges in ECE simultaneously: 1) Complex Causality Extraction, where multiple causal-effect pairs occur within a single sentence; 2) Subtask~ Interaction, which involves modeling the mutual dependence between the two subtasks of ECE, i.e., extracting events and identifying the causal relationship between extracted events; and 3) Knowledge Fusion, which requires effectively fusing the knowledge in two modalities, i.e.,…
Read More
US forces on Guam are facing a Chinese missile threat unlike anything else and need more air defenses with deeper magazines, Army officials say

US forces on Guam are facing a Chinese missile threat unlike anything else and need more air defenses with deeper magazines, Army officials say

Guam faces a substantial threat from China's massive missile arsenal, and US Army officials say more air defense capabilities are desperately needed.Efforts to defend this strategic US territory in the Pacific from a barrage in the event of war are underway, but Army leaders say one of the biggest challenges is fielding integrated systems with deeper magazines to stop air and missile attacks.At a Center for Strategic and International Studies panel on the defense of Guam late last month, Army Brig. Gen. Frank Lozano, the program executive officer for the US Army's Missiles and Space program, said that he'd called…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.