stp2y

25666 Posts
Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model

Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model

arXiv:2412.02343v1 Announce Type: new Abstract: In social media, neural network models have been applied to hate speech detection, sentiment analysis, etc., but neural network models are susceptible to adversarial attacks. For instance, in a text classification task, the attacker elaborately introduces perturbations to the original texts that hardly alter the original semantics in order to trick the model into making different predictions. By studying textual adversarial attack methods, the robustness of language models can be evaluated and then improved. Currently, most of the research in this field focuses on English, and there is also a certain amount of research on…
Read More
Artificial Intelligence in manufacturing

Artificial Intelligence in manufacturing

In recent years, artificial intelligence has transformed from an aspirational technology to a driver of manufacturing innovation and efficiency. Understanding both the current landscape and future potential of AI in manufacturing has become essential for strategic decision-making. Recent research shows the manufacturing sector generates over 1,800 petabytes of data annually—more than any other industry—creating both opportunities and challenges for AI implementation.How AI is used in manufacturing todayThe use of AI in manufacturing is accelerating rapidly, with 41 percent of industry executives planning to increase their data and AI spending by more than 25 percent in the coming year, according to…
Read More
Israel showed the ‘power’ of F-35s in destroying nearly all of Iran’s air defenses without a loss, UK admiral says

Israel showed the ‘power’ of F-35s in destroying nearly all of Iran’s air defenses without a loss, UK admiral says

The UK's top military officer confirmed Israel used F-35s during its October strikes against Iran.Adm. Tony Radakin said the operation demonstrated the "power" of fifth-generation aircraft.His remarks come after Elon Musk criticized the F-35 and its manufacturer, Lockheed Martin.Israel showed the "power" of the F-35 stealth fighter jet during its late October retaliatory strikes against Iran, Britain's top military officer said on Wednesday.Adm. Tony Radakin, the UK's chief of defense staff, disclosed that Israel used its F-35s to carry out the widespread October 26 strikes against military sites across Iran, including air-defense systems and missile-manufacturing facilities.It appeared to mark the…
Read More
Free Process Rewards without Process Labels

Free Process Rewards without Process Labels

arXiv:2412.01981v1 Announce Type: new Abstract: Different from its counterpart outcome reward models (ORMs), which evaluate the entire responses, a process reward model (PRM) scores a reasoning trajectory step by step, providing denser and more fine grained rewards. However, training a PRM requires labels annotated at every intermediate step, presenting significant challenges for both manual and automatic data collection. This paper aims to address this challenge. Both theoretically and empirically, we show that an textit{implicit PRM} can be obtained at no additional cost, by simply training an ORM on the cheaper response-level labels. The only assumption is to parameterize the outcome…
Read More
Multi-student Diffusion Distillation for Better One-step Generators

Multi-student Diffusion Distillation for Better One-step Generators

[Submitted on 30 Oct 2024 (v1), last revised 3 Dec 2024 (this version, v2)] View a PDF of the paper titled Multi-student Diffusion Distillation for Better One-step Generators, by Yanke Song and 4 other authors View PDF HTML (experimental) Abstract:Diffusion models achieve high-quality sample generation at the cost of a lengthy multistep inference procedure. To overcome this, diffusion distillation techniques produce student generators capable of matching or surpassing the teacher in a single step. However, the student model's inference speed is limited by the size of the teacher architecture, preventing real-time generation for computationally heavy applications. In this work, we…
Read More
Microsoft confirms the Windows 11 TPM security requirement isn’t going anywhere

Microsoft confirms the Windows 11 TPM security requirement isn’t going anywhere

With the end date for Windows 10 less than a year away, people still using that operating system will need to start preparing to enter the Windows 11 era. And Microsoft is placing a hardware requirement on the current OS that could pose a problem for those of us using older machines.Windows 11 will require computers to have TPM 2.0. Also known as a Trusted Platform Module, this is a dedicated chip or firmware used for device security, and the 2.0 version offers several useful features for improved cryptography and encryption. A from Microsoft outlines all of the benefits and…
Read More

Doctor Who showrunners warn AI scripts will ‘eat their own tail’

One of the masterminds behind Doctor Who has warned that the more AI content is used for creative purposes the worse its output will be because it “eats its own tail”.Ahead of the Doctor Who Christmas special, eagerly awaited by fans as a centrepiece of BBC1’s festive schedule, Steven Moffat made the comments in discussion with fellow showrunner Russell T Davies.“Human beings are amazingly cheap, we’re knocking out human beings every day. And unlike anything else in history, the more we use it, the less good it is,” Moffat told the Radio Times. “Because the more content that is out…
Read More
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

arXiv:2412.02611v1 Announce Type: cross Abstract: Recently, multimodal large language models (MLLMs), such as GPT-4o, Gemini 1.5 Pro, and Reka Core, have expanded their capabilities to include vision and audio modalities. While these models demonstrate impressive performance across a wide range of audio-visual applications, our proposed DeafTest reveals that MLLMs often struggle with simple tasks humans find trivial: 1) determining which of two sounds is louder, and 2) determining which of two sounds has a higher pitch. Motivated by these observations, we introduce AV-Odyssey Bench, a comprehensive audio-visual benchmark designed to assess whether those MLLMs can truly understand the audio-visual information.…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.