Viral News

3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning

3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning

[Submitted on 19 Aug 2024 (v1), last revised 19 Nov 2024 (this version, v3)] View a PDF of the paper titled 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning, by Haoyu Zhao and 3 other authors View PDF HTML (experimental) Abstract:Existing approaches for human avatar generation--both NeRF-based and 3D Gaussian Splatting (3DGS) based--struggle with maintaining 3D consistency and exhibit degraded detail reconstruction, particularly when training with sparse inputs. To address this challenge, we propose CHASE, a novel framework that achieves dense-input-level performance using only sparse inputs through two key innovations: cross-pose intrinsic 3D consistency supervision and…
Read More
Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control

Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
Machine learned reconstruction of tsunami dynamics from sparse observations

Machine learned reconstruction of tsunami dynamics from sparse observations

arXiv:2411.12948v1 Announce Type: new Abstract: We investigate the use of the Senseiver, a transformer neural network designed for sparse sensing applications, to estimate full-field surface height measurements of tsunami waves from sparse observations. The model is trained on a large ensemble of simulated data generated via a shallow water equations solver, which we show to be a faithful reproduction for the underlying dynamics by comparison to historical events. We train the model on a dataset consisting of 8 tsunami simulations whose epicenters correspond to historical USGS earthquake records, and where the model inputs are restricted to measurements obtained at actively…
Read More
Medical Video Generation for Disease Progression Simulation

Medical Video Generation for Disease Progression Simulation

arXiv:2411.11943v1 Announce Type: new Abstract: Modeling disease progression is crucial for improving the quality and efficacy of clinical diagnosis and prognosis, but it is often hindered by a lack of longitudinal medical image monitoring for individual patients. To address this challenge, we propose the first Medical Video Generation (MVG) framework that enables controlled manipulation of disease-related image and video features, allowing precise, realistic, and personalized simulations of disease progression. Our approach begins by leveraging large language models (LLMs) to recaption prompt for disease trajectory. Next, a controllable multi-round diffusion model simulates the disease progression state for each patient, creating realistic…
Read More
WaterPark: A Robustness Assessment of Language Model Watermarking

WaterPark: A Robustness Assessment of Language Model Watermarking

arXiv:2411.13425v1 Announce Type: cross Abstract: To mitigate the misuse of large language models (LLMs), such as disinformation, automated phishing, and academic cheating, there is a pressing need for the capability of identifying LLM-generated texts. Watermarking emerges as one promising solution: it plants statistical signals into LLMs' generative processes and subsequently verifies whether LLMs produce given texts. Various watermarking methods (``watermarkers'') have been proposed; yet, due to the lack of unified evaluation platforms, many critical questions remain under-explored: i) What are the strengths/limitations of various watermarkers, especially their attack robustness? ii) How do various design choices impact their robustness? iii) How…
Read More
UCSD’s New SCIDS School: All About the Apps

UCSD’s New SCIDS School: All About the Apps

San Diego Supercomputer Center/Image courtesy SDSC) Some research universities excel in exploring intellectual possibilities and postulating abstract theories. Those terms aren’t likely to be associated with the University of California, San Diego’s new School of Computing, Information and Data Sciences (SCIDS), which is all about taking the power of data and computer science and applying it to solve problems in the real world. SCIDS was officially formed last month by the union of the San Diego Supercomputer Center (SDSC) and the Halıcıoğlu Data Science Institute (HDSI). As the fourth school on the rapidly expanding UC campus, SCIDS is generating quite…
Read More
Introducing an exclusively Databricks-hosted Assistant

Introducing an exclusively Databricks-hosted Assistant

 We’re excited to announce that the Databricks Assistant, now fully hosted and managed within Databricks, is available in public preview! This version ensures the Assistant relies exclusively on Databricks-hosted models, leveraging the same secure infrastructure that powers Databricks Model Serving. With this enhancement, all customers can enjoy the full benefits of our AI-powered assistant whilst ensuring that their data processing remains within their Databricks account.Databricks AssistantWith Databricks Assistant, we set out to build the best AI-powered productivity tool for enterprise data. The adoption since the preview has been overwhelming. Assistant is one of the fastest-growing Databricks features ever, with over…
Read More
AI-generated Image Detection: Passive or Watermark?

AI-generated Image Detection: Passive or Watermark?

arXiv:2411.13553v1 Announce Type: cross Abstract: While text-to-image models offer numerous benefits, they also pose significant societal risks. Detecting AI-generated images is crucial for mitigating these risks. Detection methods can be broadly categorized into passive and watermark-based approaches: passive detectors rely on artifacts present in AI-generated images, whereas watermark-based detectors proactively embed watermarks into such images. A key question is which type of detector performs better in terms of effectiveness, robustness, and efficiency. However, the current literature lacks a comprehensive understanding of this issue. In this work, we aim to bridge that gap by developing ImageDetectBench, the first comprehensive benchmark to…
Read More
Improving Visual Place Recognition Based Robot Navigation By Verifying Localization Estimates

Improving Visual Place Recognition Based Robot Navigation By Verifying Localization Estimates

[Submitted on 11 Jul 2024 (v1), last revised 19 Nov 2024 (this version, v2)] View a PDF of the paper titled Improving Visual Place Recognition Based Robot Navigation By Verifying Localization Estimates, by Owen Claxton and 6 other authors View PDF HTML (experimental) Abstract:Visual Place Recognition (VPR) systems often have imperfect performance, affecting the `integrity' of position estimates and subsequent robot navigation decisions. Previously, SVM classifiers have been used to monitor VPR integrity. This research introduces a novel Multi-Layer Perceptron (MLP) integrity monitor which demonstrates improved performance and generalizability, removing per-environment training and reducing manual tuning requirements. We test our…
Read More
Patience Is The Key to Large Language Model Reasoning

Patience Is The Key to Large Language Model Reasoning

arXiv:2411.13082v1 Announce Type: new Abstract: Recent advancements in the field of large language models, particularly through the Chain of Thought (CoT) approach, have demonstrated significant improvements in solving complex problems. However, existing models either tend to sacrifice detailed reasoning for brevity due to user preferences, or require extensive and expensive training data to learn complicated reasoning ability, limiting their potential in solving complex tasks. To bridge this gap, following the concept of scaling test-time, we propose a simple method by encouraging models to adopt a more patient reasoning style without the need of introducing new knowledge or skills. To employ…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.