Viral News

SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input

SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input

arXiv:2411.11934v1 Announce Type: new Abstract: Stereo video synthesis from a monocular input is a demanding task in the fields of spatial computing and virtual reality. The main challenges of this task lie on the insufficiency of high-quality paired stereo videos for training and the difficulty of maintaining the spatio-temporal consistency between frames. Existing methods primarily address these issues by directly applying novel view synthesis (NVS) techniques to video, while facing limitations such as the inability to effectively represent dynamic scenes and the requirement for large amounts of training data. In this paper, we introduce a novel self-supervised stereo video synthesis…
Read More
Neuron Patching: Semantic-based Neuron-level Language Model Repair for Code Generation

Neuron Patching: Semantic-based Neuron-level Language Model Repair for Code Generation

[Submitted on 8 Dec 2023 (v1), last revised 20 Nov 2024 (this version, v5)] View a PDF of the paper titled Neuron Patching: Semantic-based Neuron-level Language Model Repair for Code Generation, by Jian Gu and 3 other authors View PDF Abstract:Language Models (LMs) have become widely used in software engineering, especially for tasks such as code generation, where they are referred to as code LMs. These models have proven effective in generating code, making it easier for developers to automate coding activities. However, research has highlighted a significant limitation: despite their effectiveness, LMs often produce code that is incorrect, buggy,…
Read More
Tree Species Classification using Machine Learning and 3D Tomographic SAR — a case study in Northern Europe

Tree Species Classification using Machine Learning and 3D Tomographic SAR — a case study in Northern Europe

[Submitted on 19 Nov 2024] View a PDF of the paper titled Tree Species Classification using Machine Learning and 3D Tomographic SAR -- a case study in Northern Europe, by Colverd Grace and 4 other authors View PDF HTML (experimental) Abstract:Tree species classification plays an important role in nature conservation, forest inventories, forest management, and the protection of endangered species. Over the past four decades, remote sensing technologies have been extensively utilized for tree species classification, with Synthetic Aperture Radar (SAR) emerging as a key technique. In this study, we employed TomoSense, a 3D tomographic dataset, which utilizes a stack…
Read More
S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

arXiv:2411.12547v1 Announce Type: cross Abstract: The irregular and challenging characteristics of lung adenocarcinoma nodules in computed tomography (CT) images complicate staging diagnosis, making accurate segmentation critical for clinicians to extract detailed lesion information. In this study, we propose a segmentation model, S3TU-Net, which integrates multi-dimensional spatial connectors and a superpixel-based visual transformer. S3TU-Net is built on a multi-view CNN-Transformer hybrid architecture, incorporating superpixel algorithms, structured weighting, and spatial shifting techniques to achieve superior segmentation performance. The model leverages structured convolution blocks (DWF-Conv/D2BR-Conv) to extract multi-scale local features while mitigating overfitting. To enhance multi-scale feature fusion, we introduce the S2-MLP Link,…
Read More
Playing Language Game with LLMs Leads to Jailbreaking

Playing Language Game with LLMs Leads to Jailbreaking

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
CODES: Benchmarking Coupled ODE Surrogates

CODES: Benchmarking Coupled ODE Surrogates

[Submitted on 28 Oct 2024 (v1), last revised 20 Nov 2024 (this version, v2)] View a PDF of the paper titled CODES: Benchmarking Coupled ODE Surrogates, by Robin Janssen and 2 other authors View PDF HTML (experimental) Abstract:We introduce CODES, a benchmark for comprehensive evaluation of surrogate architectures for coupled ODE systems. Besides standard metrics like mean squared error (MSE) and inference time, CODES provides insights into surrogate behaviour across multiple dimensions like interpolation, extrapolation, sparse data, uncertainty quantification and gradient correlation. The benchmark emphasizes usability through features such as integrated parallel training, a web-based configuration generator, and pre-implemented baseline…
Read More
AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text

SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
mDAE : modified Denoising AutoEncoder for missing data imputation

mDAE : modified Denoising AutoEncoder for missing data imputation

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Source link lol
Read More
Automatic Classification of General Movements in Newborns

Automatic Classification of General Movements in Newborns

[Submitted on 14 Nov 2024 (v1), last revised 19 Nov 2024 (this version, v2)] View a PDF of the paper titled Automatic Classification of General Movements in Newborns, by Daphn'e Chopard and 6 other authors View PDF HTML (experimental) Abstract:General movements (GMs) are spontaneous, coordinated body movements in infants that offer valuable insights into the developing nervous system. Assessed through the Prechtl GM Assessment (GMA), GMs are reliable predictors for neurodevelopmental disorders. However, GMA requires specifically trained clinicians, who are limited in number. To scale up newborn screening, there is a need for an algorithm that can automatically classify GMs…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.