Viral News

XPrompt:Explaining Large Language Model’s Generation via Joint Prompt Attribution

XPrompt:Explaining Large Language Model’s Generation via Joint Prompt Attribution

arXiv:2405.20404v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated impressive performances in complex text generation tasks. However, the contribution of the input prompt to the generated content still remains obscure to humans, underscoring the necessity of elucidating and explaining the causality between input and output pairs. Existing works for providing prompt-specific explanation often confine model output to be classification or next-word prediction. Few initial attempts aiming to explain the entire language generation often treat input prompt texts independently, ignoring their combinatorial effects on the follow-up generation. In this study, we introduce a counterfactual explanation framework based on joint…
Read More
Medication Recommendation via Dual Molecular Modalities and Multi-Substructure Distillation

Medication Recommendation via Dual Molecular Modalities and Multi-Substructure Distillation

arXiv:2405.20358v1 Announce Type: new Abstract: Medication recommendation combines patient medical history with biomedical knowledge to assist doctors in determining medication combinations more accurately and safely. Existing approaches based on molecular knowledge overlook the atomic geometric structure of molecules, failing to capture the high-dimensional characteristics and intrinsic physical properties of medications, leading to structural confusion and the inability to extract useful substructures from individual patient visits. To address these limitations, we propose BiMoRec, which overcomes the inherent lack of molecular essential information in 2D molecular structures by incorporating 3D molecular structures and atomic properties. To retain the fast response required of…
Read More
P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation

P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation

arXiv:2405.20443v1 Announce Type: new Abstract: Diffusion models and multi-scale features are essential components in semantic segmentation tasks that deal with remote-sensing images. They contribute to improved segmentation boundaries and offer significant contextual information. U-net-like architectures are frequently employed in diffusion models for segmentation tasks. These architectural designs include dense skip connections that may pose challenges for interpreting intermediate features. Consequently, they might not efficiently convey semantic information throughout various layers of the encoder-decoder architecture. To address these challenges, we propose a new model for semantic segmentation known as the diffusion model with parallel multi-scale branches. This model consists of Parallel…
Read More
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

arXiv:2405.20362v1 Announce Type: new Abstract: Legal practice has witnessed a sharp rise in products incorporating artificial intelligence (AI). Such tools are designed to assist with a wide range of core legal tasks, from search and summarization of caselaw to document drafting. But the large language models used in these tools are prone to "hallucinate," or make up false information, making their use risky in high-stakes domains. Recently, certain legal research providers have touted methods such as retrieval-augmented generation (RAG) as "eliminating" (Casetext, 2023) or "avoid[ing]" hallucinations (Thomson Reuters, 2023), or guaranteeing "hallucination-free" legal citations (LexisNexis, 2023). Because of the closed…
Read More
ADR-BC: Adversarial Density Weighted Regression Behavior Cloning

ADR-BC: Adversarial Density Weighted Regression Behavior Cloning

arXiv:2405.20351v1 Announce Type: new Abstract: Typically, traditional Imitation Learning (IL) methods first shape a reward or Q function and then use this shaped function within a reinforcement learning (RL) framework to optimize the empirical policy. However, if the shaped reward/Q function does not adequately represent the ground truth reward/Q function, updating the policy within a multi-step RL framework may result in cumulative bias, further impacting policy learning. Although utilizing behavior cloning (BC) to learn a policy by directly mimicking a few demonstrations in a single-step updating manner can avoid cumulative bias, BC tends to greedily imitate demonstrated actions, limiting its…
Read More
Learning 3D Robotics Perception using Inductive Priors

Learning 3D Robotics Perception using Inductive Priors

[Submitted on 30 May 2024] View a PDF of the paper titled Learning 3D Robotics Perception using Inductive Priors, by Muhammad Zubair Irshad View PDF HTML (experimental) Abstract:Recent advances in deep learning have led to a data-centric intelligence i.e. artificially intelligent models unlocking the potential to ingest a large amount of data and be really good at performing digital tasks such as text-to-image generation, machine-human conversation, and image recognition. This thesis covers the topic of learning with structured inductive bias and priors to design approaches and algorithms unlocking the potential of principle-centric intelligence. Prior knowledge (priors for short), often available…
Read More
Small Language Models for Application Interactions: A Case Study

Small Language Models for Application Interactions: A Case Study

arXiv:2405.20347v1 Announce Type: new Abstract: We study the efficacy of Small Language Models (SLMs) in facilitating application usage through natural language interactions. Our focus here is on a particular internal application used in Microsoft for cloud supply chain fulfilment. Our experiments show that small models can outperform much larger ones in terms of both accuracy and running time, even when fine-tuned on small datasets. Alongside these results, we also highlight SLM-based system design considerations. Source link lol
Read More
Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges

Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges

arXiv:2405.20350v1 Announce Type: new Abstract: Neural Network based approximations of the Value function make up the core of leading Policy Based methods such as Trust Regional Policy Optimization (TRPO) and Proximal Policy Optimization (PPO). While this adds significant value when dealing with very complex environments, we note that in sufficiently low State and action space environments, a computationally expensive Neural Network architecture offers marginal improvement over simpler Value approximation methods. We present an implementation of Natural Actor Critic algorithms with actor updates through Natural Policy Gradient methods. This paper proposes that Natural Policy Gradient (NPG) methods with Linear Function Approximation…
Read More
LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild

LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild

arXiv:2405.20363v1 Announce Type: new Abstract: Image geolocation is a critical task in various image-understanding applications. However, existing methods often fail when analyzing challenging, in-the-wild images. Inspired by the exceptional background knowledge of multimodal language models, we systematically evaluate their geolocation capabilities using a novel image dataset and a comprehensive evaluation framework. We first collect images from various countries via Google Street View. Then, we conduct training-free and training-based evaluations on closed-source and open-source multi-modal language models. we conduct both training-free and training-based evaluations on closed-source and open-source multimodal language models. Our findings indicate that closed-source models demonstrate superior geolocation abilities,…
Read More
Training FasterViT on VOC Segmentation Dataset

Training FasterViT on VOC Segmentation Dataset

In this article, we are going to train the FasterViT model on the Pascal VOC segmentation dataset. FasterViT is one of the faster alternatives to the original Vision Transformer models. Not only faster, but it is deemed more accurate as well across many image classification tasks. In some of the previous articles, we covered the image classification and binary semantic segmentation capabilities of the Faster ViT model. In this article, we will focus on the multi-label semantic segmentation capability using FasterViT. Figure 1. Example output after training the FasterViT on the VOC segmentation dataset. As the original FasterViT project does…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.