Skip to content

DWCTOD/cv-arxiv-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Updated on 2024.11.12

Video_Classification

Publish Date Title Authors PDF Code
2024-11-08 Gender Inequalities in Content Collaborations: Asymmetric Creator Synergy and Symmetric Audience Biases Mingyue Zha et.al. 2411.05782v1 null
2024-11-08 Sketched Equivariant Imaging Regularization and Deep Internal Learning for Inverse Problems Guixian Xu et.al. 2411.05771v1 null
2024-11-08 FisherMask: Enhancing Neural Network Labeling Efficiency in Image Classification Using Fisher Information Shreen Gul et.al. 2411.05752v1 link
2024-11-08 Accurate Unsupervised Photon Counting from Transition Edge Sensor Signals Nicolas Dalbec-Constant et.al. 2411.05737v1 null
2024-11-08 Poze: Sports Technique Feedback under Data Constraints Agamdeep Singh et.al. 2411.05734v1 null
2024-11-08 Differential Privacy Under Class Imbalance: Methods and Empirical Insights Lucas Rosenblatt et.al. 2411.05733v1 null
2024-11-08 On-chip rewritable phase-change metasurface for programmable diffractive deep neural networks Sanaz Zarei et.al. 2411.05723v1 null
2024-11-08 Classification of ($ρ,τ,σ$)-derivations of two-dimensional left-symmetric dialgebras Basdouri Imed et.al. 2411.05716v1 null
2024-11-08 STARS: Sensor-agnostic Transformer Architecture for Remote Sensing Ethan King et.al. 2411.05714v1 null
2024-11-08 Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream Abdulkadir Gokce et.al. 2411.05712v1 link
2024-11-07 ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning David Junhao Zhang et.al. 2411.05003v1 null
2024-11-07 DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation Peiqi Liu et.al. 2411.04999v1 null
2024-11-07 HourVideo: 1-Hour Video-Language Understanding Keshigeyan Chandrasegaran et.al. 2411.04998v1 null
2024-11-07 SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation Koichi Namekata et.al. 2411.04989v1 null
2024-11-07 Efficient Preparation of Solvable Anyons with Adaptive Quantum Circuits Yuanjie Ren et.al. 2411.04985v1 null
2024-11-07 Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries Dylan Manuel et.al. 2411.04981v1 null
2024-11-07 Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification Mischa Dombrowski et.al. 2411.04956v1 null
2024-11-07 Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach Gideon Yoffe et.al. 2411.04950v1 null
2024-11-07 Proof of the absence of local conserved quantities in the spin-1 bilinear-biquadratic chain and its anisotropic extensions Akihiro Hokkyo et.al. 2411.04945v1 null
2024-11-07 A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model Panwen Hu et.al. 2411.04942v1 null
2024-11-06 RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models Maya Varma et.al. 2411.04097v1 link
2024-11-06 Local unitary equivalence of absolutely maximally entangled states constructed from orthogonal arrays N Ramadas et.al. 2411.04096v1 null
2024-11-06 A Collaborative Content Moderation Framework for Toxicity Detection based on Conformalized Estimates of Annotation Disagreement Guillermo Villate-Castillo et.al. 2411.04090v1 link
2024-11-06 Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning Ping Li et.al. 2411.04059v1 link
2024-11-06 Distinguishing Coupled Dark Energy Models with Neural Networks L. W. K. Goh et.al. 2411.04058v1 link
2024-11-06 Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging Yuan Bi et.al. 2411.04004v1 null
2024-11-06 Learning Aggregate Queries Defined by First-Order Logic with Counting Steffen van Bergerem et.al. 2411.04003v1 null
2024-11-06 ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial Networks Ziji Shi et.al. 2411.03999v1 null
2024-11-06 Fine-tuning -- a Transfer Learning approach Joseph Arul Raj et.al. 2411.03941v1 null
2024-11-06 Inter-Frame Coding for Dynamic Meshes via Coarse-to-Fine Anchor Mesh Generation He Huang et.al. 2411.03921v1 null
2024-11-05 Classification Done Right for Vision-Language Pre-Training Huang Zilong et.al. 2411.03313v1 link
2024-11-05 Automatic solid form classification in pharmaceutical drug development Julius Lange et.al. 2411.03308v1 null
2024-11-05 Data-Driven Sampling Based Stochastic MPC for Skid-Steer Mobile Robot Navigation Ananya Trivedi et.al. 2411.03289v1 link
2024-11-05 Graph-Based Semi-Supervised Segregated Lipschitz Learning Farid Bozorgnia et.al. 2411.03273v1 null
2024-11-05 Tuning into spatial frequency space: Satellite and space debris detection in the ZTF alert stream J. P. Carvajal et.al. 2411.03258v1 null
2024-11-05 Kernel Orthogonality does not necessarily imply a Decrease in Feature Map Redundancy in CNNs: Convolutional Similarity Minimization Zakariae Belmekki et.al. 2411.03226v1 null
2024-11-05 Beyond Grid Data: Exploring Graph Neural Networks for Earth Observation Shan Zhao et.al. 2411.03223v1 null
2024-11-05 Statistical Analysis to Support CSI-Based Sensing Methods Elena Tonini et.al. 2411.03203v1 null
2024-11-05 Navigating Extremes: Dynamic Sparsity in Large Output Space Nasib Ullah et.al. 2411.03171v1 null
2024-11-05 Pre-trained Visual Dynamics Representations for Efficient Policy Learning Hao Luo et.al. 2411.03169v1 null
2024-11-04 Adaptive Caching for Faster Video Generation with Diffusion Transformers Kumara Kahatapitiya et.al. 2411.02397v1 null
2024-11-04 AutoVFX: Physically Realistic Video Editing from Natural Language Instructions Hao-Yu Hsu et.al. 2411.02394v1 null
2024-11-04 How Far is Video Generation from World Model: A Physical Law Perspective Bingyi Kang et.al. 2411.02385v1 null
2024-11-04 Drone Data Analytics for Measuring Traffic Metrics at Intersections in High-Density Areas Qingwen Pu et.al. 2411.02349v1 null
2024-11-04 SplatOverflow: Asynchronous Hardware Troubleshooting Amritansh Kwatra et.al. 2411.02332v1 null
2024-11-04 PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance Ruyang Liu et.al. 2411.02327v1 link
2024-11-04 GenXD: Generating Any 3D and 4D Scenes Yuyang Zhao et.al. 2411.02319v1 null
2024-11-04 Information plane and compression-gnostic feedback in quantum machine learning Nathan Haboury et.al. 2411.02313v1 null
2024-11-04 Grouped Discrete Representation for Object-Centric Learning Rongzhen Zhao et.al. 2411.02299v1 null
2024-11-04 Conformal-in-the-Loop for Learning with Imbalanced Noisy Data John Brandon Graham-Knight et.al. 2411.02281v1 null
2024-10-31 EgoMimic: Scaling Imitation Learning via Egocentric Video Simar Kareer et.al. 2410.24221v1 link
2024-10-31 Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning Penghui Ruan et.al. 2410.24219v1 link
2024-10-31 Learning Video Representations without Natural Videos Xueyang Yu et.al. 2410.24213v1 null
2024-11-01 DELTA: Dense Efficient Long-range 3D Tracking for any video Tuan Duc Ngo et.al. 2410.24211v2 null
2024-10-31 DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion Weicai Ye et.al. 2410.24203v1 link
2024-10-31 DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning Zhenyu Jiang et.al. 2410.24185v1 null
2024-10-31 Extended Object Tracking and Classification based on Linear Splines Matteo Tesori et.al. 2410.24183v1 null
2024-10-31 $π_0$: A Vision-Language-Action Flow Model for General Robot Control Kevin Black et.al. 2410.24164v1 null
2024-10-31 Exploring Vision Language Models for Facial Attribute Recognition: Emotion, Race, Gender, and Age Nouar AlDahoul et.al. 2410.24148v1 null
2024-10-31 HoloChrome: Polychromatic Illumination for Speckle Reduction in Holographic Near-Eye Displays Florian Schiffers et.al. 2410.24144v1 null
2024-10-30 Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards Irmak Guzey et.al. 2410.23289v1 null
2024-10-30 Computing the bridge length: the key ingredient in a continuous isometry classification of periodic point sets Jonathan McManus et.al. 2410.23288v1 null
2024-10-30 ReferEverything: Towards Segmenting Everything We Can Speak of in Videos Anurag Bagchi et.al. 2410.23287v1 null
2024-10-30 DisCo: Distributed Contact-Rich Trajectory Optimization for Forceful Multi-Robot Collaboration Ola Shorinwa et.al. 2410.23283v1 null
2024-10-30 A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization Bin Wu et.al. 2410.23279v1 null
2024-10-30 SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Yining Hong et.al. 2410.23277v1 null
2024-10-30 TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models Ziyao Shangguan et.al. 2410.23266v1 link
2024-10-30 bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction Yehe Liu et.al. 2410.23247v1 null
2024-10-30 PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching Chen Ziwen et.al. 2410.23245v1 null
2024-10-31 Aligning Audio-Visual Joint Representations with an Agentic Workflow Shentong Mo et.al. 2410.23230v2 null
2024-10-29 Local Policies Enable Zero-shot Long-horizon Manipulation Murtaza Dalal et.al. 2410.22332v1 null
2024-10-30 Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets Guangqi Jiang et.al. 2410.22325v2 null
2024-10-29 Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models Seetharam Killivalavan et.al. 2410.22323v1 null
2024-10-29 Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier Kai Wang et.al. 2410.22317v1 link
2024-10-29 Convex Formulations for Training Two-Layer ReLU Neural Networks Karthik Prakhya et.al. 2410.22311v1 link
2024-10-29 Emotion-Guided Image to Music Generation Souraja Kundu et.al. 2410.22299v1 null
2024-10-29 Motion Graph Unleashed: A Novel Approach to Video Prediction Yiqi Zhong et.al. 2410.22288v1 link
2024-10-29 Non-LTE Synthetic Observables of a Multidimensional Model of Type Ia Supernovae Samuel J. Boos et.al. 2410.22276v1 null
2024-10-29 Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation Davide Berghi et.al. 2410.22271v1 null
2024-10-29 LipKernel: Lipschitz-Bounded Convolutional Neural Networks via Dissipative Layers Patricia Pauli et.al. 2410.22258v1 link
2024-10-28 LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior Hanyu Wang et.al. 2410.21264v1 null
2024-10-28 Multi-modal AI for comprehensive breast cancer prognostication Jan Witowski et.al. 2410.21256v1 null
2024-10-28 Joint Audio-Visual Idling Vehicle Detection with Streamlined Input Dependencies Xiwen Li et.al. 2410.21170v1 null
2024-10-28 KaLDeX: Kalman Filter based Linear Deformable Cross Attention for Retina Vessel Segmentation Zhihao Zhao et.al. 2410.21160v1 null
2024-10-28 Synthetica: Large Scale Synthetic Data for Robot Perception Ritvik Singh et.al. 2410.21153v1 null
2024-10-28 The tau function for ABS equations James Atkinson et.al. 2410.21148v1 null
2024-10-28 Enhancing Learned Image Compression via Cross Window-based Attention Priyanka Mudgal et.al. 2410.21144v1 null
2024-10-28 uOttawa at LegalLens-2024: Transformer-based Classification Experiments Nima Meghdadi et.al. 2410.21139v1 link
2024-10-28 Do LLMs generate test oracles that capture the actual or the expected program behaviour? Michael Konstantinou et.al. 2410.21136v1 null
2024-10-28 Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences Zhihao Zhao et.al. 2410.21130v1 null
2024-10-25 Sparse Decomposition of Graph Neural Networks Yaochen Hu et.al. 2410.19723v1 null
2024-10-25 Arabic Music Classification and Generation using Deep Learning Mohamed Elshaarawy et.al. 2410.19719v1 null
2024-10-25 Enhanced Anomaly Detection in Industrial Control Systems aided by Machine Learning Vegard Berge et.al. 2410.19717v1 null
2024-10-25 TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning Xiangyu Zeng et.al. 2410.19702v1 null
2024-10-25 MILES: Making Imitation Learning Easy with Self-Supervision Georgios Papagiannis et.al. 2410.19693v1 null
2024-10-25 Deep Learning for Classification of Inflammatory Bowel Disease Activity in Whole Slide Images of Colonic Histopathology Amit Das et.al. 2410.19690v1 null
2024-10-25 Optimizing Hearthstone Agents using an Evolutionary Algorithm Pablo García-Sánchez et.al. 2410.19681v1 null
2024-10-25 Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective Ethan Harvey et.al. 2410.19675v1 null
2024-10-25 MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services Hongjia Wu et.al. 2410.19665v1 null
2024-10-25 VARS: Vision-based Assessment of Risk in Security Systems Pranav Gupta et.al. 2410.19642v1 null
2024-10-24 Framer: Interactive Frame Interpolation Wen Wang et.al. 2410.18978v1 null
2024-10-24 CAMEL-Bench: A Comprehensive Arabic LMM Benchmark Sara Ghaboura et.al. 2410.18976v1 link
2024-10-24 Unbounded: A Generative Infinite Game of Character Life Simulation Jialu Li et.al. 2410.18975v1 null
2024-10-24 Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling Mingtong Zhang et.al. 2410.18912v1 null
2024-10-24 SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment Caelan Garrett et.al. 2410.18907v1 null
2024-10-24 A Survey of Multimodal Sarcasm Detection Shafkat Farabi et.al. 2410.18882v1 null
2024-10-24 Multi-Class Abnormality Classification in Video Capsule Endoscopy Using Deep Learning Arnav Samal et.al. 2410.18879v1 link
2024-10-24 Exploring the Universe with SNAD: Anomaly Detection in Astronomy Alina A. Volnova et.al. 2410.18875v1 null
2024-10-24 Exploring a Geometric Conjecture, Some Properties of Blaschke Products, and the Geometry of Curves Formed by Them Mehmet Celik et.al. 2410.18863v1 null
2024-10-24 Highly efficient non-rigid registration in k-space with application to cardiac Magnetic Resonance Imaging Aya Ghoul et.al. 2410.18834v1 link
2024-10-23 FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution Yang-Che Sun et.al. 2410.18083v1 null
2024-10-23 WorldSimBench: Towards Video Generation Models as World Simulators Yiran Qin et.al. 2410.18072v1 null
2024-10-23 Eigenvalue crossings in equivariant families of matrices Jonathan Rawlinson et.al. 2410.18068v1 null
2024-10-23 The Double-Edged Sword of Behavioral Responses in Strategic Classification: Theory and User Studies Raman Ebrahimi et.al. 2410.18066v1 null
2024-10-23 Real time anomalies detection on video Fabien Poirier et.al. 2410.18051v1 null
2024-10-23 Boundary topological insulators and superconductors of Altland-Zirnbauer tenfold classes Xun-Jiang Luo et.al. 2410.18015v1 null
2024-10-24 Effective Finite Time Stability Control for Human-Machine Shared Vehicle Following System Zihan Wang et.al. 2410.18007v2 null
2024-10-23 Benchmarking Foundation Models on Exceptional Cases: Dataset Creation and Validation Suho Kang et.al. 2410.18001v1 link
2024-10-23 Optical Generative Models Shiqi Chen et.al. 2410.17970v1 null
2024-10-23 A Wavelet Diffusion GAN for Image Super-Resolution Lorenzo Aloisi et.al. 2410.17966v1 null
2024-10-22 Altogether: Image Captioning via Re-aligning Alt-text Hu Xu et.al. 2410.17251v1 null
2024-10-22 Classifying rational polygons with small denominator and few interior lattice points Martin Bohnert et.al. 2410.17244v1 null
2024-10-22 Frontiers in Intelligent Colonoscopy Ge-Peng Ji et.al. 2410.17241v1 link
2024-10-22 Automated Spinal MRI Labelling from Reports Using a Large Language Model Robin Y. Park et.al. 2410.17235v1 link
2024-10-22 Few-shot In-Context Preference Learning Using Large Language Models Chao Yu et.al. 2410.17233v1 null
2024-10-22 Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods Tsachi Blau et.al. 2410.17222v1 null
2024-10-22 The Decision Problem for Regular First-Order Theories Umang Mathur et.al. 2410.17185v1 null
2024-10-22 Technical Report: Toward Applying Quantum Computing to Network Verification Kahlil Dozier et.al. 2410.17184v1 null
2024-10-22 KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements Md Meftahul Ferdaus et.al. 2410.17172v1 link
2024-10-22 Are Visual-Language Models Effective in Action Recognition? A Comparative Study Mahmoud Ali et.al. 2410.17149v1 null
2024-10-21 SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Shuangrui Ding et.al. 2410.16268v1 link
2024-10-21 xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs Michael S. Ryoo et.al. 2410.16267v1 null
2024-10-21 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors Xi Liu et.al. 2410.16266v1 null
2024-10-21 Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos Gengshan Yang et.al. 2410.16259v1 null
2024-10-21 Serendipitous detection of an intense X-ray flare in the weak-line T Tauri star KM Ori with SRG/eROSITA Savithri H. Ezhikode et.al. 2410.16241v1 null
2024-10-21 MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report Samrajya Thapa et.al. 2410.16239v1 link
2024-10-21 Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment G. A. Nketiah et.al. 2410.16238v1 null
2024-10-22 Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models Giannis Daras et.al. 2410.16152v2 null
2024-10-21 An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection Chandravardhan Singh Raghaw et.al. 2410.16143v1 null
2024-10-21 Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors Finn Schmidt et.al. 2410.16136v1 null
2024-10-18 Real-time Fake News from Adversarial Feedback Sanxing Chen et.al. 2410.14651v1 null
2024-10-18 GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings Raghuveer Thirukovalluru et.al. 2410.14635v1 null
2024-10-18 You Shall Know a Tool by the Traces it Leaves: The Predictability of Sentiment Analysis Tools Daniel Baumartz et.al. 2410.14626v1 null
2024-10-18 Learning to Control the Smoothness of Graph Convolutional Network Features Shih-Hsin Wang et.al. 2410.14604v1 null
2024-10-18 Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection Aaron Alvarado Kristanto Julistiono et.al. 2410.14581v1 null
2024-10-18 A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification Maksuda Akter et.al. 2410.14536v1 null
2024-10-18 Less is More: Selective Reduction of CT Data for Self-Supervised Pre-Training of Deep Learning Models with Contrastive Learning Improves Downstream Classification Performance Daniel Wolf et.al. 2410.14524v1 link
2024-10-18 Influence of anisotropy on the study of critical behavior of spin models by machine learning methods Diana Sukhoverkhova et.al. 2410.14523v1 null
2024-10-18 A character approach to the ISR property Artem Dudko et.al. 2410.14517v1 null
2024-10-18 Efficient Annotator Reliability Assessment and Sample Weighting for Knowledge-Based Misinformation Detection on Social Media Owen Cook et.al. 2410.14515v1 link
2024-10-17 DepthSplat: Connecting Gaussian Splatting and Depth Haofei Xu et.al. 2410.13862v1 link
2024-10-17 Adaptive Subsampling and Learned Model Improve Spatiotemporal Resolution of Tactile Skin Ariel Slepyan et.al. 2410.13847v1 null
2024-10-17 VidPanos: Generative Panoramic Videos from Casual Panning Videos Jingwei Ma et.al. 2410.13832v1 null
2024-10-17 DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control Yujie Wei et.al. 2410.13830v1 null
2024-10-17 Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks Clément Playout et.al. 2410.13822v1 link
2024-10-17 Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance Mitsuhiko Nakamoto et.al. 2410.13816v1 null
2024-10-17 A Pattern to Align Them All: Integrating Different Modalities to Define Multi-Modal Entities Gianluca Apriceno et.al. 2410.13803v1 link
2024-10-17 MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations Liang Xu et.al. 2410.13790v1 link
2024-10-17 Strong-to-weak spontaneous symmetry breaking meets average symmetry-protected topological order Yuchen Guo et.al. 2410.13734v1 null
2024-10-17 Representing Model Weights with Language using Tree Experts Eliahu Horwitz et.al. 2410.13569v1 null
2024-10-16 Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception Jihao Zhao et.al. 2410.12788v1 null
2024-10-16 The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio Sicong Leng et.al. 2410.12787v1 null
2024-10-16 Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions Zhenyu Jiang et.al. 2410.12773v1 null
2024-10-16 Vaccinating Federated Learning for Robust Modulation Classification in Distributed Wireless Networks Hunmin Lee et.al. 2410.12772v1 null
2024-10-16 Phase retrieval via media diversity Yan Cheng et.al. 2410.12767v1 null
2024-10-16 SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation Jaehong Yoon et.al. 2410.12761v1 null
2024-10-16 Unitary Multi-Margin BERT for Robust Natural Language Processing Hao-Yuan Chang et.al. 2410.12759v1 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742v1 null
2024-10-16 How much time do we have before catastrophic disclosure occurs? Matthew Szydagis et.al. 2410.12738v1 null
2024-10-16 Machine Learning-Augmented Ontology-Based Data Access for Renewable Energy Data Marco Calautti et.al. 2410.12734v1 null
2024-10-15 High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion Junhwa Hur et.al. 2410.11838v1 null
2024-10-15 Contrastive Touch-to-Touch Pretraining Samanta Rodriguez et.al. 2410.11834v1 null
2024-10-15 CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos Nikita Karaev et.al. 2410.11831v1 null
2024-10-15 Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos Zhouxia Wang et.al. 2410.11828v1 null
2024-10-15 On representations of Arthur type and unitary dual for classical groups Alexander Hazeltine et.al. 2410.11806v1 null
2024-10-16 Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices Zhiyuan Ma et.al. 2410.11795v2 null
2024-10-15 OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation Jinhan Li et.al. 2410.11792v1 null
2024-10-15 Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability Tsz Ting Chung et.al. 2410.11786v1 null
2024-10-15 On the Training Convergence of Transformers for In-Context Classification Wei Shen et.al. 2410.11778v1 null
2024-10-15 Temporal resolution enhancement in Structured Illumination Microscopy using cascaded reconstruction Doron Shterman et.al. 2410.11770v1 null
2024-10-14 Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models Jingzhi Bao et.al. 2410.10821v1 null
2024-10-14 TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Mu Cai et.al. 2410.10818v1 null
2024-10-14 LVD-2M: A Long-take Video Dataset with Temporally Dense Captions Tianwei Xiong et.al. 2410.10816v1 link
2024-10-14 Depth Any Video with Scalable Synthetic Data Honghui Yang et.al. 2410.10815v1 null
2024-10-14 Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies Yanjie Ze et.al. 2410.10803v1 link
2024-10-14 Boosting Camera Motion Control for Video Diffusion Transformers Soon Yau Cheong et.al. 2410.10802v1 null
2024-10-14 Probabilistic Degeneracy Detection for Point-to-Plane Error Minimization Johan Hatleskog et.al. 2410.10784v1 null
2024-10-14 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications Eduardo R. Corral-Soto et.al. 2410.10782v1 null
2024-10-14 ControlMM: Controllable Masked Motion Generation Ekkasit Pinyoanuntapong et.al. 2410.10780v1 null
2024-10-14 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention Dejia Xu et.al. 2410.10774v1 null
2024-10-11 Optimal Downsampling for Imbalanced Classification with Generalized Linear Models Yan Chen et.al. 2410.08994v1 null
2024-10-11 Realizing Linear Synaptic Plasticity in Electric Double Layer-Gated Transistors for Improved Predictive Accuracy and Efficiency in Neuromorphic Computing Nithil Harris Manimaran et.al. 2410.08978v1 null
2024-10-11 ALVIN: Active Learning Via INterpolation Michalis Korakakis et.al. 2410.08972v1 null
2024-10-11 Evaluating Federated Kolmogorov-Arnold Networks on Non-IID Data Arthur Mendonça Sasse et.al. 2410.08961v1 null
2024-10-11 Lifted Coefficient of Determination: Fast model-free prediction intervals and likelihood-free model comparison Daniel Salnikov et.al. 2410.08958v1 null
2024-10-11 Rapid Grassmannian Averaging with Chebyshev Polynomials Brighton Ancelin et.al. 2410.08956v1 null
2024-10-11 Local moduli in the special 2-flags of length 5 Piotr Mormul et.al. 2410.08951v1 null
2024-10-11 On the Adversarial Transferability of Generalized "Skip Connections" Yisen Wang et.al. 2410.08950v1 null
2024-10-11 Enhancing Motion Variation in Text-to-Motion Models via Pose and Video Conditioned Editing Clayton Leite et.al. 2410.08931v1 null
2024-10-11 Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images Virmarie Maquiling et.al. 2410.08926v1 null
2024-10-10 LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts Anh-Quan Cao et.al. 2410.08211v1 null
2024-10-10 Scaling Laws For Diffusion Transformers Zhengyang Liang et.al. 2410.08184v1 null
2024-10-10 RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image Xiaoxue Chen et.al. 2410.08181v1 null
2024-10-10 A note on the symplectic classification of almost-toric systems Xiudi Tang et.al. 2410.08175v1 null
2024-10-10 Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models Qingni Wang et.al. 2410.08174v1 null
2024-10-10 Progressive Autoregressive Video Diffusion Models Desai Xie et.al. 2410.08151v1 link
2024-10-10 Robust AI-Generated Text Detection by Restricted Embeddings Kristian Kuznetsov et.al. 2410.08113v1 null
2024-10-10 Color-Guided Flying Pixel Correction in Depth Images Ekamresh Vasudevan et.al. 2410.08084v1 null
2024-10-10 Dynamic Object Catching with Quadruped Robot Front Legs André Schakkal et.al. 2410.08065v1 null
2024-10-10 A Target-Aware Analysis of Data Augmentation for Hate Speech Detection Camilla Casula et.al. 2410.08053v1 null
2024-10-09 MM-Ego: Towards Building Egocentric Multimodal LLMs Hanrong Ye et.al. 2410.07177v1 null
2024-10-09 One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Fabian Paischer et.al. 2410.07170v1 null
2024-10-09 Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis Bohan Zeng et.al. 2410.07155v1 link
2024-10-09 Mental Disorders Detection in the Era of Large Language Models Gleb Kuzmin et.al. 2410.07129v1 null
2024-10-09 Thing2Reality: Transforming 2D Content into Conditioned Multiviews and 3D Gaussian Objects for XR Communication Erzhen Hu et.al. 2410.07119v1 null
2024-10-09 JPEG Inspired Deep Learning Ahmed H. Salamah et.al. 2410.07081v1 null
2024-10-09 Retrieval-Augmented Decision Transformer: External Memory for In-context RL Thomas Schmied et.al. 2410.07071v1 null
2024-10-09 TinyEmo: Scaling down Emotional Reasoning via Metric Projection Cristian Gutierrez et.al. 2410.07062v1 link
2024-10-09 Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes Fisseha A. Ferede et.al. 2410.07043v1 link
2024-10-09 Optimizing Estimators of Squared Calibration Errors in Classification Sebastian G. Gruber et.al. 2410.07014v1 null
2024-10-07 Fine-Tuning CLIP's Last Visual Projector: A Few-Shot Cornucopia Mohammad Fahes et.al. 2410.05270v1 link
2024-10-07 Grounding Partially-Defined Events in Multimodal Data Kate Sanders et.al. 2410.05267v1 null
2024-10-07 DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control Kaifeng Zhao et.al. 2410.05260v1 null
2024-10-07 SePPO: Semi-Policy Preference Optimization for Diffusion Alignment Daoan Zhang et.al. 2410.05255v1 link
2024-10-07 Causal Micro-Narratives Mourad Heddaya et.al. 2410.05252v1 null
2024-10-07 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249v1 null
2024-10-07 The Dawn of Video Generation: Preliminary Explorations with SORA-like Models Ailing Zeng et.al. 2410.05227v1 null
2024-10-07 Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality Ge Ya et.al. 2410.05203v1 link
2024-10-07 Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Senorita Deb et.al. 2410.05189v1 null
2024-10-07 VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks Ziyan Jiang et.al. 2410.05160v1 null
2024-10-04 Spatial Hyperspheric Models for Compositional Data Michael R. Schwob et.al. 2410.03648v1 null
2024-10-04 HyperCMR: Enhanced Multi-Contrast CMR Reconstruction with Eagle Loss Ruru Xu et.al. 2410.03624v1 null
2024-10-04 Crystallography, Group Cohomology, and Lieb-Schultz-Mattis Constraints Chunxiao Liu et.al. 2410.03607v1 null
2024-10-04 LeLaN: Learning A Language-Conditioned Navigation Policy from In-the-Wild Videos Noriaki Hirose et.al. 2410.03603v1 null
2024-10-04 Training Over a Distribution of Hyperparameters for Enhanced Performance and Adaptability on Imbalanced Classification Kelsey Lieberman et.al. 2410.03588v1 null
2024-10-04 A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development Jesper Knapp et.al. 2410.03580v1 null
2024-10-04 Re-examining Sexism and Misogyny Classification with Annotator Attitudes Aiqi Jiang et.al. 2410.03543v1 null
2024-10-04 Classification-Denoising Networks Louis Thiry et.al. 2410.03505v1 null
2024-10-04 MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation Hongcheng Wang et.al. 2410.03488v1 null
2024-10-04 A Multimodal Framework for Deepfake Detection Kashish Gandhi et.al. 2410.03487v1 null
2024-10-03 Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats Mingyang Xie et.al. 2410.02764v1 null
2024-10-03 Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos Jianrui Zhang et.al. 2410.02763v1 null
2024-10-03 Loong: Generating Minute-level Long Videos with Autoregressive Language Models Yuqing Wang et.al. 2410.02757v1 null
2024-10-03 An Online Automatic Modulation Classification Scheme Based on Isolation Distributional Kernel Xinpeng Li et.al. 2410.02750v1 null
2024-10-03 OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable? Liangze Jiang et.al. 2410.02735v1 null
2024-10-03 Liouville's theorem in calibrated geometries Toni Ikonen et.al. 2410.02722v1 null
2024-10-03 Curvature Diversity-Driven Deformation and Domain Alignment for Point Cloud Mengxi Wu et.al. 2410.02720v1 link
2024-10-03 AlzhiNet: Traversing from 2DCNN to 3DCNN, Towards Early Detection and Diagnosis of Alzheimer's Disease Romoke Grace Akindele et.al. 2410.02714v1 null
2024-10-04 Video Instruction Tuning With Synthetic Data Yuanhan Zhang et.al. 2410.02713v2 null
2024-10-03 Impact of a reclassification on Web of Science articles on bibliometric indicators Agénor Lahatte et.al. 2410.02701v1 null
2024-10-02 Loki: An Open-Source Tool for Fact Verification Haonan Li et.al. 2410.01794v1 null
2024-10-03 Application of convolutional neural networks for extensive air shower separation in the SPHERE-3 experiment E. L. Entina et.al. 2410.01781v2 null
2024-10-03 TopER: Topological Embeddings in Graph Representation Learning Astrit Tola et.al. 2410.01778v2 null
2024-10-02 Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context Spencer Frei et.al. 2410.01774v1 null
2024-10-02 SegHeD: Segmentation of Heterogeneous Data for Multiple Sclerosis Lesions with Anatomical Constraints Berke Doga Basaran et.al. 2410.01766v1 null
2024-10-02 LightSC: The Making of a Usable Security Classification Tool for DevSecOps Manish Shrestha et.al. 2410.01762v1 null
2024-10-02 Integrating Protein Sequence and Expression Level to Analysis Molecular Characterization of Breast Cancer Subtypes Hossein Sholehrasa et.al. 2410.01755v1 null
2024-10-02 Unitary Representations of the Isometry Groups of Urysohn Spaces Rémi Barritault et.al. 2410.01725v1 null
2024-10-02 COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation Mingzhen Sun et.al. 2410.01718v1 null
2024-10-02 Rabi oscillations at three-photon laser excitation of a single rubidium Rydberg atom in an optical dipole trap I. I. Beterov et.al. 2410.01703v1 null
2024-09-30 Continuously Improving Mobile Manipulation with Autonomous Real-World RL Russell Mendonca et.al. 2409.20568v1 null
2024-09-30 MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning Haotian Zhang et.al. 2409.20566v1 null
2024-09-30 DressRecon: Freeform 4D Human Reconstruction from Monocular Video Jeff Tan et.al. 2409.20563v1 null
2024-09-30 LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner Xiaopan Zhang et.al. 2409.20560v1 null
2024-09-30 Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos Md Mohaiminul Islam et.al. 2409.20557v1 null
2024-09-30 Inverse Painting: Reconstructing The Painting Process Bowei Chen et.al. 2409.20556v1 null
2024-09-30 UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models Qiaojun Yu et.al. 2409.20551v1 null
2024-09-30 Statistical view of orbital circularisation with 14 000 characterised TESS eclipsing binaries L. W. IJspeert et.al. 2409.20540v1 null
2024-09-30 Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers Lirui Wang et.al. 2409.20537v1 link
2024-09-30 Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images Bahri Batuhan Bilecen et.al. 2409.20530v1 null
2024-09-27 PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation Shaowei Liu et.al. 2409.18964v1 link
2024-09-27 LML: Language Model Learning a Dataset for Data-Augmented Prediction Praneeth Vadlapati et.al. 2409.18957v1 link
2024-09-27 Unconditional stability of a recurrent neural circuit implementing divisive normalization Shivang Rawat et.al. 2409.18946v1 null
2024-09-27 From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding Heqing Zou et.al. 2409.18938v1 null
2024-09-27 Subspace Preserving Quantum Convolutional Neural Network Architectures Léo Monbroussou et.al. 2409.18918v1 null
2024-09-27 Improving Visual Object Tracking through Visual Prompting Shih-Fang Chen et.al. 2409.18901v1 link
2024-09-27 Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors Yunlong Lin et.al. 2409.18899v1 null
2024-09-27 Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models Zehan Li et.al. 2409.18878v1 null
2024-09-27 Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks Richard Osuala et.al. 2409.18872v1 null
2024-09-27 Fusion Systems and Simple Groups With Class Two Sylow $p$-subgroups Martin van Beek et.al. 2409.18870v1 null
2024-09-26 EgoLM: Multi-Modal Language Model of Egocentric Motions Fangzhou Hong et.al. 2409.18127v1 null
2024-09-26 LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness Chenming Zhu et.al. 2409.18125v1 null
2024-09-26 RT-GuIDE: Real-Time Gaussian splatting for Information-Driven Exploration Yuezhan Tao et.al. 2409.18122v1 null
2024-09-26 Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction Justin Kerr et.al. 2409.18121v1 null
2024-09-26 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Ye Liu et.al. 2409.18111v1 link
2024-09-26 MALPOLON: A Framework for Deep Species Distribution Modeling Theo Larcher et.al. 2409.18102v1 null
2024-09-26 Incorporating sparse labels into biologging studies using hidden Markov models with weighted likelihoods Evan Sidrow et.al. 2409.18091v1 null
2024-09-26 Stable Video Portraits Mirela Ostrek et.al. 2409.18083v1 null
2024-09-26 Graded contractions on the orthogonal Lie algebras of dimensions 7 and 8 Cristina Draper et.al. 2409.18069v1 null
2024-09-26 LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field Huan Wang et.al. 2409.18057v1 link
2024-09-25 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Yukun Huang et.al. 2409.17145v1 null
2024-09-25 Streaming Neural Images Marcos V. Conde et.al. 2409.17134v1 null
2024-09-25 Assessing the Level of Toxicity Against Distinct Groups in Bangla Social Media Comments: A Comprehensive Investigation Mukaffi Bin Moin et.al. 2409.17130v1 null
2024-09-25 Classification of Gleason Grading in Prostate Cancer Histopathology Images Using Deep Learning Techniques: YOLO, Vision Transformers, and Vision Mamba Amin Malekmohammadi et.al. 2409.17122v1 link
2024-09-25 Counting Triangles in Triangles Jim Propp et.al. 2409.17117v1 null
2024-09-25 BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices Yongqi Xu et.al. 2409.17093v1 link
2024-09-25 Accumulator-Aware Post-Training Quantization Ian Colbert et.al. 2409.17092v1 null
2024-09-25 Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification Xinrui Zhou et.al. 2409.17091v1 null
2024-09-25 SEN12-WATER: A New Dataset for Hydrological Applications and its Benchmarking Luigi Russo et.al. 2409.17087v1 null
2024-09-25 The Effect of Perceptual Metrics on Music Representation Learning for Genre Classification Tashi Namgyal et.al. 2409.17069v1 null
2024-09-24 Self-Supervised Any-Point Tracking by Contrastive Random Walks Ayush Shrivastava et.al. 2409.16288v1 link
2024-09-24 Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking Xi Wang et.al. 2409.16287v1 null
2024-09-24 Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation Homanga Bharadhwaj et.al. 2409.16283v1 null
2024-09-24 Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation Yong Xien Chng et.al. 2409.16278v1 null
2024-09-24 Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results Marcos V. Conde et.al. 2409.16277v1 null
2024-09-24 CDChat: A Large Multimodal Model for Remote Sensing Change Description Mubashir Noman et.al. 2409.16261v1 link
2024-09-24 Empirically Exploring the Space of Monostationarity in Dual Phosphorylation May Cai et.al. 2409.16234v1 null
2024-09-24 VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection Sunghyun Ahn et.al. 2409.16225v1 link
2024-09-24 Upper-body free-breathing Magnetic Resonance Fingerprinting applied to the quantification of water T1 and fat fraction Constantin Slioussarenko et.al. 2409.16200v1 null
2024-09-24 Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking Jun Bai et.al. 2409.16198v1 null
2024-09-20 Gender Representation and Bias in Indian Civil Service Mock Interviews Somonnoy Banerjee et.al. 2409.12194v3 null
2024-09-18 DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control Zichen Jeff Cui et.al. 2409.12192v1 null
2024-09-18 Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Peng Wang et.al. 2409.12191v1 link
2024-09-18 multiPI-TransBTS: A Multi-Path Learning Framework for Brain Tumor Image Segmentation Based on Multi-Physical Information Hongjun Zhu et.al. 2409.12167v1 link
2024-09-18 JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation Sai Tanmay Reddy Chakkera et.al. 2409.12156v1 null
2024-09-18 Autopet III challenge: Incorporating anatomical knowledge into nnUNet for lesion segmentation in PET/CT Hamza Kalisch et.al. 2409.12155v1 link
2024-09-18 MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion Kalakonda Sai Shashank et.al. 2409.12140v1 null
2024-09-18 Mirages in the Energy Landscape of Soft Sphere Packings Praharsh Suryadevara et.al. 2409.12113v1 null
2024-09-18 SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba Xiangning Zhang et.al. 2409.12108v1 null
2024-09-18 Unveiling the Secrets of New Physics Through Top Quark Tagging Rameswar Sahu et.al. 2409.12085v1 null
2024-09-17 Systematic analysis of Parity-Violating modes Hong-Ming Zhu et.al. 2409.11400v1 null
2024-09-17 Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements Jipeng Yan et.al. 2409.11391v1 null
2024-09-17 Normalization in Proportional Feature Spaces Alexandre Benatti et.al. 2409.11389v1 null
2024-09-17 Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification Fatema-E- Jannat et.al. 2409.11375v1 null
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373v1 null
2024-09-17 Compact Implicit Neural Representations for Plane Wave Images Mathilde Monvoisin et.al. 2409.11370v1 null
2024-09-17 OSV: One Step is Enough for High-Quality Image to Video Generation Xiaofeng Mao et.al. 2409.11367v1 null
2024-09-17 THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models Mengfei Liang et.al. 2409.11353v1 null
2024-09-17 CLIP Adaptation by Intra-modal Overlap Reduction Alexey Kravets et.al. 2409.11338v1 null
2024-09-17 LPT++: Efficient Training on Mixture of Long-tailed Experts Bowen Dong et.al. 2409.11323v1 null
2024-09-16 Enhancing Video Transmission with Machine Learning based Routing in Software-Defined Networks Anıl Dursun İpek et.al. 2409.10512v1 null
2024-09-16 Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance Simone Maurizio La Cava et.al. 2409.10481v1 null
2024-09-16 Real-Time Whole-Body Control of Legged Robots with Model-Predictive Path Integral Control Juan Alvarez-Padilla et.al. 2409.10469v1 null
2024-09-16 Assortativity in sympatric speciation and species classification Joao U. F. Lizarraga et.al. 2409.10466v1 null
2024-09-16 Kolmogorov-Arnold Networks in Low-Data Regimes: A Comparative Study with Multilayer Perceptrons Farhad Pourkamali-Anaraki et.al. 2409.10463v1 null
2024-09-16 Deep-Wide Learning Assistance for Insect Pest Classification Toan Nguyen et.al. 2409.10445v1 link
2024-09-16 A point process approach for the classification of noisy calcium imaging data Arianna Burzacchi et.al. 2409.10409v1 null
2024-09-16 MOST: MR reconstruction Optimization for multiple downStream Tasks via continual learning Hwihun Jeong et.al. 2409.10394v1 link
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362v1 null
2024-09-16 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? Téo Guichoux et.al. 2409.10357v1 null
2024-09-13 An Efficient and Streaming Audio Visual Active Speaker Detection System Arnav Kundu et.al. 2409.09018v1 null
2024-09-13 Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation Qingwen Bu et.al. 2409.09016v1 link
2024-09-13 Model-independent variable selection via the rule-based variable priorit Min Lu et.al. 2409.09003v1 null
2024-09-13 Biomimetic Frontend for Differentiable Audio Processing Ruolan Leslie Famularo et.al. 2409.08997v1 link
2024-09-13 Comparative Analysis of Pretrained Audio Representations in Music Recommender Systems Yan-Martin Tamm et.al. 2409.08987v1 link
2024-09-13 Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph Samuel Fernández-Menduiña et.al. 2409.08970v1 null
2024-09-13 Pushing the boundaries of event subsampling in event-based video classification using CNNs Hesam Araghi et.al. 2409.08953v1 link
2024-09-13 Pushing Joint Image Denoising and Classification to the Edge Thomas C Markhorst et.al. 2409.08943v1 null
2024-09-13 LLM-based Weak Supervision Framework for Query Intent Classification in Video Search Farnoosh Javadi et.al. 2409.08931v1 null
2024-09-13 Classification of electronic structures and state preparation for quantum computation of reaction chemistry Maximilian Mörchen et.al. 2409.08910v1 null
2024-09-12 Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor Andrea Conti et.al. 2409.08277v1 null
2024-09-12 Hand-Object Interaction Pretraining from Videos Himanshu Gaurav Singh et.al. 2409.08273v1 null
2024-09-12 DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer Runjia Li et.al. 2409.08271v1 null
2024-09-12 OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering Jiahao Nick Li et.al. 2409.08250v1 null
2024-09-12 A review of compact geodesic orbit manifolds and the g.o. condition for $\SU(5)/\s(\U(2)\times \U(2))$ Andreas Arvanitoyeorgos et.al. 2409.08247v1 null
2024-09-12 Model Ensemble for Brain Tumor Segmentation in Magnetic Resonance Imaging Daniel Capellán-Martín et.al. 2409.08232v1 null
2024-09-12 CliquePH: Higher-Order Information for Graph Neural Networks through Persistent Homology on Clique Graphs Davide Buffelli et.al. 2409.08217v1 null
2024-09-12 LT3SD: Latent Trees for 3D Scene Diffusion Quan Meng et.al. 2409.08215v1 null
2024-09-12 Gaussian Garments: Reconstructing Simulation-Ready Clothing with Photorealistic Appearance from Multi-View Video Boxiang Rong et.al. 2409.08189v1 null
2024-09-13 Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification Soufiyan Bahadi et.al. 2409.08188v2 null
2024-09-11 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Haibo Yang et.al. 2409.07452v1 link
2024-09-11 VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos Yan-Bo Lin et.al. 2409.07450v1 null
2024-09-11 Autonomous loading of ore piles with Load-Haul-Dump machines using Deep Reinforcement Learning Rodrigo Salas et.al. 2409.07449v1 null
2024-09-11 StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos Sijie Zhao et.al. 2409.07447v1 null
2024-09-11 Deep Neural Network-Based Sign Language Recognition: A Comprehensive Approach Using Transfer Learning with Explainability A. E. M Ridwan et.al. 2409.07426v1 null
2024-09-11 Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy Somayeh Pakdelmoez et.al. 2409.07422v1 null
2024-09-11 Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging Yunzhen Wang et.al. 2409.07417v1 null
2024-09-11 NVRC: Neural Video Representation Compression Ho Man Kwan et.al. 2409.07414v1 null
2024-09-12 Robust Robot Walker: Learning Agile Locomotion over Tiny Traps Shaoting Zhu et.al. 2409.07409v2 null
2024-09-11 Revisiting Static Feature-Based Android Malware Detection Md Tanvirul Alam et.al. 2409.07397v1 null
2024-09-10 A study on Deep Convolutional Neural Networks, Transfer Learning and Ensemble Model for Breast Cancer Detection Md Taimur Ahad et.al. 2409.06699v1 null
2024-09-10 DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images Taslim Murad et.al. 2409.06694v1 null
2024-09-10 Benchmarking Sub-Genre Classification For Mainstage Dance Music Hongzhi Shu et.al. 2409.06690v1 null
2024-09-10 A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network Md Taimur Ahad et.al. 2409.06689v1 null
2024-09-10 A study on deep feature extraction to detect and classify Acute Lymphoblastic Leukemia (ALL) Sabit Ahamed Preanto et.al. 2409.06687v1 null
2024-09-10 Constructing an Interpretable Deep Denoiser by Unrolling Graph Laplacian Regularizer Seyed Alireza Hosseini et.al. 2409.06676v1 null
2024-09-10 Bulk and atmospheric metallicities as direct probes of sequentially varying accretion mechanisms of gas and solids onto planets Yasuhiro Hasegawa et.al. 2409.06670v1 null
2024-09-10 Data Collection-free Masked Video Modeling Yuchi Ishikawa et.al. 2409.06665v1 null
2024-09-10 World-Grounded Human Motion Recovery via Gravity-View Coordinates Zehong Shen et.al. 2409.06662v1 null
2024-09-10 Classifying Functions via growth rates of repeated iterations Titus Hilberdink et.al. 2409.06661v1 null
2024-09-09 Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments Haritheja Etukuru et.al. 2409.05865v1 null
2024-09-09 Neural MP: A Generalist Neural Motion Planner Murtaza Dalal et.al. 2409.05864v1 null
2024-09-09 LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation Henghui Ding et.al. 2409.05847v1 null
2024-09-10 Finite-size topological phases from semimetals Adipta Pal et.al. 2409.05842v2 null
2024-09-09 Fast Generation of Custom Floating-Point Spatial Filters on FPGAs Nelson Campos et.al. 2409.05837v1 null
2024-09-09 Limits on the computational expressivity of non-equilibrium biophysical processes Carlos Floyd et.al. 2409.05827v1 null
2024-09-09 A Flexible Framework for Universal Computational Aberration Correction via Automatic Lens Library Generation and Domain Adaptation Qi Jiang et.al. 2409.05809v1 null
2024-09-09 A CLIP-based siamese approach for meme classification Javier Huertas-Tato et.al. 2409.05772v1 null
2024-09-09 Consensus-based Distributed Quantum Kernel Learning for Speech Recognition Kuan-Cheng Chen et.al. 2409.05770v1 null
2024-09-09 A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR Giovanni Morrone et.al. 2409.05750v1 null
2024-09-06 Synergy and Synchrony in Couple Dances Vongani Maluleke et.al. 2409.04440v1 null
2024-09-06 VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation Yecheng Wu et.al. 2409.04429v1 null
2024-09-06 Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques Davide Clode da Silva et.al. 2409.04424v1 null
2024-09-06 Virtual Reality-Based Preoperative Planning for Optimized Trocar Placement in Thoracic Surgery: A Preliminary Study Arash Harirpoush et.al. 2409.04414v1 null
2024-09-06 Quantum Kernel Methods under Scrutiny: A Benchmarking Study Jan Schnabel et.al. 2409.04406v1 null
2024-09-09 Question-Answering Dense Video Events Hangyu Qin et.al. 2409.04388v2 null
2024-09-06 Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior Charlesquin Kemajou Mbakam et.al. 2409.04384v1 null
2024-09-06 Enhancing Skin Lesion Diagnosis with Ensemble Learning Xiaoyi Liu et.al. 2409.04381v1 null
2024-09-06 Tykhyy's Conjecture on finite mapping class group orbits Samuel Bronstein et.al. 2409.04379v1 null
2024-09-06 The Impact of Scanner Domain Shift on Deep Learning Performance in Medical Imaging: an Experimental Study Gregory Szumel et.al. 2409.04368v1 null
2024-09-05 Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Yunze Man et.al. 2409.03757v1 link
2024-09-05 Dynamics of Supervised and Reinforcement Learning in the Non-Linear Perceptron Christian Schmid et.al. 2409.03749v1 null
2024-09-05 Orbital Support and Evolution of CX/OX Structures in Boxy/Peanut Bars Behzad Tahmasebzadeh et.al. 2409.03746v1 null
2024-09-05 Libra: Architectural Support For Principled, Secure And Efficient Balanced Execution On High-End Processors (Extended Version) Hans Winderix et.al. 2409.03743v1 null
2024-09-05 Classification and Prediction of Heart Diseases using Machine Learning Algorithms Akua Sekyiwaa Osei-Nkwantabisa et.al. 2409.03697v1 null
2024-09-05 View-Invariant Policy Learning via Zero-Shot Novel View Synthesis Stephen Tian et.al. 2409.03685v1 null
2024-09-05 Threat Classification on Deployed Optical Networks Using MIMO Digital Fiber Sensing, Wavelets, and Machine Learning Khouloud Abdelli et.al. 2409.03667v1 null
2024-09-05 Limited but consistent gains in adversarial robustness by co-training object recognition models with human EEG Manshan Guo et.al. 2409.03646v1 null
2024-09-05 Variance reduction in Texas hold'em and in video poker Stewart N. Ethier et.al. 2409.03607v1 null
2024-09-05 SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing Lingyu Xiong et.al. 2409.03605v1 null
2024-09-04 SITAR: Semi-supervised Image Transformer for Action Recognition Owais Iqbal et.al. 2409.02910v1 null
2024-09-04 GraphTrials: Visual Proofs of Graph Properties Henry Förster et.al. 2409.02907v1 null
2024-09-04 Classification of spin-$1/2$ fermionic quantum spin liquids on the trillium lattice Ming-Hao Li et.al. 2409.02898v1 null
2024-09-04 LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture Xidong Wang et.al. 2409.02889v1 link
2024-09-04 CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently Jonathan Zalach et.al. 2409.02885v1 null
2024-09-04 Look Into the LITE in Deep Learning for Time Series Classification Ali Ismail-Fawaz et.al. 2409.02869v1 null
2024-09-04 Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models Zhibin Liu et.al. 2409.02851v1 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838v1 null
2024-09-04 Evolution of radiation profiles in a strongly baffled divertor on MAST Upgrade Fabio Federici et.al. 2409.02837v1 null
2024-09-04 Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models Moein Shahiki Tash et.al. 2409.02836v1 null
2024-08-30 Bridging Episodes and Semantics: A Novel Framework for Long-Form Video Understanding Gueter Josmy Faure et.al. 2408.17443v1 link
2024-08-30 SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists Raoyuan Zhao et.al. 2408.17437v1 link
2024-08-30 CinePreGen: Camera Controllable Video Previsualization via Engine-powered Diffusion Yiran Chen et.al. 2408.17424v1 null
2024-09-03 Open-vocabulary Temporal Action Localization using VLMs Naoki Wake et.al. 2408.17422v2 null
2024-08-30 Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes Li Zhang et.al. 2408.17421v1 link
2024-08-30 End-to-End Learning for Task-Oriented Semantic Communications Over MIMO Channels: An Information-Theoretic Framework Chang Cai et.al. 2408.17397v1 null
2024-08-30 Equivariant isomorphism of Quantum Lens Spaces of low dimension Søren Eilers et.al. 2408.17386v1 null
2024-08-30 LASSO-MOGAT: A Multi-Omics Graph Attention Framework for Cancer Classification Fadi Alharbi et.al. 2408.17384v1 null
2024-08-30 Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain Francesca Grasso et.al. 2408.17362v1 link
2024-08-30 Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method Yuji Lin et.al. 2408.17339v1 null
2024-08-29 SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners Ziyu Guo et.al. 2408.16768v1 link
2024-08-29 ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Fangfu Liu et.al. 2408.16767v1 null
2024-08-29 OmniRe: Omni Urban Scene Reconstruction Ziyu Chen et.al. 2408.16760v1 null
2024-08-29 Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge Beidi Dong et.al. 2408.16749v1 null
2024-08-29 Automatic detection of Mild Cognitive Impairment using high-dimensional acoustic features in spontaneous speech Cong Zhang et.al. 2408.16732v1 null
2024-08-29 VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation Shiwei Wu et.al. 2408.16730v1 null
2024-08-29 Prediction-Feedback DETR for Temporal Action Detection Jihwan Kim et.al. 2408.16729v1 null
2024-08-29 A GREAT Architecture for Edge-Based Graph Problems Like TSP Attila Lischka et.al. 2408.16717v1 null
2024-08-29 One-Shot Learning Meets Depth Diffusion in Multi-Object Videos Anisha Jain et.al. 2408.16704v1 null
2024-08-29 RoboMNIST: A Multimodal Dataset for Multi-Robot Activity Recognition Using WiFi Sensing, Video, and Audio Kian Behzad et.al. 2408.16703v1 null
2024-08-29 Spatio-Temporal Context Prompting for Zero-Shot Action Detection Wei-Jhe Huang et.al. 2408.15996v2 null
2024-08-28 TEDRA: Text-based Editing of Dynamic and Photoreal Actors Basavaraj Sunagad et.al. 2408.15995v1 null
2024-08-28 Minimizing movements solutions for a monotone model of droplet motion Carson Collins et.al. 2408.15984v1 null
2024-08-28 VLT/MUSE detection of accretion-ejection associated with the close stellar companion in the HT Lup system Sebastián Jorquera et.al. 2408.15976v1 null
2024-08-28 1+1d SPT phases with fusion category symmetry: interface modes and non-abelian Thouless pump Kansei Inamura et.al. 2408.15960v1 null
2024-08-28 Generating Binary Species Range Maps Filip Dorm et.al. 2408.15956v1 null
2024-08-28 Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games Nicholas R. Waytowich et.al. 2408.15950v1 null
2024-08-28 Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping Yikang Liu et.al. 2408.15947v1 null
2024-08-28 A latticed total K-theory Qingnan An et.al. 2408.15941v1 null
2024-08-28 Local Descriptors Weighted Adaptive Threshold Filtering For Few-Shot Learning Bingchen Yan et.al. 2408.15924v1 null
2024-08-27 GenRec: Unifying Video Generation and Recognition with Diffusion Models Zejia Weng et.al. 2408.15241v1 null
2024-08-27 Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation Xiaojuan Wang et.al. 2408.15239v1 null
2024-08-27 DCT-CryptoNets: Scaling Private Inference in the Frequency Domain Arjun Roy et.al. 2408.15231v1 null
2024-08-27 SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical Images Zafer Yildiz et.al. 2408.15224v1 link
2024-08-27 Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment Xuan Xu et.al. 2408.15218v1 null
2024-08-27 Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance Weiyi Zhang et.al. 2408.15217v1 null
2024-08-27 Classifying populist language in American presidential and governor speeches using automatic text analysis Olaf van der Veen et.al. 2408.15213v1 null
2024-08-27 Sec2Sec Co-attention for Video-Based Apparent Affective Prediction Mingwei Sun et.al. 2408.15209v1 link
2024-08-27 Automatic 8-tissue Segmentation for 6-month Infant Brains Yilan Dong et.al. 2408.15198v1 null
2024-08-27 Infusing Acoustic Pause Context into Text-Based Dementia Assessment Franziska Braun et.al. 2408.15188v1 null
2024-08-26 Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos Qirui Chen et.al. 2408.14469v1 null
2024-08-26 K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences Zhikai Li et.al. 2408.14468v1 null
2024-08-26 Reconstructing physiological signals from fMRI across the adult lifespan Shiyu Wang et.al. 2408.14453v1 null
2024-08-26 Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition Axel Klawonn et.al. 2408.14442v1 null
2024-08-26 Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification Mahrukh Awan et.al. 2408.14441v1 null
2024-08-26 Radiance Cascades: A Novel High-Resolution Formal Solution for Multidimensional Non-LTE Radiative Transfer Christopher M. J. Osborne et.al. 2408.14425v1 null
2024-08-26 Learning Tree-Structured Composition of Data Augmentation Dongyue Li et.al. 2408.14381v1 link
2024-08-26 Probing Causality Manipulation of Large Language Models Chenyang Zhang et.al. 2408.14380v1 link
2024-08-26 GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy Peiyan Li et.al. 2408.14368v1 null
2024-08-26 An Embedding is Worth a Thousand Noisy Labels Francesco Di Salvo et.al. 2408.14358v1 null
2024-08-23 Ensemble Modeling of Multiple Physical Indicators to Dynamically Phenotype Autism Spectrum Disorder Marie Huynh et.al. 2408.13255v1 null
2024-08-23 Domain-specific long text classification from sparse relevant information Célia D'Cruz et.al. 2408.13253v1 null
2024-08-23 CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities Tao Wu et.al. 2408.13239v1 null
2024-08-23 D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching Jingyu Liu et.al. 2408.13226v1 null
2024-08-23 ResSR: A Residual Approach to Super-Resolving Multispectral Images Haley Duba-Sullivan et.al. 2408.13225v1 null
2024-08-23 EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods Hongcheng Ding et.al. 2408.13214v1 null
2024-08-23 Instruct-DeBERTa: A Hybrid Approach for Aspect-based Sentiment Analysis on Textual Reviews Dineth Jayakody et.al. 2408.13202v1 null
2024-08-23 EAViT: External Attention Vision Transformer for Audio Classification Aquib Iqbal et.al. 2408.13201v1 null
2024-08-23 Deep Learning for Lung Disease Classification Using Transfer Learning and a Customized CNN Architecture with Attention Xiaoyi Liu et.al. 2408.13180v1 null
2024-08-23 Augmented Functional Random Forests: Classifier Construction and Unbiased Functional Principal Components Importance through Ad-Hoc Conditional Permutations Fabrizio Maturo et.al. 2408.13179v1 null
2024-08-22 Automating Deformable Gasket Assembly Simeon Adebola et.al. 2408.12593v1 null
2024-08-22 xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations Can Qin et.al. 2408.12590v1 null
2024-08-22 Real-Time Video Generation with Pyramid Attention Broadcast Xuanlei Zhao et.al. 2408.12588v1 link
2024-08-22 Enhanced Parking Perception by Multi-Task Fisheye Cross-view Transformers Antonyo Musabini et.al. 2408.12575v1 null
2024-08-22 MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi et.al. 2408.12574v1 null
2024-08-22 Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers Sayed Mohammad Vakilzadeh Hatefi et.al. 2408.12568v1 null
2024-08-22 ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation Lujia Zhong et.al. 2408.12561v1 link
2024-08-22 Exploring the Role of Audio in Multimodal Misinformation Detection Moyang Liu et.al. 2408.12558v1 null
2024-08-22 Automatic Organ and Pan-cancer Segmentation in Abdomen CT: the FLARE 2023 Challenge Jun Ma et.al. 2408.12534v1 null
2024-08-22 UMAD: University of Macau Anomaly Detection Benchmark Dataset Dong Li et.al. 2408.12527v1 link
2024-08-21 Great Memory, Shallow Reasoning: Limits of $k$NN-LMs Shangyi Geng et.al. 2408.11815v1 link
2024-08-21 EmbodiedSAM: Online Segment Any 3D Thing in Real Time Xiuwei Xu et.al. 2408.11811v1 null
2024-08-21 Approaching Deep Learning through the Spectral Dynamics of Weights David Yunis et.al. 2408.11804v1 link
2024-08-21 Practical token pruning for foundation models in few-shot conversational virtual assistant systems Haode Qi et.al. 2408.11799v1 null
2024-08-21 Critique-out-Loud Reward Models Zachary Ankner et.al. 2408.11791v1 link
2024-08-21 DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework Zhifei Xie et.al. 2408.11788v1 null
2024-08-21 NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation Zhenye Lou et.al. 2408.11787v1 link
2024-08-21 Timeline and Boundary Guided Diffusion Network for Video Shadow Detection Haipeng Zhou et.al. 2408.11785v1 link
2024-08-21 SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance Zhiqiang Wu et.al. 2408.11760v1 null
2024-08-21 Improving the Scan-rescan Precision of AI-based CMR Biomarker Estimation Dewmini Hasara Wickremasinghe et.al. 2408.11754v1 null
2024-08-20 Discriminant Analysis in stationary time series based on robust cepstral coefficients Jonathan de Souza Matias et.al. 2408.11012v1 null
2024-08-20 Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos Dennis Fedorishin et.al. 2408.10998v1 null
2024-08-20 Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models Hojat Asgariandehkordi et.al. 2408.10987v1 null
2024-08-20 ISLES'24: Improving final infarct prediction in ischemic stroke using multimodal imaging and clinical data Ezequiel de la Rosa et.al. 2408.10966v1 null
2024-08-20 Multichannel Attention Networks with Ensembled Transfer Learning to Recognize Bangla Handwritten Charecter Farhanul Haque et.al. 2408.10955v1 null
2024-08-20 Wave-Mask/Mix: Exploring Wavelet-Based Augmentations for Time Series Forecasting Dona Arabi et.al. 2408.10951v1 link
2024-08-20 Proxona: Leveraging LLM-Driven Personas to Enhance Creators' Understanding of Their Audience Yoonseo Choi et.al. 2408.10937v1 null
2024-08-20 SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement Linlin Hu et.al. 2408.10934v1 null
2024-08-20 ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining Qi Ma et.al. 2408.10906v1 null
2024-08-20 ViLReF: A Chinese Vision-Language Retinal Foundation Model Shengzhu Yang et.al. 2408.10894v1 link
2024-08-19 Some model theory of quadratic geometries Charlotte Kestner et.al. 2408.10196v1 null
2024-08-19 Area under the ROC Curve has the Most Consistent Evaluation for Binary Classification Jing Li et.al. 2408.10193v1 null
2024-08-20 LongVILA: Scaling Long-Context Visual Language Models for Long Videos Fuzhao Xue et.al. 2408.10188v2 link
2024-08-19 SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Anke Tang et.al. 2408.10174v1 link
2024-08-19 Galaxy Zoo: Morphologies based on UKIDSS NIR Imaging for 71,052 Galaxies Karen L. Masters et.al. 2408.10160v1 null
2024-08-19 Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video Shuxian Wang et.al. 2408.10153v1 null
2024-08-19 Biharmonic conformal immersions into a 3-dimensional conformally flat space Ze-Ping Wang et.al. 2408.10144v1 null
2024-08-19 Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images Wei Zhou et.al. 2408.10134v1 null
2024-08-19 UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track Hao Fang et.al. 2408.10129v1 null
2024-08-19 Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track Feiyu Pan et.al. 2408.10125v1 null
2024-08-16 Quantum Annealing for Enhanced Feature Selection in Single-Cell RNA Sequencing Data Analysis Selim Romero et.al. 2408.08867v1 null
2024-08-16 DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models Eman Ali et.al. 2408.08855v1 null
2024-08-16 ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis Yubao Zhao et.al. 2408.08849v1 null
2024-08-16 HistoGym: A Reinforcement Learning Environment for Histopathological Image Analysis Zhi-Bo Liu et.al. 2408.08847v1 link
2024-08-16 LEVIS: Large Exact Verifiable Input Spaces for Neural Networks Mohamad Fares El Hajj Chehade et.al. 2408.08824v1 null
2024-08-16 Optimal Symmetries in Binary Classification Vishal S. Ngairangbam et.al. 2408.08823v1 null
2024-08-16 Leveraging FourierKAN Classification Head for Pre-Trained Transformer-based Text Classification Abdullah Al Imran et.al. 2408.08803v1 null
2024-08-16 Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers Zihang Song et.al. 2408.08794v1 null
2024-08-16 Assessing Generalization Capabilities of Malaria Diagnostic Models from Thin Blood Smears Louise Guillon et.al. 2408.08792v1 null
2024-08-16 A Disease-Specific Foundation Model Using Over 100K Fundus Images: Release and Validation for Abnormality and Multi-Disease Classification on Downstream Tasks Boa Jang et.al. 2408.08790v1 link
2024-08-15 HyperTaxel: Hyper-Resolution for Taxel-Based Tactile Signals Through Contrastive Learning Hongyu Li et.al. 2408.08312v1 null
2024-08-15 Gauge-invariant optical selection rules for excitons Tharindu Fernando et.al. 2408.08311v1 null
2024-08-15 Accelerated Image-Aware Generative Diffusion Modeling Tanmay Asthana et.al. 2408.08306v1 null
2024-08-15 SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training Gengwei Zhang et.al. 2408.08295v1 link
2024-08-15 Marker or Markerless? Mode-Switchable Optical Tactile Sensing for Diverse Robot Tasks Ni Ou et.al. 2408.08276v1 null
2024-08-15 Snuffy: Efficient Whole Slide Image Classifier Hossein Jafarinia et.al. 2408.08258v1 link
2024-08-15 Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective Zixuan Pan et.al. 2408.08228v1 link
2024-08-15 RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Classifiers for Computational Social Science David Farr et.al. 2408.08217v1 null
2024-08-15 Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices Tess Watt et.al. 2408.08215v1 null
2024-08-15 Learned Multimodal Compression for Autonomous Driving Hadi Hadizadeh et.al. 2408.08211v1 null
2024-08-14 End-to-end Semantic-centric Video-based Multimodal Affective Computing Ronghao Lin et.al. 2408.07694v1 null
2024-08-15 A Spitting Image: Modular Superpixel Tokenization in Vision Transformers Marius Aasan et.al. 2408.07680v2 link
2024-08-14 G$^2$V$^2$former: Graph Guided Video Vision Transformer for Face Anti-Spoofing Jingyi Yang et.al. 2408.07675v1 null
2024-08-14 Graph Triple Attention Network: A Decoupled Perspective Xiaotang Wang et.al. 2408.07654v1 link
2024-08-14 Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving Yuqing Wen et.al. 2408.07605v1 null
2024-08-14 Disentangle and denoise: Tackling context misalignment for video moment retrieval Kaijing Ma et.al. 2408.07600v1 null
2024-08-14 Theoretical and Practical Progress in Hyperspectral Pixel Unmixing with Large Spectral Libraries from a Sparse Perspective Jade Preston et.al. 2408.07580v1 null
2024-08-14 TabularBench: Benchmarking Adversarial Robustness for Tabular Deep Learning in Real-world Use-cases Thibault Simonetto et.al. 2408.07579v1 link
2024-08-14 DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model Erez Yosef et.al. 2408.07541v1 null
2024-08-14 Improved 3D Whole Heart Geometry from Sparse CMR Slices Yiyang Xu et.al. 2408.07532v1 link
2024-08-13 On Networks and their Applications: Stability of Gene Regulatory Networks and Gene Function Prediction using Autoencoders Hamza Coban et.al. 2408.07064v1 null
2024-08-13 Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality Yu-Chih Chen et.al. 2408.07041v1 null
2024-08-13 PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology Xiaomin Wu et.al. 2408.07037v1 null
2024-08-13 Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines Samuel Fernández Menduiña et.al. 2408.07028v1 null
2024-08-13 Event-Stream Super Resolution using Sigma-Delta Neural Network Waseem Shariff et.al. 2408.06968v1 null
2024-08-13 DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs Dongyuan Li et.al. 2408.06966v1 null
2024-08-13 OpenResearcher: Unleashing AI for Accelerated Scientific Research Yuxiang Zheng et.al. 2408.06941v1 link
2024-08-13 Diagnosis extraction from unstructured Dutch echocardiogram reports using span- and document-level characteristic classification Bauke Arends et.al. 2408.06930v1 null
2024-08-13 Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries Qi Song et.al. 2408.06901v1 null
2024-08-13 Entendre, a Social Bot Detection Tool for Niche, Fringe, and Extreme Social Media Pranav Venkatesh et.al. 2408.06900v1 null
2024-08-12 Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks Lucas Félix et.al. 2408.06341v1 null
2024-08-12 Moo-ving Beyond Tradition: Revolutionizing Cattle Behavioural Phenotyping with Pose Estimation Techniques Navid Ghassemi et.al. 2408.06336v1 null
2024-08-12 LOLgorithm: Integrating Semantic,Syntactic and Contextual Elements for Humor Classification Tanisha Khurana et.al. 2408.06335v1 null
2024-08-12 From SAM to SAM 2: Exploring Improvements in Meta's Segment Anything Model Athulya Sundaresan Geetha et.al. 2408.06305v1 null
2024-08-12 Sparsity Based Multi-Source Robust 3D Localization Using a Moving Receiver Amir Mansourian et.al. 2408.06274v1 null
2024-08-12 Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance Manuel Milling et.al. 2408.06264v1 null
2024-08-12 Deep Learning System Boundary Testing through Latent Space Style Mixing Amr Abdellatif et.al. 2408.06258v1 null
2024-08-12 Rethinking Video with a Universal Event-Based Representation Andrew Freeman et.al. 2408.06248v1 null
2024-08-12 A Comprehensive Case Study on the Performance of Machine Learning Methods on the Classification of Solar Panel Electroluminescence Images Xinyi Song et.al. 2408.06229v1 link
2024-08-12 ARCADE: An Augmented Reality Display Environment for Multimodal Interaction with Conversational Agents Carolin Schindler et.al. 2408.06222v1 null
2024-08-09 VITA: Towards Open-Source Interactive Omni Multimodal LLM Chaoyou Fu et.al. 2408.05211v1 null
2024-08-09 Kalman-Inspired Feature Propagation for Video Face Super-Resolution Ruicheng Feng et.al. 2408.05205v1 null
2024-08-09 HistoKernel: Whole Slide Image Level Maximum Mean Discrepancy Kernels for Pan-Cancer Predictive Modelling Piotr Keller et.al. 2408.05195v1 link
2024-08-09 Cross-Domain Learning for Video Anomaly Detection with Limited Supervision Yashika Jain et.al. 2408.05191v1 null
2024-08-09 Holomorphic vector fields with real integral manifolds Martin Kolář et.al. 2408.05186v1 null
2024-08-09 MADE-WIC: Multiple Annotated Datasets for Exploring Weaknesses In Code Moritz Mock et.al. 2408.05163v1 null
2024-08-09 Meta-Learning Guided Label Noise Distillation for Robust Signal Modulation Classification Xiaoyang Hao et.al. 2408.05151v1 null
2024-08-09 Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video Chunggi Lee et.al. 2408.05123v1 null
2024-08-09 Cautious Calibration in Binary Classification Mari-Liis Allikivi et.al. 2408.05120v1 null
2024-08-09 Beyond the Eye: A Relational Model for Early Dementia Detection Using Retinal OCTA Images Shouyue Liu et.al. 2408.05117v1 null
2024-08-08 Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics Ruining Li et.al. 2408.04631v1 null
2024-08-08 LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP Danlu Chen et.al. 2408.04628v1 null
2024-08-08 Transformer Explainer: Interactive Learning of Text-Generative Models Aeree Cho et.al. 2408.04619v1 null
2024-08-08 Quantifying the Impact of Population Shift Across Age and Sex for Abdominal Organ Segmentation Kate Čevora et.al. 2408.04610v1 null
2024-08-08 Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes Bhushan Atote et.al. 2408.04606v1 null
2024-08-08 SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation Jieming Yu et.al. 2408.04593v1 null
2024-08-08 Learn To Learn More Precisely Runxi Cheng et.al. 2408.04590v1 null
2024-08-08 SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals Haoran Zheng et.al. 2408.04575v1 null
2024-08-08 Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches Yongzhi Xu et.al. 2408.04567v1 null
2024-08-08 MemeMind at ArAIEval Shared Task: Spotting Persuasive Spans in Arabic Text with Persuasion Techniques Identification Md Rafiul Biswas et.al. 2408.04540v1 null
2024-08-07 How Well Can Vision Language Models See Image Details? Chenhui Gou et.al. 2408.03940v1 null
2024-08-07 Fast Sprite Decomposition from Animated Graphics Tomoyuki Suzuki et.al. 2408.03923v1 null
2024-08-07 FMiFood: Multi-modal Contrastive Learning for Food Image Classification Xinyue Pan et.al. 2408.03922v1 null
2024-08-07 Holomorphic foliations tangent to Rolle-pfaffian hypersurfaces Arturo Fernández-Pérez et.al. 2408.03914v1 null
2024-08-07 AdapMTL: Adaptive Pruning Framework for Multitask Learning Model Mingcan Xiang et.al. 2408.03913v1 null
2024-08-07 Achieving Human Level Competitive Robot Table Tennis David B. D'Ambrosio et.al. 2408.03906v1 null
2024-08-07 Lightweight Video Denoising Using a Classic Bayesian Backbone Clément Bled et.al. 2408.03904v1 null
2024-08-07 Retrieval Augmentation via User Interest Clustering Hanjia Lyu et.al. 2408.03886v1 null
2024-08-07 Global-Local Progressive Integration Network for Blind Image Quality Assessment Xiaoqi Wang et.al. 2408.03885v1 null
2024-08-07 Knowledge Probing for Graph Representation Learning Mingyu Zhao et.al. 2408.03877v1 null
2024-08-06 LLaVA-OneVision: Easy Visual Task Transfer Bo Li et.al. 2408.03326v1 null
2024-08-06 ClassiFIM: An Unsupervised Method To Detect Phase Transitions Victor Kasatkin et.al. 2408.03323v1 null
2024-08-06 Segment Anything in Medical Images and Videos: Benchmark and Deployment Jun Ma et.al. 2408.03322v1 null
2024-08-06 MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation Xiaofeng Mao et.al. 2408.03312v1 null
2024-08-06 Left of Fab: Securing Design and Collaboration in the Semiconductor Value Chain John C. Hoag et.al. 2408.03295v1 null
2024-08-06 Biomedical SAM 2: Segment Anything in Biomedical Images and Videos Zhiling Yan et.al. 2408.03286v1 null
2024-08-06 ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer Jiazhi Guan et.al. 2408.03284v1 null
2024-08-06 Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust et.al. 2408.03274v1 null
2024-08-07 BVI-AOM: A New Training Dataset for Deep Video Compression Optimization Jakub Nawała et.al. 2408.03265v2 null
2024-08-06 Analysis of Partially-Calibrated Sparse Subarrays for Direction Finding with Extended Degrees of Freedom W. S. Leite et.al. 2408.03236v1 null
2024-08-05 Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics Shishira R Maiya et.al. 2408.02672v1 null
2024-08-05 Interactive 3D Medical Image Segmentation with SAM 2 Chuyun Shen et.al. 2408.02635v1 null
2024-08-05 VidGen-1M: A Large-Scale Dataset for Text-to-video Generation Zhiyu Tan et.al. 2408.02629v1 null
2024-08-05 DanModCap: Designing a Danmaku Moderation Tool for Video-Sharing Platforms that Leverages Impact Captions Siying Hu et.al. 2408.02574v1 null
2024-08-05 Cross-Modality Clustering-based Self-Labeling for Multimodal Data Classification Paweł Zyblewski et.al. 2408.02568v1 null
2024-08-05 HQOD: Harmonious Quantization for Object Detection Long Huang et.al. 2408.02561v1 null
2024-08-05 The effect of dynamical states on galaxy clusters populations. I. Classification of dynamical states S. Véliz Astudillo et.al. 2408.02519v1 null
2024-08-05 Automatic rating of incomplete hippocampal inversions evaluated across multiple cohorts Lisa Hemforth et.al. 2408.02496v1 null
2024-08-05 HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions Chiranjeev Chiranjeev et.al. 2408.02494v1 null
2024-08-05 Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection Ting Lei et.al. 2408.02484v1 null
2024-08-02 Conditional LoRA Parameter Generation Xiaolong Jin et.al. 2408.01415v1 null
2024-08-02 Derivation of Back-propagation for Graph Convolutional Networks using Matrix Calculus and its Application to Explainable Artificial Intelligence Yen-Che Hsiao et.al. 2408.01408v1 null
2024-08-02 NOLO: Navigate Only Look Once Bohan Zhou et.al. 2408.01384v1 null
2024-08-02 Explaining a probabilistic prediction on the simplex with Shapley compositions Paul-Gauthier Noé et.al. 2408.01382v1 null
2024-08-02 Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01372v1 null
2024-08-02 Classification of marked elliptic root systems with non-reduced quotient A. Fialowski et.al. 2408.01358v1 null
2024-08-02 Harmonized connectome resampling for variance in voxel sizes Elyssa M. McMaster et.al. 2408.01351v1 null
2024-08-02 Human foraging strategies flexibly adapt to resource distribution and time constraints Valeria Simonelli et.al. 2408.01350v1 null
2024-08-02 PC$^2$: Pseudo-Classification Based Pseudo-Captioning for Noisy Correspondence Learning in Cross-Modal Retrieval Yue Duan et.al. 2408.01349v1 null
2024-08-02 Prompt Refinement or Fine-tuning? Best Practices for using LLMs in Computational Social Science Tasks Anders Giovanni Møller et.al. 2408.01346v1 null
2024-08-01 Text-Guided Video Masked Autoencoder David Fan et.al. 2408.00759v1 null
2024-08-01 Segment anything model 2: an application to 2D and 3D medical images Haoyu Dong et.al. 2408.00756v1 null
2024-08-01 Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model Benlin Liu et.al. 2408.00754v1 null
2024-08-01 CERT-ED: Certifiably Robust Text Classification for Edit Distance Zhuoqun Huang et.al. 2408.00728v1 null
2024-08-01 SAM 2: Segment Anything in Images and Videos Nikhila Ravi et.al. 2408.00714v1 null
2024-08-01 Investigating Brain Connectivity and Regional Statistics from EEG for early stage Parkinson's Classification Amarpal Sahota et.al. 2408.00711v1 null
2024-08-01 Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM Xiaofeng Liu et.al. 2408.00706v1 null
2024-08-01 Granular-Balls based Fuzzy Twin Support Vector Machine for Classification Lixi Zhao et.al. 2408.00699v1 null
2024-08-01 ExpertAF: Expert Actionable Feedback from Video Kumar Ashutosh et.al. 2408.00672v1 null
2024-08-01 AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models Daqin Luo et.al. 2408.00665v1 null
2024-07-31 The Llama 3 Herd of Models Abhimanyu Dubey et.al. 2407.21783v1 null
2024-07-31 RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining Hongtao Wu et.al. 2407.21773v1 null
2024-07-31 ReplanVLM: Replanning Robotic Tasks with Visual Language Models Aoran Mei et.al. 2407.21762v1 null
2024-07-31 Learning Video Context as Interleaved Multimodal Sequences Kevin Qinghong Lin et.al. 2407.21757v1 null
2024-08-01 Topological Woodward-Hoffmann classification for cycloadditions in polycyclic aromatic azomethine ylides Juan Li et.al. 2407.21756v2 null
2024-07-31 A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation Mothilal Asokan et.al. 2407.21739v1 null
2024-07-31 Leveraging Self-Supervised Learning for Fetal Cardiac Planes Classification using Ultrasound Scan Videos Joseph Geo Benjamin et.al. 2407.21738v1 null
2024-07-31 Artificial Intelligence Approaches for Energy Efficiency: A Review Alberto Pasqualetto et.al. 2407.21726v1 null
2024-07-31 Open-Vocabulary Audio-Visual Semantic Segmentation Ruohao Guo et.al. 2407.21721v1 null
2024-07-31 Tora: Trajectory-oriented Diffusion Transformer for Video Generation Zhenghao Zhang et.al. 2407.21705v1 null
2024-07-30 Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation Marcelo Matheus Gauy et.al. 2407.20989v1 null
2024-07-30 Transfer Learning for Multi-material Classification of Transition Metal Dichalcogenides with Atomic Force Microscopy Isaiah A. Moses et.al. 2407.20975v1 null
2024-07-30 MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions Xiaowei Chi et.al. 2407.20962v1 link
2024-07-30 EAR: Edge-Aware Reconstruction of 3-D vertebrae structures from bi-planar X-ray images Lixing Tan et.al. 2407.20937v1 null
2024-07-30 Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering Yanpeng Zhao et.al. 2407.20908v1 link
2024-07-30 Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition Yuancheng Jiang et.al. 2407.20904v1 null
2024-07-30 Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach Adam Wojciechowski et.al. 2407.20899v1 null
2024-07-30 MambaCapsule: Towards Transparent Cardiac Disease Diagnosis with Electrocardiography Using Mamba Capsule Network Yinlong Xu et.al. 2407.20893v1 null
2024-07-30 Shift operators and their classification Maria Carvalho et.al. 2407.20890v1 null
2024-07-30 Effective Black Box Testing of Sentiment Analysis Classification Networks Parsa Karbasizadeh et.al. 2407.20884v1 null
2024-07-29 SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction Çağhan Köksal et.al. 2407.20214v1 null
2024-07-30 SpaER: Learning Spatio-temporal Equivariant Representations for Fetal Brain Motion Tracking Jian Wang et.al. 2407.20198v2 null
2024-07-29 Radiance Fields for Robotic Teleoperation Maximum Wilder-Smith et.al. 2407.20194v1 null
2024-07-29 Theia: Distilling Diverse Vision Foundation Models for Robot Learning Jinghuan Shang et.al. 2407.20179v1 link
2024-07-29 LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework Zhenqi He et.al. 2407.20172v1 link
2024-07-29 Diffusion Feedback Helps CLIP See Better Wenxuan Wang et.al. 2407.20171v1 null
2024-07-29 Language-Conditioned Offline RL for Multi-Robot Navigation Steven Morad et.al. 2407.20164v1 null
2024-07-29 Quantum Machine Learning Architecture Search via Deep Reinforcement Learning Xin Dai et.al. 2407.20147v1 null
2024-07-30 AxiomVision: Accuracy-Guaranteed Adaptive Visual Model Selection for Perspective-Aware Video Analytics Xiangxiang Dai et.al. 2407.20124v2 link
2024-07-29 Integrable and superintegrable quantum mechanical systems with position dependent masses invariant with respect to one parametric Lie groups. 2. Systems with dilatation and shift symmetries A. G. Nikitin et.al. 2407.20112v1 null
2024-07-26 HRP: Human Affordances for Robotic Pre-Training Mohan Kumar Srirama et.al. 2407.18911v1 null
2024-07-26 Wolf: Captioning Everything with a World Summarization Framework Boyi Li et.al. 2407.18908v1 null
2024-07-26 A Scalable Quantum Non-local Neural Network for Image Classification Sparsh Gupta et.al. 2407.18906v1 link
2024-07-26 Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment Yuze Zheng et.al. 2407.18854v1 null
2024-07-26 The Role of Temporal Hierarchy in Spiking Neural Networks Filippo Moro et.al. 2407.18838v1 null
2024-07-26 Learning the Chaotic and Regular Nature of Trajectories in Hamiltonian Systems with Lagrangian descriptors Javier Jiménez López et.al. 2407.18831v1 null
2024-07-26 Binary orbit and disks properties of the RW Aur system using ALMA observations N. T. Kurtovic et.al. 2407.18828v1 null
2024-07-26 Three-dimensional ultrasound-based online system for automated ovarian follicle measurement Pedro Royo et.al. 2407.18818v1 null
2024-07-26 Automatic Detection of Moral Values in Music Lyrics Vjosa Preniqi et.al. 2407.18787v1 null
2024-07-26 Deep learning interpretable analysis for carbon star identification in Gaia DR3 Shuo Ye et.al. 2407.18754v1 null
2024-07-25 Review of Degenerate Higher Order Scalar Tensor Theories in Cosmology Andrei Lazanu et.al. 2407.18234v1 null
2024-07-25 One-point Statistics in various cosmic environments in the presence of massive neutrinos Mohadese Khoshtinat et.al. 2407.18233v1 null
2024-07-26 Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation Razieh Azizi et.al. 2407.18195v2 null
2024-07-25 PianoMime: Learning a Generalist, Dexterous Piano Player from Internet Demonstrations Cheng Qian et.al. 2407.18178v1 null
2024-07-26 On-chip near-infrared spectroscopic sensing with over 520nm bandwidth Chunhui Yao et.al. 2407.18172v2 null
2024-07-25 IRIS: Wireless Ring for Vision-based Smart Home Interaction Maruchi Kim et.al. 2407.18141v1 null
2024-07-25 XS-VID: An Extremely Small Video Object Detection Dataset Jiahao Guo et.al. 2407.18137v1 null
2024-07-25 Estimating Earthquake Magnitude in Sentinel-1 Imagery via Ranking Daniele Rege Cambrin et.al. 2407.18128v1 null
2024-07-25 Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images Roberto Di Via et.al. 2407.18125v1 null
2024-07-25 Multi-Resolution Histopathology Patch Graphs for Ovarian Cancer Subtyping Jack Breen et.al. 2407.18105v1 link
2024-07-24 SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency Yiming Xie et.al. 2407.17470v1 null
2024-07-24 SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning Jianpeng Yao et.al. 2407.17460v1 null
2024-07-24 EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification Joana Reuss et.al. 2407.17458v1 null
2024-07-24 HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Zhenzhi Wang et.al. 2407.17438v1 link
2024-07-24 Systematic study of High $E_J/E_C$ transmon qudits up to $d = 12$ Z. Wang et.al. 2407.17407v1 null
2024-07-24 Self-Calibrated Variance-Stabilizing Transformations for Real-World Image Denoising Sébastien Herbreteau et.al. 2407.17399v1 null
2024-07-24 Sampling-Based Hierarchical Trajectory Planning for Formation Flight Qingzhao Liu et.al. 2407.17392v1 null
2024-07-24 2D and 3D Deep Learning Models for MRI-based Parkinson's Disease Classification: A Comparative Analysis of Convolutional Kolmogorov-Arnold Networks, Convolutional Neural Networks, and Graph Convolutional Networks Salil B Patel et.al. 2407.17380v1 null
2024-07-24 Entropy Reweighted Conformal Classification Rui Luo et.al. 2407.17377v1 null
2024-07-24 MuST: Multi-Scale Transformers for Surgical Phase Recognition Alejandra Pérez et.al. 2407.17361v1 link
2024-07-23 Explanation Regularisation through the Lens of Attributions Pedro Ferreira et.al. 2407.16693v1 null
2024-07-23 On the local cohomology of secant varieties Sebastian Olano et.al. 2407.16688v1 null
2024-07-23 AutoRG-Brain: Grounded Report Generation for Brain MRI Jiayu Lei et.al. 2407.16684v1 null
2024-07-24 Goedel logics: Prenex fragments Matthias Baaz et.al. 2407.16683v2 null
2024-07-24 A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data Adrian Remonda et.al. 2407.16680v2 link
2024-07-23 From Imitation to Refinement -- Residual RL for Precise Visual Assembly Lars Ankile et.al. 2407.16677v1 null
2024-07-23 FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process Yuyan Bu et.al. 2407.16670v1 null
2024-07-23 EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval Thomas Hummel et.al. 2407.16658v1 link
2024-07-23 Fluorescence Diffraction Tomography using Explicit Neural Fields Renzhi He et.al. 2407.16657v1 null
2024-07-23 MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence Canyu Zhao et.al. 2407.16655v1 null
2024-07-22 AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description Junyu Xie et.al. 2407.15850v1 link
2024-07-22 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Mingze Xu et.al. 2407.15841v1 null
2024-07-23 QueST: Self-Supervised Skill Abstractions for Learning Continuous Control Atharva Mete et.al. 2407.15840v2 null
2024-07-22 Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator Florian Robert et.al. 2407.15817v1 null
2024-07-22 Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning Zhecheng Yuan et.al. 2407.15815v1 null
2024-07-22 The Evaporating Massive Embedded Stellar Cluster IRS 13 Close to Sgr A. II. Kinematic structure* Florian Peißker et.al. 2407.15800v1 null
2024-07-22 Adaptive Extensions of Unbiased Risk Estimators for Unsupervised Magnetic Resonance Image Denoising Reeshad Khan et.al. 2407.15799v1 null
2024-07-23 Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video Guiqiu Liao et.al. 2407.15794v2 null
2024-07-22 LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding Haoning Wu et.al. 2407.15754v1 link
2024-07-22 SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection Dimitrios Kollias et.al. 2407.15728v1 null
2024-07-19 DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks Sarah Jabbour et.al. 2407.14509v1 null
2024-07-19 T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation Kaiyue Sun et.al. 2407.14505v1 null
2024-07-19 Nonlinear Schrödinger Network Yiming Zhou et.al. 2407.14504v1 null
2024-07-19 Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery Sukrut Rao et.al. 2407.14499v1 link
2024-07-19 Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation Dongyang Wu et.al. 2407.14498v1 null
2024-07-19 Evaluating the Reliability of Self-Explanations in Large Language Models Korbinian Randl et.al. 2407.14487v1 link
2024-07-19 Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model Seonghui Min et.al. 2407.14434v1 null
2024-07-19 Dataset Distillation in Medical Imaging: A Feasibility Study Muyang Li et.al. 2407.14429v1 null
2024-07-19 Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models Hyun-Jic Oh et.al. 2407.14426v1 null
2024-07-19 Improving classification of road surface conditions via road area extraction and contrastive learning Linh Trinh et.al. 2407.14418v1 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772v1 null
2024-07-18 Addressing Imbalance for Class Incremental Learning in Medical Image Classification Xuze Hao et.al. 2407.13768v1 null
2024-07-18 Shape of Motion: 4D Reconstruction from a Single Video Qianqian Wang et.al. 2407.13764v1 null
2024-07-18 Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion Boyang Deng et.al. 2407.13759v1 null
2024-07-18 Exploring Facial Biomarkers for Depression through Temporal Analysis of Action Units Aditya Parikh et.al. 2407.13753v1 null
2024-07-18 Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management Yoontae Hwang et.al. 2407.13751v1 null
2024-07-18 Pose-guided multi-task video transformer for driver action recognition Ricardo Pizarro et.al. 2407.13750v1 null
2024-07-18 Multi-Label Learning with Stronger Consistency Guarantees Anqi Mao et.al. 2407.13746v1 null
2024-07-18 Realizable $H$-Consistent and Bayes-Consistent Loss Functions for Learning to Defer Anqi Mao et.al. 2407.13732v1 null
2024-07-18 Enhanced $H$-Consistency Bounds Anqi Mao et.al. 2407.13722v1 null
2024-07-17 VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control Sherwin Bahmani et.al. 2407.12781v1 null
2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780v1 null
2024-07-17 LookupViT: Compressing visual information to a limited number of tokens Rajat Koner et.al. 2407.12753v1 null
2024-07-17 4Dynamic: Text-to-4D Generation with Hybrid Priors Yu-Jie Yuan et.al. 2407.12684v1 null
2024-07-17 Goldfish: Vision-Language Understanding of Arbitrarily Long Videos Kirolos Ataallah et.al. 2407.12679v1 null
2024-07-17 Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs Yiqing Shen et.al. 2407.12678v1 null
2024-07-17 CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems Jiankun Zhao et.al. 2407.12676v1 link
2024-07-17 Distilling Tiny and Ultra-fast Deep Neural Networks for Autonomous Navigation on Nano-UAVs Lorenzo Lamberti et.al. 2407.12675v1 null
2024-07-17 Enhancing the Utility of Privacy-Preserving Cancer Classification using Synthetic Data Richard Osuala et.al. 2407.12669v1 null
2024-07-17 Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge Andrea Albanese et.al. 2407.12663v1 null
2024-07-16 Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling Jaehyeok Kim et.al. 2407.11962v1 null
2024-07-16 A Transformer-based Approach for Augmenting Software Engineering Chatbots Datasets Ahmad Abdellatif et.al. 2407.11955v1 null
2024-07-16 Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation Olga Zatsarynna et.al. 2407.11954v1 null
2024-07-16 Temporally Consistent Stereo Matching Jiaxi Zeng et.al. 2407.11950v1 link
2024-07-17 Hierarchical Separable Video Transformer for Snapshot Compressive Imaging Ping Wang et.al. 2407.11946v2 link
2024-07-16 Tackling Oversmoothing in GNN via Graph Sparsification: A Truss-based Approach Tanvir Hossain et.al. 2407.11928v1 null
2024-07-16 The Strength of Bisymmetric Modes in SDSS-IV/MaNGA Barred Galaxy Kinematics Brian DiGiorgio Zanger et.al. 2407.11908v1 null
2024-07-16 GraphFM: A Scalable Framework for Multi-Graph Pretraining Divyansha Lachi et.al. 2407.11907v1 null
2024-07-16 SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge Hao Ding et.al. 2407.11906v1 null
2024-07-16 Automated production of batched unclonable micro-patterns anti-counterfeiting labels with strong robustness and rapid recognition speed Yuzheng He et.al. 2407.11886v1 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964v1 link
2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958v1 null
2024-07-15 MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models Chengguang Gan et.al. 2407.10953v1 null
2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937v1 link
2024-07-15 Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together Dilara Soylu et.al. 2407.10930v1 null
2024-07-15 In-Loop Filtering via Trained Look-Up Tables Zhuoyuan Li et.al. 2407.10926v1 null
2024-07-15 A Dual-Attention Aware Deep Convolutional Neural Network for Early Alzheimer's Detection Pandiyaraju V et.al. 2407.10921v1 null
2024-07-16 DataDream: Few-shot Guided Dataset Generation Jae Myung Kim et.al. 2407.10910v2 link
2024-07-15 Interpreting Hand gestures using Object Detection and Digits Classification Sangeetha K et.al. 2407.10902v1 null
2024-07-15 Leveraging Multimodal CycleGAN for the Generation of Anatomically Accurate Synthetic CT Scans from MRIs Leonardo Crespi et.al. 2407.10888v1 null
2024-07-12 Non-Hermitian Origin of Wannier Localizability and Detachable Topological Boundary States Daichi Nakamura et.al. 2407.09458v1 null
2024-07-12 Let Me DeCode You: Decoder Conditioning with Tabular Data Tomasz Szczepański et.al. 2407.09437v1 link
2024-07-12 Rethinking temporal self-similarity for repetitive action counting Yanan Luo et.al. 2407.09431v1 null
2024-07-12 TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models Hang Zou et.al. 2407.09424v1 null
2024-07-12 A grid of self-consistent MSG (MARCS-StaticWeather-GGchem) cool stellar, sub-stellar, and exoplanetary model atmospheres Uffe G. Jørgensen et.al. 2407.09397v1 null
2024-07-12 Open-Canopy: A Country-Scale Benchmark for Canopy Height Estimation at Very High Resolution Fajwel Fogel et.al. 2407.09392v1 link
2024-07-12 Radiance Fields from Photons Sacha Jungerman et.al. 2407.09386v1 null
2024-07-12 Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation Zhilin Zhu et.al. 2407.09367v1 link
2024-07-12 Novel clustered federated learning based on local loss Endong Gu et.al. 2407.09360v1 link
2024-07-12 Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems Ziyuan Luo et.al. 2407.09352v1 null
2024-07-11 Video Diffusion Alignment via Reward Gradients Mihir Prabhudesai et.al. 2407.08737v1 link
2024-07-11 Real-Time Anomaly Detection and Reactive Planning with Large Language Models Rohan Sinha et.al. 2407.08735v1 null
2024-07-11 WhisperNetV2: SlowFast Siamese Network For Lip-Based Biometrics Abdollah Zakeri et.al. 2407.08717v1 null
2024-07-11 Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices Dina Hussein et.al. 2407.08715v1 null
2024-07-11 Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware James Seekings et.al. 2407.08704v1 null
2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701v1 null
2024-07-11 ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions Jiu Feng et.al. 2407.08691v1 link
2024-07-11 Generalizable Implicit Motion Modeling for Video Frame Interpolation Zujin Guo et.al. 2407.08680v1 null
2024-07-11 Still-Moving: Customized Video Generation without Customized Video Data Hila Chefer et.al. 2407.08674v1 null
2024-07-11 NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning Yi Zhang et.al. 2407.08672v1 null
2024-07-10 LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models Feng Li et.al. 2407.07895v1 link
2024-07-10 Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation Tao Chen et.al. 2407.07884v1 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860v1 null
2024-07-11 Functional Assessment of Cerebral Capillaries using Single Capillary Reporters in Ultrasound Localization Microscopy Stephen A Lee et.al. 2407.07857v2 null
2024-07-10 Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification Mei Qiu et.al. 2407.07842v1 null
2024-07-10 Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective Shengjia Chen et.al. 2407.07841v1 link
2024-07-10 Probe and Prejudice: Classification of compact objects and model comparison using EOS knowledge Hauke Koehn et.al. 2407.07837v1 null
2024-07-10 RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement Honglie Chen et.al. 2407.07825v1 null
2024-07-10 New Gravitational Wave Discoveries Enabled by Machine Learning Alexandra E. Koloniari et.al. 2407.07820v1 null
2024-07-10 The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others Daniel Sikar et.al. 2407.07818v1 null
2024-07-09 V-VIPE: Variational View Invariant Pose Embedding Mara Levy et.al. 2407.07092v1 null
2024-07-09 Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic Ruochen Jin et.al. 2407.07089v1 link
2024-07-09 MoSt-DSA: Modeling Motion and Structural Interactions for Direct Multi-Frame Interpolation in DSA Images Ziyang Xu et.al. 2407.07078v1 link
2024-07-09 MADE-for-ASD: A Multi-Atlas Deep Ensemble Network for Diagnosing Autism Spectrum Disorder Md Rakibul Hasan et.al. 2407.07076v1 null
2024-07-10 CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement Wei Wang et.al. 2407.07056v2 null
2024-07-09 Latent Space Imaging Matheus Souza et.al. 2407.07052v1 null
2024-07-09 Simple and Interpretable Probabilistic Classifiers for Knowledge Graphs Christian Riefolo et.al. 2407.07045v1 null
2024-07-09 Free Fermionic Constructions of Heterotic Strings Ioannis Florakis et.al. 2407.07034v1 null
2024-07-09 Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition Daiqing Wu et.al. 2407.07026v1 null
2024-07-09 Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization Jeongseok Hyun et.al. 2407.07024v1 link
2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189v1 link
2024-07-08 Classification of Cellular Automata based on the Hamming distance Gaspar Alfaro et.al. 2407.06175v1 null
2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174v1 null
2024-07-08 PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes Mohammad Reza Karimi Dastjerdi et.al. 2407.06150v1 null
2024-07-08 Physics-informed machine learning approaches to reactor antineutrino detection Sophia Farrell et.al. 2407.06139v1 null
2024-07-08 Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities Avinash Anand et.al. 2407.06125v1 null
2024-07-08 Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation Xinyu Bai et.al. 2407.06095v1 null
2024-07-08 ERR@HRI 2024 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Interactions Micol Spitale et.al. 2407.06094v1 null
2024-07-08 Artificial Intuition: Efficient Classification of Scientific Abstracts Harsh Sakhrani et.al. 2407.06093v1 null
2024-07-08 Assessing Cardiomegaly in Dogs Using a Simple CNN Model Nikhil Deekonda et.al. 2407.06092v1 null
2024-07-05 VCoME: Verbal Video Composition with Multimodal Editing Effects Weibo Gong et.al. 2407.04697v1 null
2024-07-05 Enhancing Vehicle Re-identification and Matching for Weaving Analysis Mei Qiu et.al. 2407.04688v1 null
2024-07-05 Embracing Massive Medical Data Yu-Cheng Chou et.al. 2407.04687v1 link
2024-07-05 Is plantar thermography a valid digital biomarker for characterising diabetic foot ulceration risk? Akshay Jagadeesh et.al. 2407.04676v1 null
2024-07-05 AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Yuhan Zhu et.al. 2407.04603v1 null
2024-07-05 Multimodal Classification via Modal-Aware Interactive Enhancement Qing-Yuan Jiang et.al. 2407.04587v1 null
2024-07-05 A Degree Bound for Planar Functions Christof Beierle et.al. 2407.04570v1 null
2024-07-05 Pencils of plane cubics with one base point Riccardo Moschetti et.al. 2407.04569v1 null
2024-07-05 Anticipating Solar Flares Hugh S. Hudson et.al. 2407.04567v1 null
2024-07-05 Real Time Emotion Analysis Using Deep Learning for Education, Entertainment, and Beyond Abhilash Khuntia et.al. 2407.04560v1 null
2024-07-03 InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Pan Zhang et.al. 2407.03320v1 link
2024-07-03 Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations Trevor Ablett et.al. 2407.03311v1 link
2024-07-03 Accelerated Proton Resonance Frequency-based Magnetic Resonance Thermometry by Optimized Deep Learning Method Sijie Xu et.al. 2407.03308v1 link
2024-07-03 HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization Yucheng Tang et.al. 2407.03307v1 null
2024-07-03 VCHAR:Variance-Driven Complex Human Activity Recognition framework with Generative Representation Yuan Sun et.al. 2407.03291v1 null
2024-07-03 Using Photoplethysmography to Detect Real-time Blood Pressure Changes with a Calibration-free Deep Learning Model Jingyuan Hong et.al. 2407.03274v1 null
2024-07-03 Modern Neighborhood Components Analysis: A Deep Tabular Baseline Two Decades Later Han-Jia Ye et.al. 2407.03257v1 link
2024-07-03 STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data Kheir Eddine Daouadi et.al. 2407.03253v1 null
2024-07-03 ACTRESS: Active Retraining for Semi-supervised Visual Grounding Weitai Kang et.al. 2407.03251v1 null
2024-07-04 TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach Weikun Peng et.al. 2407.03245v2 null
2024-07-02 Characterizing the Interpretability of Attention Maps in Digital Pathology Tomé Albuquerque et.al. 2407.02484v1 null
2024-07-02 Ensemble of pre-trained language models and data augmentation for hate speech detection from Arabic tweets Kheir Eddine Daouadi et.al. 2407.02448v1 null
2024-07-02 PLeaS -- Merging Models with Permutations and Least Squares Anshul Nasery et.al. 2407.02447v1 null
2024-07-02 Evaluating the Robustness of Adverse Drug Event Classification Models Using Templates Dorothea MacPhail et.al. 2407.02432v1 null
2024-07-02 AXIAL: Attention-based eXplainability for Interpretable Alzheimer's Localized Diagnosis using 2D CNNs on 3D MRI brain scans Gabriele Lozupone et.al. 2407.02418v1 link
2024-07-03 Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs Jinmin Li et.al. 2407.02411v2 null
2024-07-02 Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones Lorenzo Lamberti et.al. 2407.02405v1 null
2024-07-03 A neural networks method to search for long transient gravitational waves Francesca Attadio et.al. 2407.02391v2 null
2024-07-02 Real HSI-MSI-PAN image dataset for the hyperspectral/multi-spectral/panchromatic image fusion and super-resolution fields Shuangliang Li et.al. 2407.02387v1 link
2024-07-02 OpenSlot: Mixed Open-set Recognition with Object-centric Learning Xu Yin et.al. 2407.02386v1 null
2024-06-28 Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs Sukmin Yun et.al. 2406.20098v1 link
2024-06-28 LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression Jieneng Chen et.al. 2406.20092v1 link
2024-06-28 Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints Arnab Auddy et.al. 2406.20088v1 null
2024-06-28 Extreme horizon equation Wojciech Kamiński et.al. 2406.20068v1 null
2024-06-28 Modeling and LQR Control of Insect Sized Flapping Wing Robot Daksh Dhingra et.al. 2406.20061v1 null
2024-06-28 Pairwise Difference Learning for Classification Mohamed Karim Belaid et.al. 2406.20031v1 link
2024-06-28 On the Trade-off between Flatness and Optimization in Distributed Learning Ying Cao et.al. 2406.20006v1 null
2024-06-28 Malaria Cell Detection Using Deep Neural Networks Saurabh Sawant et.al. 2406.20005v1 null
2024-06-28 Impact of Initialization on Intra-subject Pediatric Brain MR Image Registration: A Comparative Analysis between SyN ANTs and Deep Learning-Based Approaches Andjela Dimitrijevic et.al. 2406.19943v1 link
2024-07-01 GRACE: Graph-Regularized Attentive Convolutional Entanglement with Laplacian Smoothing for Robust DeepFake Video Detection Chih-Chung Hsu et.al. 2406.19941v2 link
2024-06-27 ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos Jr-Jen Chen et.al. 2406.19392v1 link
2024-06-27 Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads Ali Khaleghi Rahimian et.al. 2406.19391v1 link
2024-06-27 OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding Tao Zhang et.al. 2406.19389v1 null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369v1 null
2024-06-27 IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Lucky Susanto et.al. 2406.19349v1 null
2024-06-27 Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation Yushun Tang et.al. 2406.19341v1 null
2024-06-28 LiverUSRecon: Automatic 3D Reconstruction and Volumetry of the Liver with a Few Partial Ultrasound Scans Kaushalya Sivayogaraj et.al. 2406.19336v2 null
2024-06-27 PNeRV: A Polynomial Neural Representation for Videos Sonam Gupta et.al. 2406.19299v1 null
2024-06-27 Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph Transformers Jinsong Chen et.al. 2406.19258v1 null
2024-06-27 Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment Hao Fei et.al. 2406.19255v1 null
2024-06-26 Towards Compositionality in Concept Learning Adam Stein et.al. 2406.18534v1 link
2024-06-26 MatchTime: Towards Automatic Soccer Game Commentary Generation Jiayuan Rao et.al. 2406.18530v1 null
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524v1 null
2024-06-26 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Shenghai Yuan et.al. 2406.18522v1 null
2024-06-27 Distinguishing mechanisms of social contagion from local network view Elsa Andres et.al. 2406.18519v2 null
2024-06-26 Assessment of Clonal Hematopoiesis of Indeterminate Potential from Cardiac Magnetic Resonance Imaging using Deep Learning in a Cardio-oncology Population Sangeon Ryu et.al. 2406.18508v1 null
2024-06-26 Robust Surgical Phase Recognition From Annotation Efficient Supervision Or Rubin et.al. 2406.18481v1 null
2024-06-26 Universal Anomaly Detection at the LHC: Transforming Optimal Classifiers and the DDD Method Sascha Caron et.al. 2406.18469v1 null
2024-06-26 An Autotuning-based Optimization Framework for Mixed-kernel SVM Classifications in Smart Pixel Datasets and Heterojunction Transistors Xingfu Wu et.al. 2406.18445v1 null
2024-06-26 Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling Abril Corona-Figueroa et.al. 2406.18422v1 null
2024-06-25 Text-Animator: Controllable Visual Text Video Generation Lin Liu et.al. 2406.17777v1 null
2024-06-25 MotionBooth: Motion-Aware Customized Text-to-Video Generation Jianzong Wu et.al. 2406.17758v1 null
2024-06-25 Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation Tushar Prasanna Swaminathan et.al. 2406.17749v1 null
2024-06-25 Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning Arijit Sehanobish et.al. 2406.17740v1 null
2024-06-25 Mask-Guided Attention U-Net for Enhanced Neonatal Brain Extraction and Image Preprocessing Bahram Jafrasteh et.al. 2406.17709v1 link
2024-06-25 SurgeMOD: Translating image-space tissue motions into vision-based surgical forces Mikel De Iturrate Reyzabal et.al. 2406.17707v1 link
2024-06-25 Dualities for universal (co)acting Hopf monoids Ana Agore et.al. 2406.17684v1 null
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679v1 null
2024-06-25 Lifting of locally initial objects and universal (co)acting Hopf algebras Ana Agore et.al. 2406.17677v1 null
2024-06-25 Brain Tumor Classification using Vision Transformer with Selective Cross-Attention Mechanism and Feature Calibration Mohammad Ali Labbaf Khaniki et.al. 2406.17670v1 null
2024-06-24 StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal Chongjie Ye et.al. 2406.16864v1 null
2024-06-24 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Haonan Qiu et.al. 2406.16863v1 link
2024-06-24 Dreamitate: Real-World Visuomotor Policy Learning via Video Generation Junbang Liang et.al. 2406.16862v1 null
2024-06-24 Long Context Transfer from Language to Vision Peiyuan Zhang et.al. 2406.16852v1 link
2024-06-24 Unsupervised Domain Adaptation for Pediatric Brain Tumor Segmentation Jingru Fu et.al. 2406.16848v1 null
2024-06-24 Exploring Factual Entailment with NLI: A News Media Study Guy Mor-Lan et.al. 2406.16842v1 null
2024-06-24 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837v1 null
2024-06-24 USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$onversations Mounika Marreddy et.al. 2406.16833v1 null
2024-06-24 The classification of simple complex Lie superalgebras of polynomial vector fields and their deformations Dimitry Leites et.al. 2406.16760v1 null
2024-06-24 The MRI Scanner as a Diagnostic: Image-less Active Sampling Yuning Du et.al. 2406.16754v1 null
2024-06-21 Full-Scale Indexing and Semantic Annotation of CT Imaging: Boosting FAIRness Hannes Ulrich et.al. 2406.15340v1 null
2024-06-21 Image Conductor: Precision Control for Interactive Video Synthesis Yaowei Li et.al. 2406.15339v1 null
2024-06-21 An End-to-End, Segmentation-Free, Arabic Handwritten Recognition Model on KHATT Sondos Aabed et.al. 2406.15329v1 null
2024-06-21 Fine-grained Attention in Hierarchical Transformers for Tabular Time-series Raphael Azorin et.al. 2406.15327v1 link
2024-06-21 NLP-KG: A System for Exploratory Search of Scientific Literature in Natural Language Processing Tim Schopf et.al. 2406.15294v1 link
2024-06-21 Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics Weijia Zhang et.al. 2406.15264v1 null
2024-06-24 VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Xuan He et.al. 2406.15252v2 null
2024-06-21 Retrieval Augmented Zero-Shot Text Classification Tassallah Abdullahi et.al. 2406.15241v1 null
2024-06-21 Model Equivalences Michael Benedikt et.al. 2406.15235v1 null
2024-06-21 Rate-Splitting Multiple Access for Overloaded Multi-group Multicast: A First Experimental Study Xinze Lyu et.al. 2406.15217v1 null
2024-06-20 A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models Xincheng Shuai et.al. 2406.14555v1 link
2024-06-21 Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation Eyal Michaeli et.al. 2406.14551v2 link
2024-06-20 IRASim: Learning Interactive Real-Robot Action Simulators Fangqi Zhu et.al. 2406.14540v1 null
2024-06-20 Epicardium Prompt-guided Real-time Cardiac Ultrasound Frame-to-volume Registration Long Lei et.al. 2406.14534v1 link
2024-06-20 Local symmetries in partially ordered sets Christoph Minz et.al. 2406.14533v1 null
2024-06-20 Fantastic Copyrighted Beasts and How (Not) to Generate Them Luxi He et.al. 2406.14526v1 null
2024-06-20 MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding Xinyu Fang et.al. 2406.14515v1 link
2024-06-20 V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data Rotem Shalev-Arkushin et.al. 2406.14510v1 null
2024-06-20 LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors Sheikh Asif Imran et.al. 2406.14498v1 link
2024-06-20 African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification Gregor Geigle et.al. 2406.14496v1 null
2024-06-18 DrVideo: Document Retrieval Based Long Video Understanding Ziyu Ma et.al. 2406.12846v1 null
2024-06-18 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Jinuk Kim et.al. 2406.12837v1 link
2024-06-18 GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation Ci-Siang Lin et.al. 2406.12834v1 null
2024-06-18 VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing Jing Gu et.al. 2406.12831v1 null
2024-06-18 Neural Approximate Mirror Maps for Constrained Diffusion Models Berthy T. Feng et.al. 2406.12816v1 null
2024-06-18 Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation Nikolas Koutsoubis et.al. 2406.12815v1 link
2024-06-18 Probabilistic Temporal Prediction of Continuous Disease Trajectories and Treatment Effects Using Neural SDEs Joshua Durso-Finley et.al. 2406.12807v1 null
2024-06-18 Composited-Nested-Learning with Data Augmentation for Nested Named Entity Recognition Xingming Liao et.al. 2406.12779v1 null
2024-06-18 Medvedev degrees of subshifts on groups Sebastián Barbieri et.al. 2406.12777v1 null
2024-06-18 Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video Xiangming Zhu et.al. 2406.12769v1 null
2024-06-17 Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Lei Zhu et.al. 2406.11837v1 link
2024-06-17 Spectral Introspection Identifies Group Training Dynamics in Deep Neural Networks for Neuroimaging Bradley T. Baker et.al. 2406.11825v1 null
2024-06-17 Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation Alexander Raistrick et.al. 2406.11824v1 null
2024-06-17 VideoLLM-online: Online Video Large Language Model for Streaming Video Joya Chen et.al. 2406.11816v1 null
2024-06-17 Faces of Experimental Pain: Transferability of Deep Learned Heat Pain Features to Electrical Pain Pooja Prajod et.al. 2406.11808v1 null
2024-06-17 Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation Song Wang et.al. 2406.11799v1 null
2024-06-17 CELL your Model: Contrastive Explanation Methods for Large Language Models Ronny Luss et.al. 2406.11785v1 null
2024-06-17 Task Me Anything Jieyu Zhang et.al. 2406.11775v1 link
2024-06-17 Domain Generalization for In-Orbit 6D Pose Estimation Antoine Legrand et.al. 2406.11743v1 null
2024-06-17 Lightweight Model Pre-training via Language Guided Knowledge Distillation Mingsheng Li et.al. 2406.11689v1 link
2024-06-14 VideoGUI: A Benchmark for GUI Automation from Instructional Videos Kevin Qinghong Lin et.al. 2406.10227v1 null
2024-06-14 Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding Ridouane Ghermi et.al. 2406.10221v1 null
2024-06-14 SSTFB: Leveraging self-supervised pretext learning and temporal self-attention with feature branching for real-time video polyp segmentation Ziang Xu et.al. 2406.10200v1 null
2024-06-14 CarLLaVA: Vision language models for camera-only closed-loop driving Katrin Renz et.al. 2406.10165v1 null
2024-06-14 Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition Guinan Li et.al. 2406.10152v1 null
2024-06-14 Training-free Camera Control for Video Generation Chen Hou et.al. 2406.10126v1 null
2024-06-14 Modified Risk Formulation for Improving the Prediction of Knee Osteoarthritis Progression Haresh Rengaraj Rajamohan et.al. 2406.10119v1 null
2024-06-14 ECGMamba: Towards Efficient ECG Classification with BiSSM Yupeng Qiang et.al. 2406.10098v1 null
2024-06-14 Biomarker based Cancer Classification using an Ensemble with Pre-trained Models Chongmin Lee et.al. 2406.10087v1 null
2024-06-14 On the Evaluation of Speech Foundation Models for Spoken Language Understanding Siddhant Arora et.al. 2406.10083v1 null
2024-06-13 VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Muhammad Maaz et.al. 2406.09418v1 link
2024-06-13 An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Duy-Kien Nguyen et.al. 2406.09415v1 null
2024-06-13 CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras Sachin Shah et.al. 2406.09409v1 null
2024-06-13 Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion Linzhan Mou et.al. 2406.09402v1 null
2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation Junke Wang et.al. 2406.09399v1 link
2024-06-13 Too Many Frames, not all Useful:Efficient Strategies for Long-Form Video QA Jongwoo Park et.al. 2406.09396v1 null
2024-06-13 LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living Rajatsubhra Chakraborty et.al. 2406.09390v1 null
2024-06-13 Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior Baiang Li et.al. 2406.09389v1 null
2024-06-13 Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition Youngtaek Oh et.al. 2406.09388v1 link
2024-06-13 SimGen: Simulator-conditioned Driving Scene Generation Yunsong Zhou et.al. 2406.09386v1 null
2024-06-12 On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models Hashmat Shadab Malik et.al. 2406.08486v1 link
2024-06-12 RMem: Restricted Memory Banks Improve Video Object Segmentation Junbao Zhou et.al. 2406.08476v1 null
2024-06-12 AToM-Bot: Embodied Fulfillment of Unspoken Human Needs with Affective Theory of Mind Wei Ding et.al. 2406.08455v1 null
2024-06-12 Transformation-Dependent Adversarial Attacks Yaoteng Tan et.al. 2406.08443v1 null
2024-06-12 A Sticker is Worth a Thousand Words: Characterizing the Use of Stickers in WhatsApp Political Groups in Brazil Philipe Melo et.al. 2406.08429v1 null
2024-06-12 Improving Noise Robustness through Abstractions and its Impact on Machine Learning Alfredo Ibias et.al. 2406.08428v1 null
2024-06-12 OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Qingyun Li et.al. 2406.08418v1 link
2024-06-13 MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Xuehai He et.al. 2406.08407v2 link
2024-06-12 Eyes Wide Unshut: Unsupervised Mistake Detection in Egocentric Video by Detecting Unpredictable Gaze Michele Mazzamuto et.al. 2406.08379v1 null
2024-06-12 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction Tianqi Chen et.al. 2406.08374v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-10 NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing Ting-Hsuan Chen et.al. 2406.06523v1 null
2024-06-10 Data Augmentation for Multivariate Time Series Classification: An Experimental Study Romain Ilbert et.al. 2406.06518v1 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512v1 null
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508v1 link
2024-06-10 Equivariant Neural Tangent Kernels Philipp Misof et.al. 2406.06504v1 null
2024-06-10 Viscous shock fluctuations in KPZ Alexander Dunlap et.al. 2406.06502v1 null
2024-06-10 NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative Asmar Nadeem et.al. 2406.06499v1 null
2024-06-10 Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace Chenxu Wang et.al. 2406.06498v1 null
2024-06-10 Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data Nicole Hayes et.al. 2406.06479v1 null
2024-06-10 DiffAudit: Auditing Privacy Practices of Online Services for Children and Adolescents Olivia Figueira et.al. 2406.06473v1 null
2024-06-07 DVOS: Self-Supervised Dense-Pattern Video Object Segmentation Keyhan Najafian et.al. 2406.05131v1 null
2024-06-07 Compositional Curvature Bounds for Deep Neural Networks Taha Entesari et.al. 2406.05119v1 null
2024-06-07 Large Generative Graph Models Yu Wang et.al. 2406.05109v1 null
2024-06-07 A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification Christian Giannetti et.al. 2406.05096v1 null
2024-06-10 Discovery of An Apparent Red, High-Velocity Type Ia Supernova at z = 2.9 with JWST J. D. R. Pierel et.al. 2406.05089v2 null
2024-06-07 CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion Xingrui Wang et.al. 2406.05082v1 null
2024-06-10 Discovery of a Relativistic Stripped Envelope Type Ic-BL Supernova at z = 2.83 with JWST M. R. Siebert et.al. 2406.05076v2 null
2024-06-07 Diving Deep into the Motion Representation of Video-Text Models Chinmaya Devaraj et.al. 2406.05075v1 null
2024-06-07 Hibou: A Family of Foundational Vision Transformers for Pathology Dmitry Nechaev et.al. 2406.05074v1 null
2024-06-07 Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations Benjamin Fresz et.al. 2406.05068v1 link
2024-06-06 Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao et.al. 2406.04344v1 null
2024-06-07 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Fangfu Liu et.al. 2406.04338v2 null
2024-06-06 Parameter-Inverted Image Pyramid Networks Xizhou Zhu et.al. 2406.04330v1 link
2024-06-06 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Lin Chen et.al. 2406.04325v1 null
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324v1 null
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323v1 null
2024-06-06 VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling Zeyue Tian et.al. 2406.04321v1 link
2024-06-06 Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models Ali Behrouz et.al. 2406.04320v1 null
2024-06-06 Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology Prediction Chen-Yu Yen et.al. 2406.04318v1 null
2024-06-06 Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks Tristan Cinquin et.al. 2406.04317v1 null
2024-06-05 Grokking Modular Polynomials Darshil Doshi et.al. 2406.03495v1 null
2024-06-05 The Logarithmic Memristor-Based Bayesian Machine Clément Turck et.al. 2406.03492v1 null
2024-06-05 Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review Sonia Bbouzidi et.al. 2406.03478v1 null
2024-06-05 Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach Haoyu Han et.al. 2406.03464v1 null
2024-06-05 Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts Dominik Scheuble et.al. 2406.03461v1 null
2024-06-05 FILS: Self-Supervised Video Feature Prediction In Semantic Language Space Mona Ahmadian et.al. 2406.03447v1 null
2024-06-05 Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input Joachim Ott et.al. 2406.03439v1 null
2024-06-05 Stabilizing massless fields with fluxes in Landau-Ginzburg models Katrin Becker et.al. 2406.03435v1 null
2024-06-05 Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis Moein Heidari et.al. 2406.03430v1 link
2024-06-05 Post-hoc Part-prototype Networks Andong Tan et.al. 2406.03421v1 null
2024-06-05 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541v2 null
2024-06-04 ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Tianchen Zhao et.al. 2406.02540v1 null
2024-06-04 Enhancing predictive imaging biomarker discovery through treatment effect analysis Shuhan Xiao et.al. 2406.02534v1 null
2024-06-04 ReLUs Are Sufficient for Learning Implicit Neural Representations Joseph Shenouda et.al. 2406.02529v1 link
2024-06-04 RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots Soroush Nasiriany et.al. 2406.02523v1 null
2024-06-04 DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Zhongpai Gao et.al. 2406.02518v1 null
2024-06-04 V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation Cong Wang et.al. 2406.02511v1 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509v1 null
2024-06-04 Endomorphisms of Artin groups of type $\tilde A_n$ Luis Paris et.al. 2406.02484v1 null
2024-06-04 Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion Colin Hansen et.al. 2406.02477v1 null
2024-05-31 Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis Chaoyou Fu et.al. 2405.21075v1 null
2024-05-31 Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights Xin Wen et.al. 2405.21070v1 link
2024-05-31 You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet Zhen Qin et.al. 2405.21022v1 null
2024-05-31 Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging Niloufar Zakariaei et.al. 2405.21021v1 null
2024-05-31 The classification of dp-minimal integral domains Christian d'Elbée et.al. 2405.21014v1 null
2024-05-31 Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging Muhammad Muneeb Saad et.al. 2405.20987v1 null
2024-05-31 PUAL: A Classifier on Trifurcate Positive-Unlabeled Data Xiaoke Wang et.al. 2405.20970v1 null
2024-05-31 Aligning Multiclass Neural Network Classifier Criterion with Task Performance via $F_β$-Score Nathan Tsoi et.al. 2405.20954v1 null
2024-05-31 Standard model of electromagnetism and chirality in crystals R. Winkler et.al. 2405.20940v1 null
2024-05-31 MALT: Multi-scale Action Learning Transformer for Online Action Detection Zhipeng Yang et.al. 2405.20892v1 null
2024-05-30 MotionLLM: Understanding Human Behaviors from Human Motions and Videos Ling-Hao Chen et.al. 2405.20340v1 null
2024-05-30 OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving Lening Wang et.al. 2405.20337v1 link
2024-05-30 VividDream: Generating 3D Scene with Ambient Dynamics Yao-Chih Lee et.al. 2405.20334v1 null
2024-05-30 SurgiTrack: Fine-Grained Multi-Class Multi-Tool Tracking in Surgical Videos Chinedu Innocent Nwoye et.al. 2405.20333v1 null
2024-05-31 4DHands: Reconstructing Interactive Hands in 4D with Transformers Dixuan Lin et.al. 2405.20330v2 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325v1 null
2024-05-30 Vision-based Manipulation from Single Human Video with Open-World Object Graphs Yifeng Zhu et.al. 2405.20321v1 null
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320v1 link
2024-05-30 CausalQuest: Collecting Natural Causal Questions for AI Agents Roberto Ceraolo et.al. 2405.20318v1 link
2024-05-30 Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models Himangi Mittal et.al. 2405.20305v1 null
2024-05-29 X-VILA: Cross-Modality Alignment for Large Language Model Hanrong Ye et.al. 2405.19335v1 null
2024-05-29 LLMs Meet Multimodal Generation and Editing: A Survey Yingqing He et.al. 2405.19334v1 link
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333v1 null
2024-05-29 NPGA: Neural Parametric Gaussian Avatars Simon Giebenhain et.al. 2405.19331v1 null
2024-05-29 Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation Atrisha Sarkar et.al. 2405.19328v1 null
2024-05-29 DGD: Dynamic 3D Gaussians Distillation Isaac Labe et.al. 2405.19321v1 null
2024-05-29 Real-Time Environment Condition Classification for Autonomous Vehicles Marco Introvigne et.al. 2405.19305v1 null
2024-05-29 Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare Hanwei Zhu et.al. 2405.19298v1 null
2024-05-29 Archetype-Based Redshift Estimation for the Dark Energy Spectroscopic Instrument Survey Abhijeet Anand et.al. 2405.19288v1 null
2024-05-29 A study on the adequacy of common IQA measures for medical images Anna Breger et.al. 2405.19224v1 null
2024-05-28 Classifying Overlapping Gaussian Mixtures in High Dimensions: From Optimal Classifiers to Neural Nets Khen Cohen et.al. 2405.18427v1 null
2024-05-28 GFlow: Recovering 4D World from Monocular Video Shizun Wang et.al. 2405.18426v1 null
2024-05-28 Hierarchical World Models as Visual Whole-Body Humanoid Controllers Nicklas Hansen et.al. 2405.18418v1 null
2024-05-28 3D StreetUnveiler with Semantic-Aware 2DGS Jingwei Xu et.al. 2405.18416v1 null
2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? Yuhui Zhang et.al. 2405.18415v1 link
2024-05-28 Towards a Sampling Theory for Implicit Neural Representations Mahrokh Najaf et.al. 2405.18410v1 null
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407v1 null
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406v1 null
2024-05-28 MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning Somnath Kumar et.al. 2405.18358v1 null
2024-05-28 Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography Jie Liu et.al. 2405.18356v1 link
2024-05-27 Matryoshka Multimodal Models Mu Cai et.al. 2405.17430v1 null
2024-05-27 NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Chankyu Lee et.al. 2405.17428v1 null
2024-05-27 MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds Jiahui Lei et.al. 2405.17421v1 null
2024-05-27 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control Zhengfei Kuang et.al. 2405.17414v1 null
2024-05-27 Enhancing Music Genre Classification through Multi-Algorithm Analysis and User-Friendly Visualization Navin Kamuni et.al. 2405.17413v1 null
2024-05-27 The Peripatetic Hater: Predicting Movement Among Hate Subreddits Daniel Hickey et.al. 2405.17410v1 null
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405v1 null
2024-05-27 Spectral Greedy Coresets for Graph Neural Networks Mucong Ding et.al. 2405.17404v1 null
2024-05-27 Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability Shenyuan Gao et.al. 2405.17398v1 link
2024-05-27 Non-Unitary Quantum Machine Learning Jamie Heredge et.al. 2405.17388v1 null
2024-05-24 Canonical Variates in Wasserstein Metric Space Jia Li et.al. 2405.15768v1 null
2024-05-24 Scaling Laws for Discriminative Classification in Large Language Models Dean Wyatte et.al. 2405.15765v1 null
2024-05-24 InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation Yuchi Wang et.al. 2405.15758v1 link
2024-05-24 Looking Backward: Streaming Video-to-Video Translation with Feature Banks Feng Liang et.al. 2405.15757v1 link
2024-05-24 Characterizing Discourse Group Roles in Inquiry-based University Science Labs Tong Wan et.al. 2405.15746v1 null
2024-05-24 Hierarchical Uncertainty Exploration via Feedforward Posterior Trees Elias Nehme et.al. 2405.15719v1 null
2024-05-24 EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences Jocelyn Shen et.al. 2405.15708v1 null
2024-05-24 Sums: Sniffing Unknown Multiband Signals under Low Sampling Rates Jinbo Peng et.al. 2405.15705v1 null
2024-05-24 realSEUDO for real-time calcium imaging analysis Iuliia Dmitrieva et.al. 2405.15701v1 null
2024-05-24 UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes Ted Lentsch et.al. 2405.15688v1 null
2024-05-23 PuzzleAvatar: Assembling 3D Avatars from Personal Albums Yuliang Xiu et.al. 2405.14869v1 null
2024-05-23 Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis Basile Van Hoorick et.al. 2405.14868v1 null
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864v1 null
2024-05-23 Synergistic Global-space Camera and Human Reconstruction from Videos Yizhou Zhao et.al. 2405.14855v1 null
2024-05-23 Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron Can Cui1 et.al. 2405.14851v1 null
2024-05-23 Learning to Detect and Segment Mobile Objects from Unlabeled Videos Yihong Sun et.al. 2405.14841v1 null
2024-05-23 Designing A Sustainable Marine Debris Clean-up Framework without Human Labels Raymond Wang et.al. 2405.14815v1 null
2024-05-23 As an AI Language Model, "Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making Shomik Jain et.al. 2405.14812v1 null
2024-05-23 Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics Jonas Spinner et.al. 2405.14806v1 null
2024-05-24 Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation Hongxu Jiang et.al. 2405.14802v2 link
2024-05-21 Comprehensive Multimodal Deep Learning Survival Prediction Enabled by a Transformer Architecture: A Multicenter Study in Glioblastoma Ahmed Gomaa et.al. 2405.12963v1 null
2024-05-21 Online Learning of Halfspaces with Massart Noise Ilias Diakonikolas et.al. 2405.12958v1 null
2024-05-21 Quantifying Uncertainty in Classification Performance: ROC Confidence Bands Using Conformal Prediction Zheshi Zheng et.al. 2405.12953v1 null
2024-05-21 Tutorly: Turning Programming Videos Into Apprenticeship Learning Environments with LLMs Wengxi Li et.al. 2405.12946v1 null
2024-05-21 Pytorch-Wildlife: A Collaborative Deep Learning Framework for Conservation Andres Hernandez et.al. 2405.12930v1 link
2024-05-21 Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples Tim Menzies et.al. 2405.12920v1 null
2024-05-21 The $L_p$-dual space of a semisimple Lie group Bachir Bekka et.al. 2405.12919v1 null
2024-05-21 Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment Holli Sargeant et.al. 2405.12910v1 link
2024-05-21 Decentralized Federated Learning Over Imperfect Communication Channels Weicai Li et.al. 2405.12894v1 null
2024-05-21 Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models Abdurahmman Alzahrani et.al. 2405.12884v1 null
2024-05-20 Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen et.al. 2405.12221v1 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211v1 null
2024-05-20 The sign of scalar curvature on Kähler blowups Garrett M. Brown et.al. 2405.12189v1 null
2024-05-20 Building Temporal Kernels with Orthogonal Polynomials Yan Ru Pei et.al. 2405.12179v1 link
2024-05-20 Wireless vs. Traditional Ultrasound Assessed Knee Cartilage Outcomes Utilizing Automated Gain and Normalization Techniques Arjun Parmar et.al. 2405.12172v1 null
2024-05-20 DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM Xuchen Li et.al. 2405.12139v1 null
2024-05-20 Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models Nida Nasir et.al. 2405.12126v1 null
2024-05-20 An Active Learning Framework with a Class Balancing Strategy for Time Series Classification Shemonto Das et.al. 2405.12122v1 null
2024-05-20 AGNfitter-rx: Modelling the radio-to-X-ray SEDs of AGNs L. N. Martínez-Ramírez et.al. 2405.12111v1 null
2024-05-20 Real topological phonons in 3D carbon allotropes Xiaotian Wang et.al. 2405.12072v1 null
2024-05-17 Submodular Information Selection for Hypothesis Testing with Misclassification Penalties Jayanth Bhargav et.al. 2405.10930v1 null
2024-05-17 A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model Mingxiang Fu et.al. 2405.10890v1 null
2024-05-17 Multicenter Privacy-Preserving Model Training for Deep Learning Brain Metastases Autosegmentation Yixing Huang et.al. 2405.10870v1 null
2024-05-17 "Hall" transport of liquid crystal solitons in Couette flow Rodrigo C. V. Coelho et.al. 2405.10850v1 null
2024-05-17 Automatic segmentation of Organs at Risk in Head and Neck cancer patients from CT and MRI scans Sébastien Quetin et.al. 2405.10833v1 null
2024-05-17 Open-Vocabulary Spatio-Temporal Action Detection Tao Wu et.al. 2405.10832v1 null
2024-05-17 Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities Hao Zhou et.al. 2405.10825v1 null
2024-05-17 ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios Markus Bayer et.al. 2405.10808v1 null
2024-05-17 A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability Abdul Rehman et.al. 2405.10803v1 null
2024-05-17 Reduced storage direct tensor ring decomposition for convolutional neural networks compression Mateusz Gabor et.al. 2405.10802v1 link
2024-05-16 TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction Yunfan Jiang et.al. 2405.10315v1 null
2024-05-16 4D Panoptic Scene Graph Generation Jingkang Yang et.al. 2405.10305v1 link
2024-05-16 On Sample Selection for Continual Learning: a Video Streaming Case Study Alexander Dietmüller et.al. 2405.10290v1 null
2024-05-16 Quantum Vision Transformers for Quark-Gluon Classification Marçal Comajoan Cara et.al. 2405.10284v1 null
2024-05-16 Faces that Speak: Jointly Synthesising Talking Face and Speech from Text Youngjoon Jang et.al. 2405.10272v1 null
2024-05-16 A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision Charles Raude et.al. 2405.10266v1 null
2024-05-16 PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology George Shaikovski et.al. 2405.10254v1 null
2024-05-16 A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts Xinru Zhang et.al. 2405.10246v1 null
2024-05-16 Ternary mappings of some evolution algebras Candido Martin Gonzalez et.al. 2405.10241v1 null
2024-05-16 ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks Zhehan Zhao et.al. 2405.10218v1 null
2024-05-15 Classifying geospatial objects from multiview aerial imagery using semantic meshes David Russell et.al. 2405.09544v1 null
2024-05-15 Spectral complexity of deep neural networks Simmaco Di Lillo et.al. 2405.09541v1 null
2024-05-16 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer Chengyu Wu et.al. 2405.09539v2 link
2024-05-15 Restoring balance: principled under/oversampling of data for optimal classification Emanuele Loffredo et.al. 2405.09535v1 null
2024-05-15 Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck Hongru Li et.al. 2405.09514v1 null
2024-05-15 Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts Donya Rooein et.al. 2405.09482v1 null
2024-05-15 Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment Xinying Lin et.al. 2405.09472v1 null
2024-05-15 Non-contact Lung Disease Classification via OFDM-based Passive 6G ISAC Sensing Hasan Mujtaba Buttar et.al. 2405.09458v1 null
2024-05-15 Cohomogeneity one RCD-spaces Diego Corro et.al. 2405.09448v1 null
2024-05-15 M$^4$oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts Yufeng Jiang et.al. 2405.09446v1 null
2024-05-14 CinePile: A Long Video Question Answering Dataset and Benchmark Ruchit Rawal et.al. 2405.08813v1 null
2024-05-14 The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction Qiang Ma et.al. 2405.08783v1 null
2024-05-14 Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling Gregory Holste et.al. 2405.08780v1 null
2024-05-14 FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings Nancy Hada et.al. 2405.08776v1 null
2024-05-14 From Text to Context: An Entailment Approach for News Stakeholder Classification Alapan Kuila et.al. 2405.08751v1 null
2024-05-14 Enhancing Blind Video Quality Assessment with Rich Quality-aware Features Wei Sun et.al. 2405.08745v1 null
2024-05-14 The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks Carmela Calabrese et.al. 2405.08695v1 null
2024-05-14 Latent group structure in linear panel data models with endogenous regressors Junho Choi et.al. 2405.08687v1 null
2024-05-14 Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis Qingpeng Kong et.al. 2405.08681v1 link
2024-05-14 Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning Alain Riou et.al. 2405.08679v1 null
2024-05-14 MambaOut: Do We Really Need Mamba for Vision? Weihao Yu et.al. 2405.07992v2 link
2024-05-13 SPIN: Simultaneous Perception, Interaction and Navigation Shagun Uppal et.al. 2405.07991v1 null
2024-05-13 KG-Planner: Knowledge-Informed Graph Neural Planning for Collaborative Manipulators Wansong Liu et.al. 2405.07962v1 null
2024-05-13 An Algorithmic Classification of Generalized Pseudo-Anosov Homeomorphisms via Geometric Markov Partitions Inti Cruz Diaz et.al. 2405.07954v1 null
2024-05-13 Scene Action Maps: Behavioural Maps for Navigation without Metric Information Joel Loo et.al. 2405.07948v1 null
2024-05-14 PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition Ziyang Zhang et.al. 2405.07932v2 link
2024-05-13 Improving Multimodal Learning with Multi-Loss Gradient Modulation Konstantinos Kontras et.al. 2405.07930v1 null
2024-05-13 PLUTO: Pathology-Universal Transformer Dinkar Juyal et.al. 2405.07905v1 null
2024-05-13 Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer Chi-en Amy Tai et.al. 2405.07869v1 null
2024-05-13 Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging Chi-en Amy Tai et.al. 2405.07861v1 null
2024-05-10 Multi-Object Tracking in the Dark Xinzhe Wang et.al. 2405.06600v1 link
2024-05-10 Ice phase classification made easy with score-based denoising Hong Sun et.al. 2405.06599v1 null
2024-05-10 Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach Elham Ravanbakhsh et.al. 2405.06586v1 null
2024-05-10 Deep video representation learning: a survey Elham Ravanbakhsh et.al. 2405.06574v1 null
2024-05-10 The Role of Topological Photon Spheres in Constraining the Parameters of Black Holes Jafar Sadeghi et.al. 2405.06568v1 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547v1 link
2024-05-10 Separating States in Astronomical Sources Using Hidden Markov Models: With a Case Study of Flaring and Quiescence on EV Lac Robert Zimmerman et.al. 2405.06540v1 null
2024-05-10 Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation Xiaowen Ma et.al. 2405.06525v1 link
2024-05-10 Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks Haifa Alrdahi et.al. 2405.06499v1 null
2024-05-10 Improving Deep Learning Model Calibration for Cardiac Applications using Deterministic Uncertainty Networks and Uncertainty-aware Training Tareen Dawood et.al. 2405.06487v1 null
2024-05-09 A Universal Growth Rate for Learning with Smooth Surrogate Losses Anqi Mao et.al. 2405.05968v1 null
2024-05-09 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask Zineb Senane et.al. 2405.05959v1 link
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953v1 null
2024-05-09 Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers Peng Gao et.al. 2405.05945v1 link
2024-05-09 MRISegmentator-Abdomen: A Fully Automated Multi-Organ and Structure Segmentation Tool for T1-weighted Abdominal MRI Yan Zhuang et.al. 2405.05944v1 null
2024-05-09 Non-symplectic automorphisms of prime order of O'Grady's tenfolds and cubic fourfolds Simone Billi et.al. 2405.05932v1 null
2024-05-09 Deep Multi-Task Learning for Malware Image Classification Ahmed Bensaoud et.al. 2405.05906v1 null
2024-05-09 An RNN-policy gradient approach for quantum architecture search Gang Wang et.al. 2405.05892v1 null
2024-05-09 Composable Part-Based Manipulation Weiyu Liu et.al. 2405.05876v1 null
2024-05-09 ExACT: An End-to-End Autonomous Excavator System Using Action Chunking With Transformers Liangliang Chen et.al. 2405.05861v1 null
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255v1 link
2024-05-08 Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models Hongjie Wang et.al. 2405.05252v1 null
2024-05-08 DanceCam: atmospheric turbulence mitigation in wide-field astronomical images with short-exposure video streams Spencer Bialek et.al. 2405.05250v1 null
2024-05-08 Deep learning-based variational autoencoder for classification of quantum and classical states of light Mahesh Bhupati et.al. 2405.05243v1 null
2024-05-08 On $\operatorname{Alt}(n)$-modules with an additive dimension when $n\le6$ Barry Chin et.al. 2405.05230v1 null
2024-05-08 Are Economically Advanced Countries More Efficient in Basic and Applied Research? Vladimír Holý et.al. 2405.05227v1 null
2024-05-08 Clustering Retail Products Based on Customer Behaviour Vladimír Holý et.al. 2405.05218v1 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216v1 link
2024-05-08 Graded Relevance Scoring of Written Essays with Dense Retrieval Salam Albatarni et.al. 2405.05200v1 null
2024-05-08 Is Transductive Learning Equivalent to PAC Learning? Shaddin Dughmi et.al. 2405.05190v1 null
2024-05-07 Switchable Decision: Dynamic Neural Generation Networks Shujian Zhang et.al. 2405.04513v1 null
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496v1 null
2024-05-07 Exploration of Novel Neuromorphic Methodologies for Materials Applications Derek Gobin et.al. 2405.04478v1 null
2024-05-07 Generalized classical Yang-Baxter equation and regular decompositions Raschid Abedin et.al. 2405.04440v1 null
2024-05-07 On the classification of product-quotient surfaces with $q=0$, $p_g=3$ and their canonical map Federico Fallucca et.al. 2405.04425v1 null
2024-05-07 Vision Mamba: A Comprehensive Survey and Taxonomy Xiao Liu et.al. 2405.04404v1 link
2024-05-07 Efficient Online Set-valued Classification with Bandit Feedback Zhou Wang et.al. 2405.04393v1 null
2024-05-07 DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving Chen Min et.al. 2405.04390v1 null
2024-05-07 Parallelized Multi-Agent Bayesian Optimization in Lava Shay Snyder et.al. 2405.04387v1 null
2024-05-07 Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs Antonio Bikić et.al. 2405.04386v1 null
2024-05-06 Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs Muhammad Uzair Khattak et.al. 2405.03690v1 null
2024-05-06 All-in-One Deep Learning Framework for MR Image Reconstruction Geunu Jeong et.al. 2405.03684v1 null
2024-05-06 ScrewMimic: Bimanual Imitation from Human Videos with Screw Space Projection Arpit Bahety et.al. 2405.03666v1 null
2024-05-06 CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification Sankalp Sinha et.al. 2405.03660v1 null
2024-05-06 Collecting Consistently High Quality Object Tracks with Minimal Human Involvement by Using Self-Supervised Learning to Detect Tracker Errors Samreen Anjum et.al. 2405.03643v1 null
2024-05-06 Classification of Breast Cancer Histopathology Images using a Modified Supervised Contrastive Learning Method Matina Mahdizadeh Sani et.al. 2405.03642v1 link
2024-05-06 Nonequilibrium relaxation and odd-even effect in finite-temperature electron gases Eric Nilsson et.al. 2405.03635v1 null
2024-05-06 Nonnegative Matrix Factorization in Dimensionality Reduction: A Survey Farid Saberi-Movahed et.al. 2405.03615v1 null
2024-05-06 Dual Relation Mining Network for Zero-Shot Learning Jinwei Han et.al. 2405.03613v1 null
2024-05-06 Communities for the Lagrangian Dynamics of the Turbulent Velocity Gradient Tensor: A Network Participation Approach Christopher J. Keylock et.al. 2405.03589v1 null
2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu et.al. 2405.02280v1 null
2024-05-03 Transversely Projective Structures on Smooth Foliations on Surfaces Gabriel Fazoli et.al. 2405.02273v1 null
2024-05-03 On its way to the neutron star-white dwarf binary graveyard, IGR J16194-2810, a first ascent M giant X-ray binary K. H. Hinkle et.al. 2405.02270v1 null
2024-05-03 Validating Gaia DR3 Pulsating Variable Classifications with TESS: Building Reliable $δ$ Scuti and $γ$ Doradus Stars Catalogs (In Progress) Ai-Ying Zhou et.al. 2405.02264v1 null
2024-05-03 Subgraph2vec: A random walk-based algorithm for embedding knowledge graphs Elika Bozorgi et.al. 2405.02240v1 null
2024-05-03 Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks Lujing Zhang et.al. 2405.02225v1 null
2024-05-03 Designed Dithering Sign Activation for Binary Neural Networks Brayan Monroy et.al. 2405.02220v1 null
2024-05-03 Multispectral Fine-Grained Classification of Blackgrass in Wheat and Barley Crops Madeleine Darbyshire et.al. 2405.02218v1 null
2024-05-03 Non-Destructive Peat Analysis using Hyperspectral Imaging and Machine Learning Yijun Yan et.al. 2405.02191v1 null
2024-05-03 Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset Hsuvas Borkakoty et.al. 2405.02175v1 null
2024-05-02 Confronting sparse Gaia DR3 photometry with TESS for a sample of about 60,000 hot massive non-radial pulsators Daniel Hey et.al. 2405.01539v1 null
2024-05-02 Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal et.al. 2405.01534v1 null
2024-05-02 Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models Nishad Singhi et.al. 2405.01531v1 null
2024-05-02 Track2Act: Predicting Point Tracks from Internet Videos enables Diverse Zero-shot Robot Manipulation Homanga Bharadhwaj et.al. 2405.01527v1 null
2024-05-03 A separability-based approach to quantifying generalization: which layer is best? Luciano Dyballa et.al. 2405.01524v2 null
2024-05-02 Grand Design vs. Multi-Armed Spiral Galaxies: Dependence on Galaxy Structure Beverly J. Smith et.al. 2405.01516v1 null
2024-05-03 Accelerating Convergence in Bayesian Few-Shot Classification Tianjun Ke et.al. 2405.01507v2 link
2024-05-02 PAM-UNet: Shifting Attention on Region of Interest in Medical Images Abhijit Das et.al. 2405.01503v1 null
2024-05-02 Exploring Privacy Issues in Mission Critical Communication: Navigating 5G and Beyond Networks Prajnamaya Dass et.al. 2405.01492v1 null
2024-05-02 Designing Algorithmic Recommendations to Achieve Human-AI Complementarity Bryce McLaughlin et.al. 2405.01484v1 null
2024-05-01 Quantum algorithms for matrix geometric means Nana Liu et.al. 2405.00673v1 null
2024-05-01 Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays Andrei Chubarau et.al. 2405.00670v1 null
2024-05-01 Screening of BindingDB database ligands against EGFR, HER2, Estrogen, Progesterone and NF-kB receptors based on machine learning and molecular docking Parham Rezaee et.al. 2405.00647v1 null
2024-05-01 Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling Yida Mu et.al. 2405.00611v1 null
2024-05-01 Investigating Automatic Scoring and Feedback using Large Language Models Gloria Ashiya Katuka et.al. 2405.00602v1 null
2024-05-01 Discovering robust biomarkers of neurological disorders from functional MRI using graph neural networks: A Review Yi Hao Chan et.al. 2405.00577v1 null
2024-05-01 EALD-MLLM: Emotion Analysis in Long-sequential and De-identity videos with Multi-modal Large Language Model Deng Li et.al. 2405.00574v1 null
2024-05-01 Remote Sensing Data Assimilation with a Chained Hydrologic-hydraulic Model for Flood Forecasting Thanh Huy Nguyen et.al. 2405.00567v1 null
2024-05-01 Digital-analog quantum convolutional neural networks for image classification Anton Simen et.al. 2405.00548v1 null
2024-05-01 UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement Ruiquan Ge et.al. 2405.00542v1 link
2024-04-30 A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications Steph Buongiorno et.al. 2404.19729v1 null
2024-04-30 Classification of simple 0-dimensional isolated complete intersection singularities Thuy Huong Pham et.al. 2404.19728v1 null
2024-04-30 PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios Jingbo Wang et.al. 2404.19722v1 null
2024-04-30 PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games Steph Buongiorno et.al. 2404.19721v1 null
2024-04-30 ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents Hoang-Thang Ta et.al. 2404.19714v1 null
2024-04-30 A rank decomposition for the topological classification of neural representations Kosio Beshkov et.al. 2404.19710v1 null
2024-04-30 Neural Controlled Differential Equations with Quantum Hidden Evolutions Lingyi Yang et.al. 2404.19673v1 link
2024-04-30 Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity Lei Wang et.al. 2404.19666v1 null
2024-04-30 Towards Generalist Robot Learning from Internet Video: A Survey Robert McCarthy et.al. 2404.19664v1 null
2024-04-30 Regularization of Riemannian optimization: Application to process tomography and quantum machine learning Felix Soest et.al. 2404.19659v1 null
2024-04-29 Hallucination of Multimodal Large Language Models: A Survey Zechen Bai et.al. 2404.18930v1 link
2024-04-29 Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing Leonardo Rossi et.al. 2404.18924v1 null
2024-04-29 Anomaly and invertible field theory with higher-form symmetry: Extended group cohomology Shi Chen et.al. 2404.18921v1 null
2024-04-29 A Survey on Diffusion Models for Time Series and Spatio-Temporal Data Yiyuan Yang et.al. 2404.18886v1 link
2024-04-29 A Multilevel Strategy to Improve People Tracking in a Real-World Scenario Cristiano B. de Oliveira et.al. 2404.18876v1 null
2024-04-29 A Survey on Vision Mamba: Models, Applications and Challenges Rui Xu et.al. 2404.18861v1 link
2024-04-29 ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization Hong Nguyen et.al. 2404.18831v1 link
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820v1 null
2024-04-29 Certification of Speaker Recognition Models to Additive Perturbations Dmitrii Korzh et.al. 2404.18791v1 null
2024-04-29 Understanding Radicals via Orbital Parities Reza G. Shirazi et.al. 2404.18787v1 null
2024-04-26 Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos Zhengze Xu et.al. 2404.17571v1 null
2024-04-26 Multifold topological semimetals Iñigo Robredo et.al. 2404.17539v1 null
2024-04-26 Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models Yuhang Huang et.al. 2404.17534v1 null
2024-04-26 Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations Puhao Li et.al. 2404.17521v1 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498v1 null
2024-04-26 Tabular Data Contrastive Learning via Class-Conditioned and Feature-Correlation Based Augmentation Wei Cui et.al. 2404.17489v1 link
2024-04-26 Low Cost Machine Vision for Insect Classification Danja Brandt et.al. 2404.17488v1 null
2024-04-26 Conformal Prediction with Learned Features Shayan Kiyani et.al. 2404.17487v1 null
2024-04-26 Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model Zhenghong Li et.al. 2404.17484v1 null
2024-04-26 One-Shot Image Restoration Deborah Pereg et.al. 2404.17426v1 null
2024-04-25 Made to Order: Discovering monotonic temporal changes via self-supervised video ordering Charig Yang et.al. 2404.16828v1 null
2024-04-25 ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images Weiqi Li et.al. 2404.16825v1 null
2024-04-25 V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection Xuanyu Zhang et.al. 2404.16824v1 null
2024-04-25 Learning Visuotactile Skills with Two Multifingered Hands Toru Lin et.al. 2404.16823v1 link
2024-04-25 Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution Zeynep Özdemir et.al. 2404.16814v1 null
2024-04-25 Transformer-Based Local Feature Matching for Multimodal Image Registration Remi Delaunay et.al. 2404.16802v1 null
2024-04-25 DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks Tongzhou Mu et.al. 2404.16779v1 null
2024-04-25 Modeling Selective Feature Attention for Representation-based Siamese Text Matching Jianxiang Zang et.al. 2404.16776v1 link
2024-04-25 Classifying One-Dimensional Quantum States Prepared by a Single Round of Measurements Rahul Sahay et.al. 2404.16753v1 null
2024-04-25 Characterizing Solar Center-to-Limb Radial-Velocity Variability with SDO Michael L. Palumbo III et.al. 2404.16747v1 null
2024-04-24 Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models Xu Shen et.al. 2404.15625v1 null
2024-04-24 Layer Ensemble Averaging for Improving Memristor-Based Artificial Neural Network Performance Osama Yousuf et.al. 2404.15621v1 null
2024-04-24 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620v1 link
2024-04-24 MDDD: Manifold-based Domain Adaptation with Dynamic Distribution for Non-Deep Transfer Learning in Cross-subject and Cross-session EEG-based Emotion Recognition Ting Luo et.al. 2404.15615v1 null
2024-04-24 Federated Learning with Only Positive Labels by Exploring Label Correlations Xuming An et.al. 2404.15598v1 null
2024-04-24 A Survey of Deep Long-Tail Classification Advancements Charika de Alvis et.al. 2404.15593v1 null
2024-04-24 Domain Adaptation for Learned Image Compression with Supervised Adapters Alberto Presta et.al. 2404.15591v1 null
2024-04-24 Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification Liang Qu et.al. 2404.15585v1 null
2024-04-24 Research on OPF control of three-phase four-wire low-voltage distribution network considering uncertainty Rui Wang et.al. 2404.15584v1 null
2024-04-24 MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis Jiaxin Zhuang et.al. 2404.15580v1 null
2024-04-23 ID-Animator: Zero-Shot Identity-Preserving Human Video Generation Xuanhua He et.al. 2404.15275v1 link
2024-04-23 Metric-guided Image Reconstruction Bounds via Conformal Prediction Matt Y Cheung et.al. 2404.15274v1 link
2024-04-23 Quantum optical classifier with superexponential speedup Simone Roncallo et.al. 2404.15266v1 null
2024-04-23 TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting Jiahe Li et.al. 2404.15264v1 null
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263v1 link
2024-04-23 FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent Cameron Smith et.al. 2404.15259v1 null
2024-04-23 Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions Xingguang Zhang et.al. 2404.15252v1 null
2024-04-23 Unifying the Temperature Dependent Dynamics of Glasses Joseph B. Schlenoff et.al. 2404.15250v1 null
2024-04-23 Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification Austin Goddard et.al. 2404.15245v1 null
2024-04-23 Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models Aidan Z. H. Yang et.al. 2404.15236v1 null
2024-04-22 AutoAD III: The Prequel -- Back to the Pixels Tengda Han et.al. 2404.14412v1 null
2024-04-22 Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses Inhee Lee et.al. 2404.14410v1 null
2024-04-22 Hyp-OC: Hyperbolic One Class Classification for Face Anti-Spoofing Kartik Narayan et.al. 2404.14406v1 null
2024-04-22 A mean curvature flow arising in adversarial training Leon Bungert et.al. 2404.14402v1 null
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381v1 link
2024-04-22 Rethinking Legal Compliance Automation: Opportunities with Large Language Models Shabnam Hassani et.al. 2404.14356v1 null
2024-04-22 On-the-Fly Point Annotation for Fast Medical Video Labeling Meyer Adrien et.al. 2404.14344v1 null
2024-04-22 X-Ray: A Sequential 3D Representation for Generation Tao Hu et.al. 2404.14329v1 null
2024-04-22 A Novel Approach to Chest X-ray Lung Segmentation Using U-net and Modified Convolutional Block Attention Module Mohammad Ali Labbaf Khaniki et.al. 2404.14322v1 null
2024-04-22 "I Upload...All Types of Different Things to Say, the World of Blindness Is More Than What They Think It Is": A Study of Blind TikTokers' Identity Work from a Flourishing Perspective Yao Lyu et.al. 2404.14305v1 null
2024-04-19 Data Alignment for Zero-Shot Concept Generation in Dermatology AI Soham Gadgil et.al. 2404.13043v1 null
2024-04-19 PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation Tianyuan Zhang et.al. 2404.13026v1 null
2024-04-19 BANF: Band-limited Neural Fields for Levels of Detail Reconstruction Ahan Shabanov et.al. 2404.13024v1 null
2024-04-19 Stronger Random Baselines for In-Context Learning Gregory Yauney et.al. 2404.13020v1 link
2024-04-19 A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks Ronglei Ji et.al. 2404.13018v1 null
2024-04-19 Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction Paulo Henrique dos Santos et.al. 2404.13002v1 null
2024-04-19 RadRotator: 3D Rotation of Radiographs with Diffusion Models Pouria Rouzrokh et.al. 2404.13000v1 null
2024-04-19 Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture Zarif Ahmed et.al. 2404.12986v1 null
2024-04-19 Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics Xiaofei Wang et.al. 2404.12973v1 null
2024-04-19 Improving Pediatric Pneumonia Diagnosis with Adult Chest X-ray Images Utilizing Contrastive Learning and Embedding Similarity Mohammad Zunaed et.al. 2404.12958v1 null
2024-04-18 On the Content Bias in Fréchet Video Distance Songwei Ge et.al. 2404.12391v1 null
2024-04-18 Moving Object Segmentation: All You Need Is SAM (and Flow) Junyu Xie et.al. 2404.12389v1 null
2024-04-18 VideoGigaGAN: Towards Detail-rich Video Super-Resolution Yiran Xu et.al. 2404.12388v1 null
2024-04-18 Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Aitor Ormazabal et.al. 2404.12387v1 null
2024-04-18 G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis Yufei Ye et.al. 2404.12383v1 null
2024-04-18 Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Monocular Videos Isabella Liu et.al. 2404.12379v1 null
2024-04-18 RoboDreamer: Learning Compositional World Models for Robot Imagination Siyuan Zhou et.al. 2404.12377v1 null
2024-04-18 When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes Asaf Yehudai et.al. 2404.12365v1 null
2024-04-18 Inverse Neural Rendering for Explainable Multi-Object Tracking Julian Ost et.al. 2404.12359v1 null
2024-04-18 Improving the interpretability of GNN predictions through conformal-based graph sparsification Pablo Sanchez-Martin et.al. 2404.12356v1 link
2024-04-18 Dynamic Typography: Bringing Text to Life via Video Diffusion Prior Zichen Liu et.al. 2404.11614v2 null
2024-04-17 VG4D: Vision-Language Model Goes 4D Video Recognition Zhichao Deng et.al. 2404.11605v1 link
2024-04-17 Variational Bayesian Last Layers James Harrison et.al. 2404.11599v1 link
2024-04-17 State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend Fei Cui et.al. 2404.11576v1 null
2024-04-17 Simple Image Signal Processing using Global Context Guidance Omar Elezabi et.al. 2404.11569v1 link
2024-04-17 Spatio-Temporal Motion Retargeting for Quadruped Robots Taerim Yoon et.al. 2404.11557v1 null
2024-04-17 Predicting Long-horizon Futures by Conditioning on Geometry and Time Tarasha Khurana et.al. 2404.11554v1 null
2024-04-17 Carbon- and Oxygen-rich stars in MaStar: identification and classification Lewis Hill et.al. 2404.11541v1 null
2024-04-17 GenFighter: A Generative and Evolutive Textual Attack Removal Md Athikul Islam et.al. 2404.11538v1 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537v1 null
2024-04-16 COMBO: Compositional World Models for Embodied Multi-Agent Cooperation Hongxin Zhang et.al. 2404.10775v1 null
2024-04-16 RapidVol: Rapid Reconstruction of 3D Ultrasound Volumes from Sensorless 2D Scans Mark C. Eid et.al. 2404.10766v1 null
2024-04-16 Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification Yu-Yang Li et.al. 2404.10757v1 null
2024-04-16 Integer-valued o-minimal functions Neer Bhardwaj et.al. 2404.10737v1 null
2024-04-16 Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning Hao-Lun Hsu et.al. 2404.10728v1 null
2024-04-16 AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation Zexin Li et.al. 2404.10714v1 null
2024-04-17 Dual Modalities of Text: Visual and Textual Generative Pre-training Yekun Chai et.al. 2404.10710v2 null
2024-04-16 Question Difficulty Ranking for Multiple-Choice Reading Comprehension Vatsal Raina et.al. 2404.10704v1 null
2024-04-16 Retrieval Augmented Verification : Unveiling Disinformation with Structured Representations for Zero-Shot Real-Time Evidence-guided Fact-Checking of Multi-modal Social media posts Arka Ujjal Dey et.al. 2404.10702v1 null
2024-04-16 Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs Georgy Perevozchikov et.al. 2404.10700v1 null
2024-04-15 Squish Jamming Samuel Poincloux et.al. 2404.09773v1 null
2024-04-15 Hilti SLAM Challenge 2023: Benchmarking Single + Multi-session SLAM across Sensor Constellations in Construction Ashish Devadas Nair et.al. 2404.09765v1 null
2024-04-15 Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies Monika Górka et.al. 2404.09761v1 null
2024-04-15 Quantization of Large Language Models with an Overdetermined Basis Daniil Merkulov et.al. 2404.09737v1 null
2024-04-15 FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features Andre Rochow et.al. 2404.09736v1 null
2024-04-15 Classification of finite type fusion quivers Ben Elias et.al. 2404.09714v1 null
2024-04-15 LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models Guangyan Li et.al. 2404.09695v1 null
2024-04-15 Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration Chenwei Lin et.al. 2404.09690v1 null
2024-04-15 Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition Tobias Weber et.al. 2404.09683v1 link
2024-04-15 Cluster analysis of the Roma-BZCAT blazars D. O. Kudryavtsev et.al. 2404.09667v1 null
2024-04-15 Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis Alessa Hering et.al. 2404.09666v1 null
2024-04-15 Closing the Gap in the Trade-off between Fair Representations and Accuracy Biswajit Rout et.al. 2404.09664v1 null
2024-04-15 If there's a Trigger Warning, then where's the Trigger? Investigating Trigger Warnings at the Passage Level Matti Wiegmann et.al. 2404.09615v1 link
2024-04-12 FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models Yanting Wang et.al. 2404.08631v1 null
2024-04-12 Classification of Boolean Algebras through von Neumann regular $\mathcal{C}^{\infty}-$Rings Jean Cerqueira Berni et.al. 2404.08629v1 null
2024-04-12 Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation Yanhao Zheng et.al. 2404.08603v1 link
2024-04-12 Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation Abu Bakor Hayat Arnob et.al. 2404.08584v1 link
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580v1 null
2024-04-12 IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic Chirag Parikh et.al. 2404.08561v1 null
2024-04-12 Scalability in Building Component Data Annotation: Enhancing Facade Material Classification with Synthetic Data Josie Harrison et.al. 2404.08557v1 null
2024-04-12 Benchmarking the Cell Image Segmentation Models Robustness under the Microscope Optical Aberrations Boyuan Peng et.al. 2404.08549v1 null
2024-04-12 VertAttack: Taking advantage of Text Classifiers' horizontal vision Jonathan Rusert et.al. 2404.08538v1 null
2024-04-12 Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection Zhiwei Yang et.al. 2404.08531v1 null
2024-04-11 Connecting NeRFs, Images, and Text Francesco Ballerini et.al. 2404.07993v1 null
2024-04-11 GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh Jing Wen et.al. 2404.07991v1 null
2024-04-11 WaveMo: Learning Wavefront Modulations to See Through Scattering Mingyang Xie et.al. 2404.07985v1 null
2024-04-11 Gaga: Group Any Gaussians via 3D-aware Memory Bank Weijie Lyu et.al. 2404.07977v1 null
2024-04-11 FusionMamba: Efficient Image Fusion with State Space Model Siran Peng et.al. 2404.07932v1 null
2024-04-11 HGRN2: Gated Linear RNNs with State Expansion Zhen Qin et.al. 2404.07904v1 link
2024-04-11 Q-ITAGS: Quality-Optimized Spatio-Temporal Heterogeneous Task Allocation with a Time Budget Glen Neville et.al. 2404.07902v1 null
2024-04-11 Auditing health-related recommendations in social media: A Case Study of Abortion on YouTube Mohammed Lahsaini et.al. 2404.07896v1 null
2024-04-11 Typical blocks of the category $\mathcal O$ and Whittaker modules for Takiff superalgebras Chih-Whi Chen et.al. 2404.07894v1 null
2024-04-11 Context-aware Video Anomaly Detection in Long-Term Datasets Zhengye Yang et.al. 2404.07887v1 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199v1 null
2024-04-10 GCV-Turbo: End-to-end Acceleration of GNN-based Computer Vision Tasks on FPGA Bingyi Zhang et.al. 2404.07188v1 null
2024-04-10 Adinkras and Pure Spinors Richard Eager et.al. 2404.07167v1 null
2024-04-10 Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations Ofir Shifman et.al. 2404.07153v1 null
2024-04-10 Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization Michael Kohler et.al. 2404.07128v1 null
2024-04-10 Measuring proximity to standard planes during fetal brain ultrasound scanning Chiara Di Vece et.al. 2404.07124v1 null
2024-04-10 "My toxic trait is thinking I'll remember this": gaps in the learner experience of video tutorials for feature-rich software Ian Drosos et.al. 2404.07114v1 null
2024-04-10 The generic dual of p-adic groups and applications Chris Jantzen et.al. 2404.07111v1 null
2024-04-10 Learning Priors for Non Rigid SfM from Casual Videos Yoni Kasten et.al. 2404.07097v1 null
2024-04-10 VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning Alexandros Xenos et.al. 2404.07078v1 link
2024-04-09 MoReVQA: Exploring Modular Reasoning Models for Video Question Answering Juhong Min et.al. 2404.06511v1 null
2024-04-10 Reconstructing Hand-Held Objects in 3D Jane Wu et.al. 2404.06507v2 null
2024-04-09 A Machine Learning Framework for the Prediction of Grain Boundary Segregation in Chemically Complex Environments Doruk Aksoy et.al. 2404.06499v1 null
2024-04-10 Flying with Photons: Rendering Novel Views of Propagating Light Anagh Malik et.al. 2404.06493v2 null
2024-04-09 Uncovering Tidal Treasures: Automated Classification of Faint Tidal Features in DECaLS Data Alexander J. Gordon et.al. 2404.06487v1 null
2024-04-09 RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos Bochao Zou et.al. 2404.06483v1 null
2024-04-09 Laue Indexing with Optimal Transport Tomasz Kacprzak et.al. 2404.06478v1 link
2024-04-09 A comparative analysis of deep learning models for lung segmentation on X-ray images Weronika Hryniewska-Guzik et.al. 2404.06455v1 link
2024-04-09 QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Yash Mehan et.al. 2404.06442v1 null
2024-04-09 ClassiPyGRB: Machine Learning-Based Classification and Visualization of Gamma Ray Bursts using t-SNE Keneth Garcia-Cifuentes et.al. 2404.06439v1 null
2024-04-08 MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Bo He et.al. 2404.05726v1 null
2024-04-08 Predicting Overtakes in Trucks Using CAN Data Talha Hanif Butt et.al. 2404.05723v1 null
2024-04-08 Case Study: Neural Network Malware Detection Verification for Feature and Image Datasets Preston K. Robinette et.al. 2404.05703v1 null
2024-04-08 Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding Ahmad Idrissi-Yaghir et.al. 2404.05694v1 null
2024-04-08 Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery Ionut M. Motoi et.al. 2404.05693v1 null
2024-04-08 AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation Jiannan Ge et.al. 2404.05667v1 null
2024-04-08 Oblique photons, plasmons, and current-plasmons in relativistic plasmas and their topological implications Hong Qin et.al. 2404.05636v1 null
2024-04-08 AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets Pietro Lesci et.al. 2404.05623v1 null
2024-04-08 Experimental observation of a time rondeau crystal: Temporal Disorder in Spatiotemporal Order Leo Joon Il Moon et.al. 2404.05620v1 null
2024-04-08 Self-Explainable Affordance Learning with Embodied Caption Zhipeng Zhang et.al. 2404.05603v1 null
2024-04-05 On classification of global dynamics for energy-critical equivariant harmonic map heat flows and radial nonlinear heat equation Kihyun Kim et.al. 2404.04247v1 null
2024-04-05 Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism Trilokesh Ranjan Sarkar et.al. 2404.04245v1 null
2024-04-05 player2vec: A Language Modeling Approach to Understand Player Behavior in Games Tianze Wang et.al. 2404.04234v1 null
2024-04-05 Deep-learning Segmentation of Small Volumes in CT images for Radiotherapy Treatment Planning Jianxin Zhou et.al. 2404.04202v1 null
2024-04-05 SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers Weile Li et.al. 2404.04179v1 link
2024-04-05 Noisy Label Processing for Classification: A Survey Mengting Li et.al. 2404.04159v1 null
2024-04-05 Improving Detection in Aerial Images by Capturing Inter-Object Relationships Botao Ren et.al. 2404.04140v1 null
2024-04-05 Label Propagation for Zero-shot Classification with Vision-Language Models Vladan Stojnić et.al. 2404.04072v1 link
2024-04-05 VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots Akhil Padmanabha et.al. 2404.04066v1 null
2024-04-05 Phase Binarization in Mutually Synchronized Bias Field-free Spin Hall Nano-oscillators for Reservoir Computing Sourabh Manna et.al. 2404.04023v1 null
2024-04-04 OW-VISCap: Open-World Video Instance Segmentation and Captioning Anwesa Choudhuri et.al. 2404.03657v1 null
2024-04-04 Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation Shuting He et.al. 2404.03645v1 link
2024-04-04 On the Efficiency of Convolutional Neural Networks Andrew Lavin et.al. 2404.03617v1 null
2024-04-04 Creator Hearts: Investigating the Impact Positive Signals from YouTube Creators in Shaping Comment Section Behavior Frederick Choi et.al. 2404.03612v1 null
2024-04-04 InsectMamba: Insect Pest Classification with State Space Model Qianning Wang et.al. 2404.03611v1 null
2024-04-04 DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images Zhou Jie et.al. 2404.03595v1 link
2024-04-04 Alzheimer's disease detection in PSG signals Lorena Gallego-Viñarás et.al. 2404.03549v1 null
2024-04-04 Towards Transcranial 3D Ultrasound Localization Microscopy of the Nonhuman Primate Brain Paul Xing et.al. 2404.03547v1 null
2024-04-04 Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models Siyuan Mei et.al. 2404.03541v1 null
2024-04-05 A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data Iqra Bano et.al. 2404.03493v2 null
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds Kamalika Chaudhuri et.al. 2404.02866v1 link
2024-04-03 Semisimple Algebras of Vector Fields on $\mathbb{C}^{3}$ Sajid Ali et.al. 2404.02847v1 null
2024-04-03 GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation Meher Niger et.al. 2404.02813v1 null
2024-04-03 Generative-Contrastive Heterogeneous Graph Neural Network Yu Wang et.al. 2404.02810v1 null
2024-04-03 FPT: Feature Prompt Tuning for Few-shot Readability Assessment Ziyang Wang et.al. 2404.02772v1 link
2024-04-03 DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Hao Wu et.al. 2404.02755v1 null
2024-04-03 Terraced Compression Method with Automated Threshold Selection for Multidimensional Image Clustering of Heterogeneous Bodies Jiatong Li et.al. 2404.02744v1 null
2024-04-03 Event Camera Demosaicing via Swin Transformer and Pixel-focus Loss Yunfan Lu et.al. 2404.02731v1 link
2024-04-03 Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps via LLM Zhe Liu et.al. 2404.02706v1 null
2024-04-02 Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 Multiparametric quantification and visualization of liver fat using ultrasound Jihye Baek et.al. 2404.02143v1 null
2024-04-03 ResNet with Integrated Convolutional Block Attention Module for Ship Classification Using Transfer Learning on Optical Satellite Imagery Ryan Donghan Kwon et.al. 2404.02135v2 null
2024-04-02 ViTamin: Designing Scalable Vision Models in the Vision-Language Era Jienneg Chen et.al. 2404.02132v1 link
2024-04-02 ImageNot: A contrast with ImageNet preserves model rankings Olawale Salaudeen et.al. 2404.02112v1 null
2024-04-02 CameraCtrl: Enabling Camera Control for Text-to-Video Generation Hao He et.al. 2404.02101v1 link
2024-04-02 Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows Grace Guo et.al. 2404.02081v1 null
2024-04-02 Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation Hui Xiao et.al. 2404.02065v1 null
2024-04-02 Long-context LLMs Struggle with Long In-context Learning Tianle Li et.al. 2404.02060v1 link
2024-04-02 Deconstructing In-Context Learning: Understanding Prompts via Corruption Namrata Shivagunde et.al. 2404.02054v1 link
2024-03-29 Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations Jaisidh Singh et.al. 2403.20312v1 link
2024-03-29 Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation Fangxu Yu et.al. 2403.20289v1 link
2024-03-29 Prototype-based Interpretable Breast Cancer Prediction Models: Analysis and Challenges Shreyasi Pathak et.al. 2403.20260v1 null
2024-03-29 Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions Runhao Zeng et.al. 2403.20254v1 null
2024-03-29 Latent Embedding Clustering for Occlusion Robust Head Pose Estimation José Celestino et.al. 2403.20251v1 null
2024-03-29 Long-Tailed Anomaly Detection with Learnable Class Names Chih-Hui Ho et.al. 2403.20236v1 null
2024-04-02 Artificial Neural Networks-based Real-time Classification of ENG Signals for Implanted Nerve Interfaces Antonio Coviello et.al. 2403.20234v2 null
2024-03-29 MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark Sanghyun Woo et.al. 2403.20225v1 null
2024-03-29 Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science Yazheng Yang et.al. 2403.20208v1 null
2024-03-29 The Future of Combating Rumors? Retrieval, Discrimination, and Generation Junhao Xu et.al. 2403.20204v1 null
2024-03-28 RSMamba: Remote Sensing Image Classification with State Space Model Keyan Chen et.al. 2403.19654v1 link
2024-03-28 Square patterns in dynamical orbits Vefa Goksel et.al. 2403.19642v1 null
2024-03-28 Siamese Vision Transformers are Scalable Audio-visual Learners Yan-Bo Lin et.al. 2403.19638v1 null
2024-03-28 Four-dimensional gradient Ricci solitons with (half) nonnegative isotropic curvature Huai-Dong Cao et.al. 2403.19627v1 null
2024-03-28 Top-$k$ Classification and Cardinality-Aware Prediction Anqi Mao et.al. 2403.19625v1 null
2024-03-28 RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Zeren Chen et.al. 2403.19622v1 null
2024-03-28 SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects Avinash Ummadisingu et.al. 2403.19607v1 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600v1 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593v1 null
2024-03-28 Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation Zhongliang Zhou et.al. 2403.19584v1 null
2024-03-27 MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering Guoxing Sun et.al. 2403.18820v1 null
2024-03-27 Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction Yiyao Zhang et.al. 2403.18776v1 null
2024-03-27 CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning Elliot Chane-Sane et.al. 2403.18765v1 null
2024-03-27 A vascular synthetic model for improved aneurysm segmentation and detection via Deep Neural Networks Rafic Nader et.al. 2403.18734v1 null
2024-03-27 Contrastive Learning with Orthonormal Anchors (CLOA) Huanran Li et.al. 2403.18699v1 null
2024-03-27 Annolid: Annotate, Segment, and Track Anything You Need Chen Yang et.al. 2403.18690v1 null
2024-03-27 InceptionTime vs. Wavelet -- A comparison for time series classification Daniel Klenkert et.al. 2403.18687v1 null
2024-03-27 TransFusion: Contrastive Learning with Transformers Huanran Li et.al. 2403.18681v1 null
2024-03-28 FluxGAT: Integrating Flux Sampling with Graph Neural Networks for Unbiased Gene Essentiality Classification Kieren Sharma et.al. 2403.18666v2 null
2024-03-27 Indecomposable set-theoretical solutions to the Yang-Baxter equation of size $p^2$ Carsten Dietzel et.al. 2403.18653v1 null
2024-03-26 Efficient Video Object Segmentation via Modulated Cross-Attention Memory Abdelrahman Shaker et.al. 2403.17937v1 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935v1 link
2024-03-26 Track Everything Everywhere Fast and Robustly Yunzhou Song et.al. 2403.17931v1 null
2024-03-26 FastCAR: Fast Classification And Regression Multi-Task Learning via Task Consolidation for Modelling a Continuous Property Variable of Object Classes Anoop Kini et.al. 2403.17926v1 null
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921v1 link
2024-03-26 TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani et.al. 2403.17920v1 null
2024-03-26 AgentStudio: A Toolkit for Building General Virtual Agents Longtao Zheng et.al. 2403.17918v1 null
2024-03-26 Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos Akshay Paruchuri et.al. 2403.17915v1 null
2024-03-26 Hierarchical Multi-label Classification for Fine-level Event Extraction from Aviation Accident Reports Xinyu Zhao et.al. 2403.17914v1 null
2024-03-25 DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking Yichuan Li et.al. 2403.16786v1 null
2024-03-25 C-arm inverse geometry CT for 3D cardiac chamber mapping Jordan M. Slagowski et.al. 2403.16779v1 null
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776v1 null
2024-03-25 As Good As A Coin Toss Human detection of AI-generated images, videos, audio, and audiovisual stimuli Di Cooke et.al. 2403.16760v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models Nils Ingelhag et.al. 2403.16730v1 null
2024-03-25 One-Shot Domain Incremental Learning Yasushi Esaki et.al. 2403.16707v1 null
2024-03-25 Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer Dominik Müller et.al. 2403.16695v1 null
2024-03-25 DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks Dominik Müller et.al. 2403.16678v1 link
2024-03-25 FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression Alireza Furutanpey et.al. 2403.16677v1 null
2024-03-25 A Novel Loss Function-based Support Vector Machine for Binary Classification Yan Li et.al. 2403.16654v1 null
2024-03-25 Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution Qingping Zheng et.al. 2403.16643v1 null
2024-03-25 Multi-Scale Texture Loss for CT denoising with GANs Francesco Di Feola et.al. 2403.16640v1 link
2024-03-25 AI-Generated Video Detection via Spatio-Temporal Anomaly Learning Jianfa Bai et.al. 2403.16638v1 null
2024-03-25 Distributed collaborative anomalous sound detection by embedding sharing Kota Dohi et.al. 2403.16610v1 null
2024-03-25 EDUE: Expert Disagreement-Guided One-Pass Uncertainty Estimation for Medical Image Segmentation Kudaibergen Abutalip et.al. 2403.16594v1 null
2024-03-22 LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Yuzhang Shang et.al. 2403.15388v1 null
2024-03-22 Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression Hongyan Liu et.al. 2403.15379v1 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378v1 null
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377v1 null
2024-03-22 Cascading Blackout Severity Prediction with Statistically-Augmented Graph Neural Networks Joe Gorka et.al. 2403.15363v1 null
2024-03-22 SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series Badri N. Patro et.al. 2403.15360v1 null
2024-03-22 Ultrasound Imaging based on the Variance of a Diffusion Restoration Model Yuxin Zhang et.al. 2403.15316v1 null
2024-03-22 Global Control for Local SO(3)-Equivariant Scale-Invariant Vessel Segmentation Patryk Rygiel et.al. 2403.15314v1 null
2024-03-22 Quantum-inspired classification via efficient simulation of Helstrom measurement Wooseop Hwang et.al. 2403.15308v1 null
2024-03-22 Reconnaissance ultracool spectra in the Euclid Deep Fields Jerry Jun-Yan Zhang et.al. 2403.15288v1 null
2024-03-21 Language Repository for Long Video Understanding Kumara Kahatapitiya et.al. 2403.14622v1 link
2024-03-22 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v2 null
2024-03-21 Explorative Inbetweening of Time and Space Haiwen Feng et.al. 2403.14611v1 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602v1 null
2024-03-21 PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model Zheng Zhang et.al. 2403.14598v1 link
2024-03-21 Large Language Models for Multi-Choice Question Classification of Medical Subjects Víctor Ponce-López et.al. 2403.14582v1 null
2024-03-21 DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video Narek Tumanyan et.al. 2403.14548v1 null
2024-03-21 Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images Tom Burgert et.al. 2403.14547v1 null
2024-03-21 Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets Ahmet Alp Kindiroglu et.al. 2403.14534v1 link
2024-03-21 Invisible Needle Detection in Ultrasound: Leveraging Mechanism-Induced Vibration Chenyang Li et.al. 2403.14523v1 null
2024-03-21 Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting Alicia Durrer et.al. 2403.14499v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 Hierarchical NeuroSymbolic Approach for Action Quality Assessment Lauren Okamoto et.al. 2403.13798v1 null
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797v1 null
2024-03-20 The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency and Usability in AI Matt White et.al. 2403.13784v1 null
2024-03-20 Gradings on associative triple systems of the second kind Alberto Daza-Garcia et.al. 2403.13775v1 null
2024-03-20 Towards Principled Representation Learning from Videos for Reinforcement Learning Dipendra Misra et.al. 2403.13765v1 null
2024-03-20 Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model Diwei Wang et.al. 2403.13756v1 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745v1 null
2024-03-20 Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes Yifan Chen et.al. 2403.13724v1 null
2024-03-20 Improving the Adaptive Moment Estimation (ADAM) stochastic optimizer through an Implicit-Explicit (IMEX) time-stepping approach Abhinab Bhattacharjee et.al. 2403.13704v1 null
2024-03-19 LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Zhuoshi Pan et.al. 2403.12968v1 null
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962v1 link
2024-03-19 WHAC: World-grounded Humans and Cameras Wanqi Yin et.al. 2403.12959v1 null
2024-03-19 FutureDepth: Learning to Predict the Future Improves Video Depth Estimation Rajeev Yasarla et.al. 2403.12953v1 null
2024-03-19 Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models Elaine Sui et.al. 2403.12952v1 link
2024-03-19 Legendrian loops and cluster modular groups James Hughes et.al. 2403.12951v1 null
2024-03-19 Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers Vidhi Jain et.al. 2403.12943v1 null
2024-03-19 Contextual AD Narration with Interleaved Multimodal Sequence Hanlin Wang et.al. 2403.12922v1 null
2024-03-19 Semantic Layering in Room Segmentation via LLMs Taehyeon Kim et.al. 2403.12920v1 null
2024-03-19 Yell At Your Robot: Improving On-the-Fly from Language Corrections Lucy Xiaoyang Shi et.al. 2403.12910v1 null
2024-03-18 Time Series Compression using Quaternion Valued Neural Networks and Quaternion Backpropagation Johannes Pöppelbaum et.al. 2403.11722v1 null
2024-03-18 Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing Juan Zhang et.al. 2403.11700v1 null
2024-03-18 A Spatial-Temporal Progressive Fusion Network for Breast Lesion Segmentation in Ultrasound Videos Zhengzheng Tu et.al. 2403.11699v1 null
2024-03-18 Object Segmentation-Assisted Inter Prediction for Versatile Video Coding Zhuoyuan Li et.al. 2403.11694v1 null
2024-03-19 MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation Haoyu Zhao et.al. 2403.11689v2 null
2024-03-18 Better (pseudo-)labels for semi-supervised instance segmentation François Porcher et.al. 2403.11675v1 null
2024-03-19 WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising Haoyu Zhao et.al. 2403.11672v2 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667v1 null
2024-03-18 Combining Local and Global Perception for Autonomous Navigation on Nano-UAVs Lorenzo Lamberti et.al. 2403.11661v1 null
2024-03-18 LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model Yuxin Cao et.al. 2403.11656v1 null
2024-03-15 Strong and Controllable Blind Image Decomposition Zeyu Zhang et.al. 2403.10520v1 link
2024-03-15 Frozen Feature Augmentation for Few-Shot Image Classification Andreas Bär et.al. 2403.10519v1 null
2024-03-15 VideoAgent: Long-form Video Understanding with Large Language Model as Agent Xiaohan Wang et.al. 2403.10517v1 null
2024-03-15 Surveyor: Facilitating Discovery Within Video Games for Blind and Low Vision Players Vishnu Nair et.al. 2403.10512v1 null
2024-03-15 Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study Chenguang Wang et.al. 2403.10499v1 link
2024-03-15 Joint Multimodal Transformer for Dimensional Emotional Recognition in the Wild Paul Waligora et.al. 2403.10488v1 null
2024-03-15 Tensor Star Decomposition Wuyang Zhou et.al. 2403.10481v1 null
2024-03-15 Using an LLM to Turn Sign Spottings into Spoken Language Sentences Ozge Mercanoglu Sincan et.al. 2403.10434v1 null
2024-03-15 Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks Peter Leer et.al. 2403.10420v1 null
2024-03-15 A comparative study on machine learning approaches for rock mass classification using drilling data Tom F. Hansen et.al. 2403.10404v1 null
2024-03-14 Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models Akhil Kedia et.al. 2403.09635v1 link
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 From the Conformal Anomaly to the Virasoro Algebra Sid Maibach et.al. 2403.09628v1 null
2024-03-14 Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Guo Chen et.al. 2403.09626v1 link
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623v1 link
2024-03-14 PosSAM: Panoptic Open-vocabulary Segment Anything Vibashan VS et.al. 2403.09620v1 null
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616v1 null
2024-03-14 Compute-first optical detection for noise-resilient visual perception Jungmin Kim et.al. 2403.09612v1 null
2024-03-14 Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds Ilyass Moummad et.al. 2403.09598v1 link
2024-03-14 DungeonMaker: Embedding Tangible Creation and Destruction in Hybrid Board Games through Personal Fabrication Technology Evgeny Stemasov et.al. 2403.09592v1 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-13 Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches Yun Xin Teoh et.al. 2403.08761v1 null
2024-03-13 MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning Jialv Zou et.al. 2403.08760v1 link
2024-03-13 Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08758v1 null
2024-03-13 DAM: Dynamic Adapter Merging for Continual Video QA Learning Feng Cheng et.al. 2403.08755v1 link
2024-03-13 Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08749v1 null
2024-03-13 Torsion pairs, t-structures, and co-t-structures for completions of discrete cluster categories Sofia Franchini et.al. 2403.08735v1 null
2024-03-13 Euclid: Testing photometric selection of emission-line galaxy targets M. S. Cagliari et.al. 2403.08726v1 null
2024-03-13 Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment Paraskevas Pegios et.al. 2403.08700v1 null
2024-03-13 Implicit Regularization of Gradient Flow on One-Layer Softmax Attention Heejune Sheen et.al. 2403.08699v1 null
2024-03-12 OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation Aadhithya Iyer et.al. 2403.07870v1 null
2024-03-12 TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation Shivin Dass et.al. 2403.07869v1 null
2024-03-12 Iterative Graph Neural Network Enhancement via Frequent Subgraph Mining of Explanations Harish G. Naik et.al. 2403.07849v1 null
2024-03-12 When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis Sahar Moradizeyveh et.al. 2403.07834v1 null
2024-03-12 DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies William Xie et.al. 2403.07832v1 null
2024-03-12 A geometric model for the module category of a string algebra Karin Baur et.al. 2403.07810v1 null
2024-03-12 BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives Ivo M. Baltruschat et.al. 2403.07800v1 null
2024-03-12 A robust SVM-based approach with feature selection and outliers detection for classification problems Marta Baldomero-Naranjo et.al. 2403.07753v1 null
2024-03-12 Vision-based Vehicle Re-identification in Bridge Scenario using Flock Similarity Chunfeng Zhang et.al. 2403.07752v1 null
2024-03-12 Harnessing two-photon dissipation for enhanced quantum measurement and control Antoine Marquet et.al. 2403.07744v1 null
2024-03-11 Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling Wele Gedara Chaminda Bandara et.al. 2403.06978v1 link
2024-03-12 VideoMamba: State Space Model for Efficient Video Understanding Kunchang Li et.al. 2403.06977v2 link
2024-03-11 Memory-based Adapters for Online 3D Scene Perception Xiuwei Xu et.al. 2403.06974v1 null
2024-03-11 Explainable Transformer Prototypes for Medical Diagnoses Ugur Demir et.al. 2403.06961v1 link
2024-03-11 Quadruped-Frog: Rapid Online Optimization of Continuous Quadruped Jumping Guillaume Bellegarda et.al. 2403.06954v1 null
2024-03-11 Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer Siddhant Satyanaik et.al. 2403.06953v1 null
2024-03-11 Advancing Generalizable Remote Physiological Measurement through the Integration of Explicit and Implicit Prior Knowledge Yuting Zhang et.al. 2403.06947v1 link
2024-03-11 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction Qing Xiao et.al. 2403.06940v1 null
2024-03-11 FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks Muhammad Saif Ullah Khan et.al. 2403.06904v1 null
2024-03-11 Benign overfitting in leaky ReLU networks with moderate input dimension Kedar Karhadkar et.al. 2403.06903v1 null
2024-03-08 Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos Tarun Kalluri et.al. 2403.05535v1 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532v1 null
2024-03-08 Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Machel Reid et.al. 2403.05530v1 null
2024-03-08 Take Your Best Shot: Sampling-Based Next-Best-View Planning for Autonomous Photography & Inspection Shijie Gao et.al. 2403.05477v1 null
2024-03-08 Will GPT-4 Run DOOM? Adrian de Wynter et.al. 2403.05468v1 null
2024-03-08 Evaluating AI and Human Authorship Quality in Academic Writing through Physics Essays Will Yeadon et.al. 2403.05458v1 null
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438v1 link
2024-03-08 OmniCount: Multi-label Object Counting with Semantic-Geometric Priors Anindya Mondal et.al. 2403.05435v1 null
2024-03-08 Infinite Translation Surfaces in the Wild Vincent Delecroix et.al. 2403.05424v1 null
2024-03-08 Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery Mubashir Noman et.al. 2403.05419v1 link
2024-03-07 DeepSee: Multidimensional Visualizations of Seabed Ecosystems Adam Coscia et.al. 2403.04761v1 link
2024-03-07 iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries Adam Coscia et.al. 2403.04760v1 link
2024-03-07 KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts Adam Coscia et.al. 2403.04758v1 link
2024-03-07 Preliminary Guidelines For Combining Data Integration and Visual Data Analysis Adam Coscia et.al. 2403.04757v1 link
2024-03-07 Photonic probabilistic machine learning using quantum vacuum noise Seou Choi et.al. 2403.04731v1 null
2024-03-07 Analysis of Systems' Performance in Natural Language Processing Competitions Sergio Nava-Muñoz et.al. 2403.04693v1 null
2024-03-07 CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Qilang Ye et.al. 2403.04640v1 link
2024-03-07 Scalable, Simulation-Guided Compliant Tactile Finger Design Yuxiang Ma et.al. 2403.04638v1 null
2024-03-08 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634v2 null
2024-03-07 MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder Lei Li et.al. 2403.04626v1 null
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Jesse Farebrother et.al. 2403.03950v1 null
2024-03-06 Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation Marcel Torne et.al. 2403.03949v1 null
2024-03-06 DART: Implicit Doppler Tomography for Radar Novel View Synthesis Tianshu Huang et.al. 2403.03896v1 null
2024-03-06 Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology Omar S. M. El Nahhas et.al. 2403.03891v1 link
2024-03-06 Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation Xiao Ma et.al. 2403.03890v1 null
2024-03-06 Decoupled Vertical Federated Learning for Practical Training on Vertically Partitioned Data Avi Amalanshu et.al. 2403.03871v1 null
2024-03-06 X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification Hanzi Xu et.al. 2403.03863v1 link
2024-03-06 ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction Luke Lozenski et.al. 2403.03860v1 null
2024-03-06 MedMamba: Vision Mamba for Medical Image Classification Yubiao Yue et.al. 2403.03849v1 link
2024-03-05 Extension Theory and Fermionic Strongly Fusion 2-Categories Thibault D. Décoppet et.al. 2403.03211v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181v1 link
2024-03-05 Deep-Learned Compression for Radio-Frequency Signal Classification Armani Rodriguez et.al. 2403.03150v1 null
2024-03-05 Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization Yuxin Guo et.al. 2403.03145v1 link
2024-03-05 Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation Robert Mendel et.al. 2403.03120v1 null
2024-03-05 Equilibria in Two-Stage Facility Location with Atomic Clients Simon Krogmann et.al. 2403.03114v1 null
2024-03-05 Galaxies in the Zone of Avoidance: Misclassifications using machine learning tools P. Marchant Cortés et.al. 2403.03098v1 null
2024-03-05 Collective self-caging of active filaments in virtual confinement Maximilian Kurjahn et.al. 2403.03093v1 null
2024-03-05 A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives Simone Alberto Peirone et.al. 2403.03037v1 null
2024-03-03 Enhancing Retinal Vascular Structure Segmentation in Images With a Novel Design Two-Path Interactive Fusion Module Model Rui Yang et.al. 2403.01362v1 null
2024-03-02 Improve Cost Efficiency of Active Learning over Noisy Dataset Zan-Kai Chong et.al. 2403.01346v1 null
2024-03-02 An eternal hypersurface flow arising in centro-affine geometry Xinjie Jiang et.al. 2403.01340v1 null
2024-03-02 Image-Based Dietary Assessment: A Healthy Eating Plate Estimation System Assylzhan Izbassar et.al. 2403.01310v1 null
2024-03-02 VNLP: Turkish NLP Package Meliksah Turker et.al. 2403.01309v1 null
2024-03-02 Towards a classification of $p^2$-discriminant ideal twins over number fields Alyson Deines et.al. 2403.01287v1 null
2024-03-02 $π$-systems and the Embedding problem for rank $2$ Kac-Moody Lie algebras Irfan Habib et.al. 2403.01285v1 null
2024-03-02 Fast Low-parameter Video Activity Localization in Collaborative Learning Environments Venkatesh Jatla et.al. 2403.01281v1 null
2024-03-02 Rigidity results for group von Neumann algebras with diffuse center Ionuţ Chifan et.al. 2403.01280v1 null
2024-03-02 Can a Confident Prior Replace a Cold Posterior? Martin Marek et.al. 2403.01272v1 link
2024-02-29 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Tsai-Shien Chen et.al. 2402.19479v1 null
2024-02-29 Towards Generalizable Tumor Synthesis Qi Chen et.al. 2402.19470v1 null
2024-02-29 Humanoid Locomotion as Next Token Prediction Ilija Radosavovic et.al. 2402.19469v1 null
2024-03-01 TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning Kate Sanders et.al. 2402.19467v2 null
2024-02-29 Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models Frederik Kunstner et.al. 2402.19449v1 null
2024-02-29 Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems Quentin Raymondaud et.al. 2402.19443v1 null
2024-02-29 Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation Jonathan Yang et.al. 2402.19432v1 null
2024-02-29 PaECTER: Patent-level Representation Learning using Citation-informed Transformers Mainak Ghosh et.al. 2402.19411v1 null
2024-02-29 Navigating Hallucinations for Reasoning of Unintentional Activities Shresth Grover et.al. 2402.19405v1 null
2024-02-29 A Newborn AGN in a Starforming Galaxy P. Arévalo et.al. 2402.19403v1 null
2024-02-28 Time-efficient filtering of polarimetric data by checking physical realizability of experimental Mueller matrices Tatiana Novikova et.al. 2402.18555v1 null
2024-02-28 Selection of appropriate multispectral camera exposure settings and radiometric calibration methods for applications in phenotyping and precision agriculture Vaishali Swaminathan et.al. 2402.18553v1 null
2024-02-28 Implicit Bias of Next-Token Prediction Christos Thrampoulidis et.al. 2402.18551v1 null
2024-02-28 Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures Andrei Cozma et.al. 2402.18527v1 null
2024-02-28 Do galaxy mergers prefer under-dense environments? U. Sureshkumar et.al. 2402.18520v1 null
2024-02-28 Log Neural Controlled Differential Equations: The Lie Brackets Make a Difference Benjamin Walker et.al. 2402.18512v1 null
2024-02-28 Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling Mahdi Karami et.al. 2402.18508v1 null
2024-02-28 Detection of Micromobility Vehicles in Urban Traffic Videos Khalil Sabri et.al. 2402.18503v1 link
2024-02-28 Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification Garima Chhikara et.al. 2402.18502v1 null
2024-02-28 ROG$_{PL}$: Robust Open-Set Graph Learning via Region-Based Prototype Learning Qin Zhang et.al. 2402.18495v1 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768v1 null
2024-02-27 Towards Optimal Learning of Language Models Yuxian Gu et.al. 2402.17759v1 null
2024-02-27 An Eye Gaze Heatmap Analysis of Uncertainty Head-Up Display Designs for Conditional Automated Driving Michael A. Gerber et.al. 2402.17751v1 null
2024-02-27 Scaling on-chip photonic neural processors using arbitrarily programmable wave propagation Tatsuhiro Onodera et.al. 2402.17750v1 link
2024-02-27 Linking Order to Strength in Metals Nicolas Argibay et.al. 2402.17728v1 null
2024-02-27 MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation Hanan Gani et.al. 2402.17725v1 link
2024-02-27 Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners Yazhou Xing et.al. 2402.17723v1 null
2024-02-27 Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers Yiwei Lu et.al. 2402.17710v1 null
2024-02-27 NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents Tamara Czinczoll et.al. 2402.17682v1 null
2024-02-27 MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning Huiyu Xiong et.al. 2402.17680v1 null
2024-02-26 Open Your Ears to Take a Look: A State-of-the-Art Report on the Integration of Sonification and Visualization Kajetan Enge et.al. 2402.16558v1 null
2024-02-26 LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification Yiping Song et.al. 2402.16515v1 null
2024-02-26 Photonic Neural Network Fabricated on Thin Film Lithium Niobate for High-Fidelity and Power-Efficient Matrix Computation Yong Zheng et.al. 2402.16513v1 null
2024-02-26 Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification Ahmad Saeed et.al. 2402.16486v1 null
2024-02-26 Edge Detectors Can Make Deep Convolutional Neural Networks More Robust Jin Ding et.al. 2402.16479v1 null
2024-02-26 Autonomous Integration of TSN-unaware Applications with QoS Requirements in TSN Networks Moritz Fluechter et.al. 2402.16454v1 null
2024-02-26 Retrouver l'inventeur-auteur : la lev{é}e d'homonymies d'autorat entre les brevets et les publications scientifiques David Reymond et.al. 2402.16440v1 null
2024-02-26 Improving behavior based authentication against adversarial attack using XAI Dong Qin et.al. 2402.16430v1 null
2024-02-26 Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction Wen-Yang Lu et.al. 2402.16371v1 null
2024-02-26 DEYO: DETR with YOLO for End-to-End Object Detection Haodong Ouyang et.al. 2402.16370v1 null
2024-02-26 SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation Hendrik Möller et.al. 2402.16368v1 link
2024-02-26 An Integrated Data Processing Framework for Pretraining Foundation Models Yiding Sun et.al. 2402.16358v1 link
2024-02-26 What Text Design Characterizes Book Genres? Daichi Haraguchi et.al. 2402.16356v1 null
2024-02-23 A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends Abolfazl Younesi et.al. 2402.15490v1 null
2024-02-23 Retinotopic Mapping Enhances the Robustness of Convolutional Neural Networks Jean-Nicolas Jérémie et.al. 2402.15480v1 null
2024-02-23 FAIR: Filtering of Automatically Induced Rules Divya Jyoti Bajpai et.al. 2402.15472v1 null
2024-02-23 GROS: A General Robust Aggregation Strategy Alejandro Cholaquidis et.al. 2402.15442v1 null
2024-02-23 Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales Shuren Qi et.al. 2402.15430v1 link
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429v1 link
2024-02-23 Understanding Entrainment in Human Groups: Optimising Human-Robot Collaboration from Lessons Learned during Human-Human Collaboration Eike Schneiders et.al. 2402.15427v1 null
2024-02-23 PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning Simon Holk et.al. 2402.15420v1 null
2024-02-23 G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups Sourya Basu et.al. 2402.15413v1 null
2024-02-23 A Universal Method for Solar Filament Detection from H-alpha Observations using Semi-supervised Deep Learning Andrea Diercke et.al. 2402.15407v1 null
2024-02-22 Link Prediction under Heterophily: A Physics-Inspired Graph Neural Network Approach Andrea Giuseppe Di Francesco et.al. 2402.14802v1 null
2024-02-22 Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Willi Menapace et.al. 2402.14797v1 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780v1 null
2024-02-22 Zero-Shot Pediatric Tuberculosis Detection in Chest X-Rays using Self-Supervised Learning Daniel Capellán-Martín et.al. 2402.14741v1 null
2024-02-22 Solitons of the mean curvature flow in $\mathbb{s}^2\times\mathbb{R}$ Rafael López et.al. 2402.14727v1 null
2024-02-22 A Transformer Model for Boundary Detection in Continuous Sign Language Razieh Rastgoo et.al. 2402.14720v1 null
2024-02-22 InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks Somnath Banerjee et.al. 2402.14702v1 null
2024-02-22 Big data analytics to classify earthwork-related locations: A Chengdu study Lei Yu et.al. 2402.14698v1 null
2024-02-22 Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off Futa Waseda et.al. 2402.14648v1 null
2024-02-22 Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving Eugen Šlapak et.al. 2402.14642v1 null
2024-02-21 A Simple and Yet Fairly Effective Defense for Graph Neural Networks Sofiane Ennadir et.al. 2402.13987v1 link
2024-02-21 On modular representations of inner forms of $\mathrm{GL}_n$ over a local non-archimedean field Johannes Droschl et.al. 2402.13969v1 null
2024-02-21 New directions in algebraic statistics: Three challenges from 2023 Yulia Alexandr et.al. 2402.13961v1 null
2024-02-21 On the topological classification of complex plane curve singularities Alberto Fernández-Hernández et.al. 2402.13941v1 null
2024-02-21 Verifying message-passing neural networks via topology-based bounds tightening Christopher Hojny et.al. 2402.13937v1 null
2024-02-21 Tumor segmentation on whole slide images: training or prompting? Huaqian Wu et.al. 2402.13932v1 null
2024-02-21 BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery Loddo Fabio et.al. 2402.13918v1 link
2024-02-21 An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach Mohammad Amaz Uddin et.al. 2402.13871v1 null
2024-02-21 RFI-DRUnet: Restoring dynamic spectra corrupted by radio frequency interference -- Application to pulsar observations Xiao Zhang et.al. 2402.13867v1 null
2024-02-21 What we can learn from TikTok through its Research API Francesco Corso et.al. 2402.13855v1 null
2024-02-20 Video ReCap: Recursive Captioning of Hour-Long Videos Md Mohaiminul Islam et.al. 2402.13250v1 null
2024-02-20 SMORE: Similarity-based Hyperdimensional Domain Adaptation for Multi-Sensor Time Series Classification Junyao Wang et.al. 2402.13233v1 null
2024-02-20 A Touch, Vision, and Language Dataset for Multimodal Alignment Letian Fu et.al. 2402.13232v1 null
2024-02-20 NeRF Solves Undersampled MRI Reconstruction Tae Jun Jang et.al. 2402.13226v1 null
2024-02-20 VideoPrism: A Foundational Visual Encoder for Video Understanding Long Zhao et.al. 2402.13217v1 null
2024-02-20 How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena Marco Gaido et.al. 2402.13208v1 null
2024-02-20 A novel image correction method for cloud-affected observations with Imaging Atmospheric Cherenkov Telescopes Natalia Żywucka et.al. 2402.13190v1 null
2024-02-20 UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing Jianhong Bai et.al. 2402.13185v1 null
2024-02-20 DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models Norman Di Palo et.al. 2402.13181v1 null
2024-02-20 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172v1 null
2024-02-19 Short-Period Variables in TESS Full-Frame Image Light Curves Identified via Convolutional Neural Networks Greg Olmschenk et.al. 2402.12369v1 null
2024-02-19 The first all-sky survey of star-forming galaxies with eROSITA: Scaling relations and a population of X-ray luminous starbursts E. Kyritsis et.al. 2402.12367v1 null
2024-02-19 An Adversarial Approach to Evaluating the Robustness of Event Identification Models Obai Bahwal et.al. 2402.12338v1 null
2024-02-19 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann et.al. 2402.12336v1 link
2024-02-19 Generating Survival Interpretable Trajectories and Data Andrei V. Konstantinov et.al. 2402.12331v1 null
2024-02-19 Asymptotic Gaussian Fluctuations of Eigenvectors in Spectral Clustering Hugo Lebeau et.al. 2402.12302v1 null
2024-02-19 Time-periodic behaviour in one- and two-dimensional interacting particle systems Jonas Köppl et.al. 2402.12300v1 null
2024-02-19 Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports Felix J. Dorfner et.al. 2402.12298v1 null
2024-02-19 Revisiting registration-based synthesis: A focus on unsupervised MR image synthesis Savannah P. Hays et.al. 2402.12288v1 null
2024-02-19 Significance of Chirp MFCC as a Feature in Speech and Audio Applications S. Johanan Joysingh et.al. 2402.12239v1 null
2024-02-16 PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter Junfei Xiao et.al. 2402.10896v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation Ziyang Wang et.al. 2402.10887v1 link
2024-02-16 Control Color: Multimodal Diffusion-based Interactive Image Colorization Zhexin Liang et.al. 2402.10855v1 null
2024-02-16 HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide Images Mobina Mansoori et.al. 2402.10851v1 null
2024-02-16 FedD2S: Personalized Data-Free Federated Knowledge Distillation Kawa Atapour et.al. 2402.10846v1 null
2024-02-16 Pedipulate: Enabling Manipulation Skills using a Quadruped Robot's Leg Philip Arm et.al. 2402.10837v1 null
2024-02-16 GAN-driven Electromagnetic Imaging of 2-D Dielectric Scatterers Ehtasham Naseer et.al. 2402.10831v1 null
2024-02-16 Structure results for torus fixed loci Jarod Alper et.al. 2402.10823v1 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821v1 link
2024-02-15 Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling Raunaq Bhirangi et.al. 2402.10211v1 null
2024-02-15 FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients Xinchi Qiu et.al. 2402.10191v1 null
2024-02-15 Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning Euclid Collaboration et.al. 2402.10187v1 link
2024-02-15 DeepSRGM -- Sequence Classification and Ranking in Indian Classical Music with Deep Learning Sathwik Tejaswi Madhusudhan et.al. 2402.10168v1 null
2024-02-15 Holographic covering and the fortuity of black holes Chi-Ming Chang et.al. 2402.10129v1 null
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095v1 null
2024-02-15 MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations Benedikt Alkin et.al. 2402.10093v1 link
2024-02-15 GraphCBAL: Class-Balanced Active Learning for Graph Neural Networks via Reinforcement Learning Chengcheng Yu et.al. 2402.10074v1 null
2024-02-15 Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence Weixiang Zhao et.al. 2402.10073v1 null
2024-02-15 NYCTALE: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction Sadaf Khademi et.al. 2402.10066v1 null
2024-02-14 LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning Adithya Raman et.al. 2402.09392v1 null
2024-02-14 GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly Ali Azizpour et.al. 2402.09381v1 link
2024-02-14 Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge Jiancheng Yang et.al. 2402.09372v1 null
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368v1 null
2024-02-14 Small instanton-induced flavor invariants and the axion potential Ravneet Bedi et.al. 2402.09361v1 null
2024-02-14 Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy Brice Rauby et.al. 2402.09359v1 null
2024-02-14 DoRA: Weight-Decomposed Low-Rank Adaptation Shih-Yang Liu et.al. 2402.09353v1 null
2024-02-14 Irreducible representations of the crystallisation of the $C^{*}$-algebra $C(SU_{q}(n+1))$ Manabendra Giri et.al. 2402.09347v1 null
2024-02-14 Registration of Longitudinal Spine CTs for Monitoring Lesion Growth Malika Sanhinova et.al. 2402.09341v1 null
2024-02-14 Stability and Multigroup Fairness in Ranking with Uncertain Predictions Siddartha Devic et.al. 2402.09326v1 null
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 A Convergence Analysis of Approximate Message Passing with Non-Separable Functions and Applications to Multi-Class Classification Burak Çakmak et.al. 2402.08676v1 null
2024-02-13 Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback Jenny Zhang et.al. 2402.08662v1 null
2024-02-13 BdSLW60: A Word-Level Bangla Sign Language Dataset Husne Ara Rubaiyeat et.al. 2402.08635v1 link
2024-02-13 Convolutional Neural Networks Towards Facial Skin Lesions Detection Reza Sarshar et.al. 2402.08592v1 null
2024-02-13 Totally geodesic submanifolds and polar actions on Stiefel manifolds Claudio Gorodski et.al. 2402.08585v1 null
2024-02-13 Motion-Adaptive Inference for Flexible Learned B-Frame Compression M. Akin Yilmaz et.al. 2402.08550v1 null
2024-02-13 Approximately Piecewise E(3) Equivariant Point Networks Matan Atzmon et.al. 2402.08529v1 null
2024-02-13 Reduced-order modeling of the dynamics of an inverted flag from experimental data Zhenwei Xu et.al. 2402.08504v1 null
2024-02-13 Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models Shaeke Salman et.al. 2402.08473v1 null
2024-02-13 Wavefront Randomization Improves Deconvolution Amit Kohli et.al. 2402.07900v2 null
2024-02-12 Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets Violet Liu et.al. 2402.07895v1 null
2024-02-12 Perfect stable regularity lemma and slice-wise stable hypergraphs Artem Chernikov et.al. 2402.07870v1 null
2024-02-12 On Computationally Efficient Multi-Class Calibration Parikshit Gopalan et.al. 2402.07821v1 null
2024-02-12 A Benchmark Grocery Dataset of Realworld Point Clouds From Single View Shivanand Venkanna Sheshappanavar et.al. 2402.07819v1 null
2024-02-12 Fixation for $\mathcal{U}$-Ising and $\mathcal{U}$-voter dynamics with frozen vertices Laure Marêché et.al. 2402.07807v1 null
2024-02-12 Estimation of non-uniform blur using a patch-based regression convolutional neural network (CNN) Luis G. Varela et.al. 2402.07796v1 null
2024-02-12 "Layer-by-layer" Unsupervised Clustering of Statistically Relevant Fluctuations in Noisy Time-series Data of Complex Dynamical Systems Matteo Becchi et.al. 2402.07786v1 null
2024-02-12 Solving parameter-dependent semi-algebraic systems Louis Gaillard et.al. 2402.07782v1 null
2024-02-12 Observations of the new meteor shower from comet 46P/Wirtanen D. Vida et.al. 2402.07769v1 null
2024-02-09 A two-stage algorithm in evolutionary product unit neural networks for classification Antonio J. Tallón-Ballesteros et.al. 2402.06622v1 null
2024-02-09 Image-based Deep Learning for the time-dependent prediction of fresh concrete properties Max Meyer et.al. 2402.06611v1 null
2024-02-09 SAE: Single Architecture Ensemble Neural Networks Martin Ferianc et.al. 2402.06580v1 null
2024-02-09 Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning Amir Ziai et.al. 2402.06560v1 link
2024-02-09 Self Supervised Learning for Improved Calibrationless Radial MRI with NLINV-Net Moritz Blumenthal et.al. 2402.06550v1 null
2024-02-09 Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA Marek Šuppa et.al. 2402.06549v1 null
2024-02-09 Feature Density Estimation for Out-of-Distribution Detection via Normalizing Flows Evan D. Cook et.al. 2402.06537v1 null
2024-02-09 Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification Muhammad Uzair Zahid et.al. 2402.06530v1 null
2024-02-09 Flexible infinite-width graph convolutional networks and the importance of representation learning Ben Anson et.al. 2402.06525v1 null
2024-02-09 Dynamic swarms regulate the morphology and distribution of soft membrane domains Aakanksha Gubbala et.al. 2402.06518v1 null
2024-02-08 Classifying Nodes in Graphs without GNNs Daniel Winter et.al. 2402.05934v1 link
2024-02-08 An Interactive Agent Foundation Model Zane Durante et.al. 2402.05929v1 null
2024-02-08 Point-VOS: Pointing Up Video Object Segmentation Idil Esen Zulfikar et.al. 2402.05917v1 null
2024-02-08 A Survey on Detection, Classification, and Tracking of Aerial Threats using Radar and Communications Systems Wahab Khawaja et.al. 2402.05909v1 null
2024-02-09 Large Language Model Meets Graph Neural Network in Knowledge Distillation Shengxiang Hu et.al. 2402.05894v2 null
2024-02-08 Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data Shufan Li et.al. 2402.05892v1 null
2024-02-08 CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion Shoubin Yu et.al. 2402.05889v1 null
2024-02-08 Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers Onur G. Guleryuz et.al. 2402.05887v1 link
2024-02-08 GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru Gabriela Pinto et.al. 2402.05882v1 link
2024-02-08 You've Got to Feel It To Believe It: Multi-Modal Bayesian Inference for Semantic and Property Prediction Parker Ewen et.al. 2402.05872v1 null
2024-02-07 Edu-ConvoKit: An Open-Source Library for Education Conversation Data Rose E. Wang et.al. 2402.05111v1 link
2024-02-07 Moduli Parameters of Complex Singularities with Non-Degenerate Newton Boundary Janko Boehm et.al. 2402.05093v1 null
2024-02-07 Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation Ziyang Wang et.al. 2402.05079v1 link
2024-02-07 Arbitrary Scale Super-Resolution Assisted Lunar Crater Detection in Satellite Images Atal Tewari et.al. 2402.05068v1 null
2024-02-07 Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty Hersh Vakharia et.al. 2402.05045v1 link
2024-02-07 PAC Learnability under Explanation-Preserving Graph Perturbations Xu Zheng et.al. 2402.05039v1 null
2024-02-07 Strong convexity-guided hyper-parameter optimization for flatter losses Rahul Yedida et.al. 2402.05025v1 null
2024-02-07 Example-based Explanations for Random Forests using Machine Unlearning Tanmay Surve et.al. 2402.05007v1 null
2024-02-07 Randomized Confidence Bounds for Stochastic Partial Monitoring Maxime Heuillet et.al. 2402.05002v1 null
2024-02-07 Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction Tobias Clement et.al. 2402.04982v1 null
2024-02-06 EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters Quan Sun et.al. 2402.04252v1 link
2024-02-06 The spectrum of excisive functors Gregory Arone et.al. 2402.04244v1 null
2024-02-06 A classification of nonzero skew immaculate functions Sarah Mason et.al. 2402.04219v1 null
2024-02-06 Resource-Aware Hierarchical Federated Learning in Wireless Video Caching Networks Md Ferdous Pervej et.al. 2402.04216v1 null
2024-02-06 "Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors Lin Guan et.al. 2402.04210v1 null
2024-02-06 3D Volumetric Super-Resolution in Radiology Using 3D RRDB-GAN Juhyung Ha et.al. 2402.04171v1 null
2024-02-06 Human Emotions Analysis and Recognition Using EEG Signals in Response to 360$^\circ$ Videos Haseeb ur Rahman Abbasi et.al. 2402.04142v1 null
2024-02-06 Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems Anton Borg et.al. 2402.04108v1 null
2024-02-06 Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction Shijun Liang et.al. 2402.04097v1 null
2024-02-06 A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation Zhengbo Wang et.al. 2402.04087v1 link
2024-02-05 Multiclass Classification Procedure for Detecting Attacks on MQTT-IoT Protocol Hector Alaiz-Moreton et.al. 2402.03270v1 null
2024-02-05 Security Advice for Parents and Children About Content Filtering and Circumvention as Found on YouTube and TikTok Ran Elgedawy et.al. 2402.03255v1 null
2024-02-05 JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching Antoine Magron et.al. 2402.03242v1 link
2024-02-05 FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition Xiaohu Huang et.al. 2402.03241v1 null
2024-02-05 IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images Vincent Roca et.al. 2402.03227v1 null
2024-02-05 English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts Patrick Barreiß et.al. 2402.03223v1 null
2024-02-05 "Define Your Terms" : Enhancing Efficient Offensive Speech Classification with Definition Huy Nghiem et.al. 2402.03221v1 link
2024-02-05 Isotropy, Clusters, and Classifiers Timothee Mickus et.al. 2402.03191v1 null
2024-02-06 Cool-chic video: Learned video coding with 800 parameters Thomas Leguay et.al. 2402.03179v2 null
2024-02-05 Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings Gonçalo Gomes et.al. 2402.03172v1 link
2024-02-02 From gas to stars: MUSEings on the internal evolution of IC 1613 S. Taibi et.al. 2402.01631v1 null
2024-02-02 Truncation technique for variational quantum eigensolver for Molecular Hamiltonians Qidong Xu et.al. 2402.01630v1 null
2024-02-02 L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders Ruikang Ouyang et.al. 2402.01614v1 link
2024-02-02 Immersive Video Compression using Implicit Neural Representations Ho Man Kwan et.al. 2402.01596v1 link
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590v1 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566v1 null
2024-02-02 Deep Continuous Networks Nergis Tomen et.al. 2402.01557v1 link
2024-02-02 SLYKLatent, a Learning Framework for Facial Features Estimation Samuel Adebayo et.al. 2402.01555v1 null
2024-02-02 Advancing Brain Tumor Inpainting with Generative Models Ruizhi Zhu et.al. 2402.01509v1 null
2024-02-02 Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses Mahboubeh Asadi et.al. 2402.01485v1 null
2024-02-01 We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline Simar Kareer et.al. 2402.00868v1 link
2024-02-01 Deep Room Impulse Response Completion Jackie Lin et.al. 2402.00859v1 null
2024-02-01 Early Time Classification with Accumulated Accuracy Gap Control Liran Ringel et.al. 2402.00857v1 link
2024-02-01 BootsTAP: Bootstrapped Training for Tracking-Any-Point Carl Doersch et.al. 2402.00847v1 link
2024-02-01 Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering Pinxin Liu et.al. 2402.00827v1 null
2024-02-01 Examining the Influence of Digital Phantom Models in Virtual Imaging Trials for Tomographic Breast Imaging Amar Kavuri et.al. 2402.00812v1 null
2024-02-01 ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models Zhixue Zhao et.al. 2402.00794v1 link
2024-02-01 Distinguishing the Indistinguishable: Human Expertise in Algorithmic Prediction Rohan Alur et.al. 2402.00793v1 link
2024-02-02 CroissantLLM: A Truly Bilingual French-English Language Model Manuel Faysse et.al. 2402.00786v2 link
2024-02-01 Hybrid Quantum Vision Transformers for Event Classification in High Energy Physics Eyup B. Unlu et.al. 2402.00776v1 null
2024-01-31 Classification-Oriented Semantic Wireless Communications Emrecan Kutay et.al. 2401.18069v1 null
2024-01-31 Rank Supervised Contrastive Learning for Time Series Classification Qianying Ren et.al. 2401.18057v1 null
2024-01-31 Variable selection for Naïve Bayes classification Rafael Blanquero et.al. 2401.18039v1 null
2024-01-31 Optimizing contrastive learning for cortical folding pattern detection Aymeric Gaudin et.al. 2401.18035v1 null
2024-01-31 A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 Darren Ramsook et.al. 2401.18021v1 null
2024-01-31 EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation Jonathan W. Kim et.al. 2401.18006v1 null
2024-01-31 Unsupervised Learning of Topological Non-Abelian Braiding in Non-Hermitian Bands Yang Long et.al. 2401.17968v1 null
2024-01-31 Error-Tolerant E-Discovery Protocols Jinshuo Dong et.al. 2401.17952v1 null
2024-01-31 HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction Harvie Zhang et.al. 2401.17948v1 null
2024-01-31 Probabilistic Photonic Computing with Chaotic Light Frank Brückerhoff-Plückelmann et.al. 2401.17915v1 null
2024-01-30 The SRG/eROSITA all-sky survey: Hard X-ray selected Active Galactic Nuclei Sophia G. H. Waddell et.al. 2401.17306v1 null
2024-01-30 Compact white-dwarf binaries in the combined SRG/eROSITA/SDSS eFEDS survey A. Schwope et.al. 2401.17304v1 null
2024-01-30 Searching for X-ray counterparts of unassociated Fermi-LAT sources and rotation-powered pulsars with SRG/eROSITA Martin G. F. Mayer et.al. 2401.17295v1 null
2024-01-30 X-ray AGNs with SRG/eROSITA: Multi-wavelength observations reveal merger triggering and post-coalescence circumnuclear blowout Robert W. Bickley et.al. 2401.17277v1 null
2024-01-30 ReacLLaMA: Merging chemical and textual information in chemical reactivity AI models Aline Hartgers et.al. 2401.17267v1 null
2024-01-30 SLIC: A Learned Image Codec Using Structure and Color Srivatsa Prativadibhayankaram et.al. 2401.17246v1 link
2024-01-31 Faster coloring and embedding in dense hypergraphs via stability Jianfeng Hou et.al. 2401.17219v2 null
2024-01-31 GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear Robert Konrad et.al. 2401.17217v2 null
2024-01-30 Single Word Change is All You Need: Designing Attacks and Defenses for Text Classifiers Lei Xu et.al. 2401.17196v1 null
2024-01-30 GraphViz2Vec: A Structure-aware Feature Generation Model to Improve Classification in GNNs Shraban Kumar Chatterjee et.al. 2401.17178v1 null
2024-01-29 Computer Vision for Primate Behavior Analysis in the Wild Richard Vogg et.al. 2401.16424v1 null
2024-01-29 Synchformer: Efficient Synchronization from Sparse Cues Vladimir Iashin et.al. 2401.16423v1 null
2024-01-29 Strategic Usage in a Multi-Learner Setting Eliot Shekhtman et.al. 2401.16422v1 null
2024-01-29 ReTaSA: A Nonparametric Functional Estimation Approach for Addressing Continuous Target Shift Hwanwoo Kim et.al. 2401.16410v1 null
2024-01-29 Is K-fold cross validation the best model selection method for Machine Learning? Juan M Gorriz et.al. 2401.16407v1 null
2024-01-29 Zero-shot Imitation Policy via Search in Demonstration Dataset Federco Malato et.al. 2401.16398v1 null
2024-01-29 Ovarian Cancer Diagnostics using Wavelet Packet Scaling Descriptors Raymond J. Hinton Jr. et.al. 2401.16396v1 null
2024-01-29 Evaluation of pseudo-healthy image reconstruction for anomaly detection with deep generative models: Application to brain FDG PET Ravi Hassanaly et.al. 2401.16363v1 link
2024-01-29 Curriculum-Based Reinforcement Learning for Quadrupedal Jumping: A Reference-free Design Vassil Atanassov et.al. 2401.16337v1 null
2024-01-29 Making the unmodulated Pyramid wavefront sensor smart. Closed-loop demonstration of neural network wavefront reconstruction with MagAO-X Rico Landman et.al. 2401.16325v1 null
2024-01-26 From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities Chaochao Lu et.al. 2401.15071v1 null
2024-01-26 Deep learning-based approach for tomato classification in complex scenes Mikael A. Mousse et.al. 2401.15055v1 null
2024-01-26 Non-Unitary $3 \times 3$ Mixing in Majorana Neutrinos and Vector-like Quark Models Pedro M. F. Pereira et.al. 2401.15049v1 null
2024-01-26 Machine learning-based analysis of glioma tissue sections: a review Jan-Philipp Redlich et.al. 2401.15022v1 null
2024-01-26 Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers Kerlos Atia Abdalmalak et.al. 2401.15018v1 null
2024-01-26 Graph-based Active Learning for Entity Cluster Repair Victor Christen et.al. 2401.14992v1 null
2024-01-26 Stokes graphs of the Rabi problem with real parameters René Langøen et.al. 2401.14991v1 null
2024-01-26 Minimum-dissipation principle for synchronised stochastic oscillators far from equilibrium Jan Meibohm et.al. 2401.14982v1 null
2024-01-26 Microwave lymphedema assessment using deep learning with contour assisted backprojection Yuyi Chang et.al. 2401.14970v1 null
2024-01-26 Hold Tight: Identifying Behavioral Patterns During Prolonged Work in VR through Video Analysis Verena Biener et.al. 2401.14920v1 null
2024-01-25 Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities Yiyuan Zhang et.al. 2401.14405v1 link
2024-01-25 Adaptive Mobile Manipulation for Articulated Objects In the Open World Haoyu Xiong et.al. 2401.14403v1 null
2024-01-25 Range-Agnostic Multi-View Depth Estimation With Keyframe Selection Andrea Conti et.al. 2401.14401v1 link
2024-01-25 Rethinking Patch Dependence for Masked Autoencoders Letian Fu et.al. 2401.14391v1 null
2024-01-25 Smooth Ranking SVM via Cutting-Plane Method Erhan Can Ozcan et.al. 2401.14388v1 link
2024-01-25 Inconsistency Masks: Removing the Uncertainty from Input-Pseudo-Label Pairs Michael R. H. Vorndran et.al. 2401.14387v1 link
2024-01-25 A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bengali Texts Kazi Toufique Elahi et.al. 2401.14360v1 link
2024-01-25 Computing Derivations on Nilpotent Quadratic Lie Algebras Pilar Benito et.al. 2401.14348v1 null
2024-01-25 Class-attribute Priors: Adapting Optimization to Heterogeneity and Fairness Objective Xuechen Zhang et.al. 2401.14343v1 null
2024-01-25 Progressive Multi-task Anti-Noise Learning and Distilling Frameworks for Fine-grained Vehicle Recognition Dichao Liu et.al. 2401.14336v1 link
2024-01-24 Tyche: Stochastic In-Context Learning for Medical Image Segmentation Marianne Rakic et.al. 2401.13650v1 null
2024-01-24 Quantifying the Impact of Frame Preemption on Combined TSN Shapers Rubi Debnath et.al. 2401.13631v1 null
2024-01-24 Can overfitted deep neural networks in adversarial training generalize? -- An approximation viewpoint Zhongjie Shi et.al. 2401.13624v1 null
2024-01-24 FLLIC: Functionally Lossless Image Compression Xi Zhang et.al. 2401.13616v1 null
2024-01-24 Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613v1 null
2024-01-24 Prompt Weight Experiments for LLM Instruction Fine-Tuning Mathew Huerta-Enochian et.al. 2401.13586v1 null
2024-01-24 WPDA: Frequency-based Backdoor Attack with Wavelet Packet Decomposition Zhengyao Song et.al. 2401.13578v1 null
2024-01-24 CNN architecture extraction on edge GPU Peter Horvath et.al. 2401.13575v1 null
2024-01-24 Benchmarking the Fairness of Image Upsampling Methods Mike Laszkiewicz et.al. 2401.13555v1 null
2024-01-24 PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition Otto Brookes et.al. 2401.13554v1 null
2024-01-23 SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI Hanxue Gu et.al. 2401.12974v1 null
2024-01-23 On the Efficacy of Text-Based Input Modalities for Action Anticipation Apoorva Beedu et.al. 2401.12972v1 null
2024-01-23 The role of environment and AGN feedback in quenching local galaxies: Comparing cosmological hydrodynamical simulations to the SDSS Paul H. Goubert et.al. 2401.12953v1 null
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945v1 null
2024-01-23 Long-range three-dimensional tracking of nanoparticles using interferometric scattering (iSCAT) microscopy Kiarash Kasaian et.al. 2401.12939v1 null
2024-01-23 Neural deformation fields for template-based reconstruction of cortical surfaces from MRI Fabian Bongratz et.al. 2401.12938v1 null
2024-01-23 Segmentation of tibiofemoral joint tissues from knee MRI using MtRA-Unet and incorporating shape information: Data from the Osteoarthritis Initiative Akshay Daydar et.al. 2401.12932v1 null
2024-01-23 pyAKI - An Open Source Solution to Automated KDIGO classification Christian Porschen et.al. 2401.12930v1 null
2024-01-23 Performance Analysis of Support Vector Machine (SVM) on Challenging Datasets for Forest Fire Detection Ankan Kar et.al. 2401.12924v1 null
2024-01-23 Advancing Glitch Classification in Gravity Spy: Multi-view Fusion with Attention-based Machine Learning for Advanced LIGO's Fourth Observing Run Yunan Wu et.al. 2401.12913v1 null
2024-01-22 Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition Haz Sameen Shahgir et.al. 2401.12210v1 null
2024-01-22 Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources Víctor Samuel Pérez-Díaz et.al. 2401.12203v1 link
2024-01-22 OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Peiqi Liu et.al. 2401.12202v1 null
2024-01-22 In-Context Learning for Extreme Multi-Label Classification Karel D'Oosterlinck et.al. 2401.12178v1 null
2024-01-22 Broiler-Net: A Deep Convolutional Framework for Broiler Behavior Analysis in Poultry Houses Tahereh Zarrat Ehsan et.al. 2401.12176v1 link
2024-01-22 VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games He Zhang et.al. 2401.12133v1 link
2024-01-22 Evaluation of QCNN-LSTM for Disability Forecasting in Multiple Sclerosis Using Sequential Multisequence MRI John D. Mayfield et.al. 2401.12132v1 null
2024-01-22 Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy Will LeVine et.al. 2401.12129v1 link
2024-01-22 Measures of the Capital Network of the U.S. Economy Ben Klemens et.al. 2401.12118v1 null
2024-01-22 A quantitative version of the Steinhaus theorem Alex Iosevich et.al. 2401.12112v1 null
2024-01-19 Classifying affine structures with focus-focus singularities Xiudi Tang et.al. 2401.10881v1 null
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857v1 null
2024-01-19 Emotion Classification In Software Engineering Texts: A Comparative Analysis of Pre-trained Transformers Language Models Mia Mohammad Imran et.al. 2401.10845v1 null
2024-01-19 Understanding Video Transformers via Universal Concept Discovery Matthew Kowal et.al. 2401.10831v1 null
2024-01-19 Long-Term Monitoring of the Oe Star VES 735: Ope! Not So Quiet After All Brandon Marshall et.al. 2401.10829v1 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822v1 null
2024-01-19 RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision Fernando Pérez-García et.al. 2401.10815v1 null
2024-01-19 Learning to Visually Connect Actions and their Effects Eric Peh et.al. 2401.10805v1 null
2024-01-19 Endovascular Detection of Catheter-Thrombus Contact by Vacuum Excitation Jared Lawson et.al. 2401.10804v1 null
2024-01-19 TDC-less Direct Time-of-Flight Imaging Using Spiking Neural Networks Jack MacLean et.al. 2401.10793v1 null
2024-01-18 Simultaneous Tactile Estimation and Control for Extrinsic Dexterity Antonia Bronars et.al. 2401.10230v1 null
2024-01-18 OMG-Seg: Is One Model Good Enough For All Segmentation? Xiangtai Li et.al. 2401.10229v1 link
2024-01-18 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Shilin Xu et.al. 2401.10228v1 link
2024-01-18 Towards Language-Driven Video Inpainting via Multimodal Large Language Models Jianzong Wu et.al. 2401.10226v1 null
2024-01-18 Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions Namitha Padmanabhan et.al. 2401.10217v1 null
2024-01-18 Transfer Learning in Human Activity Recognition: A Survey Sourish Gunesh Dhekane et.al. 2401.10185v1 null
2024-01-18 SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild Andreas Engelhardt et.al. 2401.10171v1 null
2024-01-19 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150v2 null
2024-01-18 Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain Study Alejandro Galán-Cuenca et.al. 2401.10129v1 null
2024-01-18 Sub2Full: split spectrum to boost OCT despeckling without clean data Lingyun Wang et.al. 2401.10128v1 link
2024-01-17 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Lianghui Zhu et.al. 2401.09417v1 link
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text Mazal Bethany et.al. 2401.09407v1 null
2024-01-17 Élivágar: Efficient Quantum Circuit Search for Classification Sashwat Anagolum et.al. 2401.09393v1 null
2024-01-17 Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid Luchuan Song et.al. 2401.09386v1 link
2024-01-17 New relations of pod partition and its connection with other partition functions Hemjyoti Nath et.al. 2401.09374v1 null
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336v1 link
2024-01-17 Machines Do See Color: A Guideline to Classify Different Forms of Racist Discourse in Large Corpora Diana Davila Gordillo et.al. 2401.09333v1 null
2024-01-17 Spectral Distribution Complexity of the Surface Fibrillatory Waves Predicts Post-Catheter Ablation Relapse in Persistent Atrial Fibrillation Pilar Escribano et.al. 2401.09297v1 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294v1 null
2024-01-16 From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers Jiu Feng et.al. 2401.08415v1 null
2024-01-16 Faster ISNet for Background Bias Mitigation on Deep Neural Networks Pedro R. A. S. Bassi et.al. 2401.08409v1 null
2024-01-16 Training and Comparison of nnU-Net and DeepMedic Methods for Autosegmentation of Pediatric Brain Tumors Arastoo Vossough et.al. 2401.08404v1 null
2024-01-16 High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering Xin Ming et.al. 2401.08398v1 null
2024-01-16 DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models Zongxin Yang et.al. 2401.08392v1 link
2024-01-16 We don't need no labels: Estimating post-deployment model performance under covariate shift without ground truth Jakub Białek et.al. 2401.08348v1 null
2024-01-16 Learn What You Need in Personalized Federated Learning Kexin Lv et.al. 2401.08327v1 link
2024-01-16 Application of LLM Agents in Recruitment: A Novel Framework for Resume Screening Chengguang Gan et.al. 2401.08315v1 null
2024-01-16 Central extensions of restricted Lie superalgebras and classification of $p$-nilpotent Lie superalgebras in dimension $4$ Sofiane Bouarroudj et.al. 2401.08313v1 null
2024-01-16 Evaluating online elasticity estimation of soft objects using standard robot grippers Shubhan P. Patni et.al. 2401.08298v1 null
2024-01-16 Multitask Learning in Minimally Invasive Surgical Vision: A Review Oluwatosin Alabi et.al. 2401.08256v1 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232v1 null
2024-01-16 Towards Causal Relationship in Indefinite Data: Baseline Model and New Datasets Hang Chen et.al. 2401.08221v1 link
2024-01-16 Ship Detection in SAR Images with Human-in-the-Loop Hecheng Jia et.al. 2401.08213v1 null
2024-01-16 ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification Zhongbin Fang et.al. 2401.08210v1 link
2024-01-12 Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements Anton Voronov et.al. 2401.06766v1 null
2024-01-12 Classification of singularities of cluster algebras of finite type II: coefficients Angélica Benito et.al. 2401.06758v1 null
2024-01-12 Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction Muhammad Naveed Riaz et.al. 2401.06757v1 null
2024-01-12 Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection Muhammad Tayyab Zamir et.al. 2401.06752v1 null
2024-01-12 Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization Niklas Kämper et.al. 2401.06747v1 null
2024-01-12 Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting Niklas Kämper et.al. 2401.06744v1 null
2024-01-12 Complexity Classification of Product State Problems for Local Hamiltonians John Kallaugher et.al. 2401.06725v1 null
2024-01-12 Obstacle-Aware Positioning of a Mobile Robotic Platform for 6G Networks Alexandre Costa et.al. 2401.06717v1 null
2024-01-12 Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text Muskan Garg et.al. 2401.06709v1 null
2024-01-12 On the existence of charged electrostatic black holes in arbitrary topology Martin Reiris et.al. 2401.06702v1 null
2024-01-11 Distilling Vision-Language Models on Millions of Videos Yue Zhao et.al. 2401.06129v1 null
2024-01-11 Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors Jack Saunders et.al. 2401.06126v1 null
2024-01-11 Gaussian Shadow Casting for Neural Characters Luis Bolanos et.al. 2401.06116v1 null
2024-01-11 A Closer Look at AUROC and AUPRC under Class Imbalance Matthew B. A. McDermott et.al. 2401.06091v1 link
2024-01-12 LEGO:Language Enhanced Multi-modal Grounding Model Zhaowei Li et.al. 2401.06071v2 link
2024-01-11 On the Power of Graph Neural Networks and Feature Augmentation Strategies to Classify Social Networks Walid Guettala et.al. 2401.06048v1 null
2024-01-11 RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks Partha Ghosh et.al. 2401.06035v1 null
2024-01-11 Attention to detail: inter-resolution knowledge distillation Rocío del Amor et.al. 2401.06010v1 link
2024-01-11 Sea ice detection using concurrent multispectral and synthetic aperture radar imagery Martin S J Rogers et.al. 2401.06009v1 null
2024-01-11 Boosting Mixed-Initiative Co-Creativity in Game Design: A Tutorial Solange Margarido et.al. 2401.05999v1 null
2024-01-10 Towards Online Sign Language Recognition and Translation Ronglai Zuo et.al. 2401.05336v1 link
2024-01-10 ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video Kevin Cai et.al. 2401.05314v1 link
2024-01-10 Strategic Client Selection to Address Non-IIDness in HAPS-enabled FL Networks Amin Farajzadeh et.al. 2401.05308v1 null
2024-01-10 Frame-like Fourier expansions for finite Borel measures on $\mathbb{R}$ Chad Berner et.al. 2401.05243v1 null
2024-01-10 Learning effective good variables from physical data Giulio Barletta et.al. 2401.05226v1 link
2024-01-10 TOVAC: Tele-operated Vehicle Admission Control and Routing Jorge Martín-Pérez et.al. 2401.05225v1 null
2024-01-10 Do Vision and Language Encoders Represent the World Similarly? Mayug Maniparambil et.al. 2401.05224v1 null
2024-01-10 Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method Chenxi Yang et.al. 2401.05217v1 null
2024-01-10 Pre-trained Large Language Models for Financial Sentiment Analysis Wei Luo et.al. 2401.05215v1 link
2024-01-10 A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts into a Verbalizer Yong Ma et.al. 2401.05204v1 null
2024-01-09 A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars Ronglai Zuo et.al. 2401.04730v1 link
2024-01-09 U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation Jun Ma et.al. 2401.04722v1 null
2024-01-09 Helicoidal surfaces of prescribed mean curvature in $\mathbb{R}^3$ Aires Eduardo Menani Barbieri et.al. 2401.04721v1 null
2024-01-09 Low-resource finetuning of foundation models beats state-of-the-art in histopathology Benedikt Roth et.al. 2401.04720v1 null
2024-01-09 Jump Cut Smoothing for Talking Heads Xiaojuan Wang et.al. 2401.04718v1 null
2024-01-09 NIPn CHIPS Blaise Boissonneau et.al. 2401.04697v1 null
2024-01-09 CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks Sunny Howard et.al. 2401.04680v1 null
2024-01-09 Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset Galib Muhammad Shahriar Himel et.al. 2401.04666v1 null
2024-01-09 DepressionEmo: A novel dataset for multilabel classification of depression emotions Abu Bakar Siddiqur Rahman et.al. 2401.04655v1 link
2024-01-09 Hold 'em and Fold 'em: Towards Human-scale, Feedback-Controlled Soft Origami Robots Immanuel Ampomah Mensah et.al. 2401.04650v1 null
2024-01-08 Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning Chen Zhao et.al. 2401.04105v1 null
2024-01-08 RudolfV: A Foundation Model by Pathologists for Pathologists Jonas Dippel et.al. 2401.04079v1 null
2024-01-08 Variance Reduction in Ratio Metrics for Efficient Online Experiments Shubham Baweja et.al. 2401.04062v1 null
2024-01-08 Bjøntegaard Delta (BD): A Tutorial Overview of the Metric, Evolution, Challenges, and Recommendations Nabajeet Barman et.al. 2401.04039v1 null
2024-01-08 Blocks whose defect groups are Suzuki $2$-groups Charles W. Eaton et.al. 2401.04028v1 null
2024-01-08 IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification Abdullah Alsuhaibani et.al. 2401.04025v1 null
2024-01-08 Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification Wentao Zhu et.al. 2401.04023v1 null
2024-01-08 Resident space object detection method based on the connection between Fourier spectrum of the video data difference frame and the linear velocity projection V. S. Baranova et.al. 2401.04021v1 null
2024-01-09 Recognizing Blazars Using Radio Morphology from the VLA Sky Survey Zhang-Liang Xie et.al. 2401.04009v2 null
2024-01-08 Calabi-Yau Varieties via Cyclic Covers, and Complex Hyperbolic Structures for their Moduli Spaces Chenglong Yu et.al. 2401.04006v1 null
2024-01-05 Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Haobo Yuan et.al. 2401.02955v1 link
2024-01-05 The Dark Energy Survey Supernova Program: Cosmological Analysis and Systematic Uncertainties M. Vincenzi et.al. 2401.02945v1 null
2024-01-05 Digital-analog quantum learning on Rydberg atom arrays Jonathan Z. Lu et.al. 2401.02940v1 null
2024-01-05 Mixing Magnetic and Electric Ehlers-Harrison transformations: The Electromagnetic Swirling Spacetime and Novel Type I Backgrounds José Barrientos et.al. 2401.02924v1 null
2024-01-05 Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks Kevin Everson et.al. 2401.02921v1 null
2024-01-05 Analytically-Driven Resource Management for Cloud-Native Microservices Yanqi Zhang et.al. 2401.02920v1 null
2024-01-05 Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task Gabriel Lino Garcia et.al. 2401.02909v1 null
2024-01-05 Robust Bichromatic Classification using Two Lines Erwin Glazenburg et.al. 2401.02897v1 null
2024-01-05 Particle-Wise Higher-Order SPH Field Approximation for DVR Jonathan Fischer et.al. 2401.02896v1 null
2024-01-05 Nonlinear functional regression by functional deep neural network with kernel embedding Zhongjie Shi et.al. 2401.02890v1 null
2024-01-04 asimulation: Domain formation and impact on observables in resolved cosmological simulations of the (a)symmetron Øyvind Christiansen et.al. 2401.02410v1 link
2024-01-04 Gravitational waves from dark domain walls Øyvind Christiansen et.al. 2401.02409v1 link
2024-01-05 Correctness Comparison of ChatGPT-4, Bard, Claude-2, and Copilot for Spatial Tasks Hartwig H. Hochmair et.al. 2401.02404v2 null
2024-01-04 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation Zihao Xiao et.al. 2401.02402v1 null
2024-01-04 Analyzing Misinformation Claims During the 2022 Brazilian General Election on WhatsApp, Twitter, and Kwai Scott A. Hale et.al. 2401.02395v1 null
2024-01-04 Image denoising and model-independent parameterization for improving IVIM MRI Caleb Sample et.al. 2401.02394v1 null
2024-01-04 Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications Darshan Venkatrayappa et.al. 2401.02383v1 null
2024-01-04 A novel method to enhance pneumonia detection via a model-level ensembling of CNN and vision transformer Sandeep Angara et.al. 2401.02358v1 null
2024-01-04 ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation Xinyang Pu et.al. 2401.02326v1 link
2024-01-04 Reflection physics in X-ray-emitting Symbiotic Stars Jesús A. Toalá et.al. 2401.02318v1 null
2024-01-03 Profinite equivariant spectra and their tensor-triangular geometry Scott Balchin et.al. 2401.01878v1 null
2024-01-03 A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types Paul B. May et.al. 2401.01848v1 null
2024-01-03 Teaching with a companion: the case of gravity Iuliia Zhurakovskaia et.al. 2401.01832v1 null
2024-01-03 Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling Himmet Toprak Kesgin et.al. 2401.01830v1 null
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827v1 link
2024-01-03 Detours for Navigating Instructional Videos Kumar Ashutosh et.al. 2401.01823v1 null
2024-01-03 SENS3: Multisensory Database of Finger-Surface Interactions and Corresponding Sensations Jagan K. Balasubramanian et.al. 2401.01818v1 null
2024-01-03 Signal Processing in the Retina: Interpretable Graph Classifier to Predict Ganglion Cell Responses Yasaman Parhizkar et.al. 2401.01813v1 null
2024-01-03 Efficient Computation of Confidence Sets Using Classification on Equidistributed Grids Lujie Zhou et.al. 2401.01804v1 null
2024-01-03 An experimental sorting method for improving metagenomic data encoding Diogo Pratas et.al. 2401.01786v1 null
2024-01-02 Street Gaussians for Modeling Dynamic Urban Scenes Yunzhi Yan et.al. 2401.01339v1 null
2024-01-02 Classifying Words with 3-sort Automata Tomasz Jastrząb et.al. 2401.01314v1 null
2024-01-03 A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models S. M Towhidul Islam Tonmoy et.al. 2401.01313v2 null
2024-01-02 Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images Subin Sahayam et.al. 2401.01303v1 null
2024-01-02 $f$-Divergence Based Classification: Beyond the Use of Cross-Entropy Nicola Novello et.al. 2401.01268v1 link
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256v1 null
2024-01-02 An operational approach to classifying measurement incompatibility Arun Kumar Das et.al. 2401.01236v1 null
2024-01-03 Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond Dimitrios Kollias et.al. 2401.01219v2 null
2024-01-02 FGENet: Fine-Grained Extraction Network for Congested Crowd Counting Hao-Yuan Ma et.al. 2401.01208v1 null
2024-01-02 Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans Lorenzo Venturini et.al. 2401.01201v1 null
2023-12-29 Computational Tools for Trees in Gauge Theory and Gravity Jacob L. Bourjaily et.al. 2312.17745v1 null
2023-12-29 Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization Ioanna Ntinou et.al. 2312.17686v1 null
2023-12-29 Malware Detection in IOT Systems Using Machine Learning Techniques Ali Mehrban et.al. 2312.17683v1 null
2023-12-29 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang et.al. 2312.17681v1 null
2023-12-29 Grasping, Part Identification, and Pose Refinement in One Shot with a Tactile Gripper Joyce Xin-Yan Lim et.al. 2312.17650v1 null
2023-12-29 MoD2T:Model-Data-Driven Motion-Static Object Tracking Method Yang Feng et.al. 2312.17641v1 null
2023-12-29 A New Explanation of the Mechanism of Hadley Circulation Wei Huang et.al. 2312.17637v1 null
2023-12-29 Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training Dongfang Li et.al. 2312.17591v1 null
2023-12-29 A Tool for the Procedural Generation of Shaders using Interactive Evolutionary Algorithms Elio Sasso et.al. 2312.17587v1 link
2023-12-29 Distribution-based Low-rank Embedding Bardia Yousefi et.al. 2312.17579v1 null
2023-12-28 A Simple LLM Framework for Long-Range Video Question-Answering Ce Zhang et.al. 2312.17235v1 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225v1 null
2023-12-28 EFHQ: Multi-purpose ExtremePose-Face-HQ dataset Trung Tuan Dao et.al. 2312.17205v1 null
2023-12-28 One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts Ziheng Zhao et.al. 2312.17183v1 null
2023-12-28 Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action Jiasen Lu et.al. 2312.17172v1 null
2023-12-28 Classification of multiplication modules over multiplication rings with finitely many minimal primes Volodymyr Bavula et.al. 2312.17170v1 null
2023-12-28 Securing NextG Systems against Poisoning Attacks on Federated Learning: A Game-Theoretic Solution Yalin E. Sagduyu et.al. 2312.17164v1 null
2023-12-28 Replica Tree-based Federated Learning using Limited Data Ramona Ghilea et.al. 2312.17159v1 null
2023-12-29 ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe Yifan Bai et.al. 2312.17133v2 null
2023-12-28 Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos Houlun Chen et.al. 2312.17117v1 null
2023-12-26 Microwave signal processing using an analog quantum reservoir computer Alen Senanian et.al. 2312.16166v1 null
2023-12-26 Large-scale Long-tailed Disease Diagnosis on Radiology Images Qiaoyu Zheng et.al. 2312.16151v1 null
2023-12-27 The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias Timo Spinde et.al. 2312.16148v2 link
2023-12-26 The non-Abelian Aharonov-Bohm effect P. A. Horvathy et.al. 2312.16133v1 null
2023-12-26 LangSplat: 3D Language Gaussian Splatting Minghan Qin et.al. 2312.16084v1 null
2023-12-26 AdaNAS: Adaptively Post-processing with Self-supervised Neural Architecture Search for Ensemble Rainfall Forecasts Yingpeng Wen et.al. 2312.16046v1 null
2023-12-26 An extended asymmetric sigmoid with Perceptron (SIGTRON) for imbalanced linear classification Hyenkyun Woo et.al. 2312.16043v1 null
2023-12-26 Multi-scale Progressive Feature Embedding for Accurate NIR-to-RGB Spectral Domain Translation Xingxing Yang et.al. 2312.16040v1 null
2023-12-26 Plug-and-Play Regularization on Magnitude with Deep Priors for 3D Near-Field MIMO Imaging Okyanus Oral et.al. 2312.16024v1 null
2023-12-26 Classification of positive solutions of Hardy-Sobolev equation without the finite volume constraints Lu Chen et.al. 2312.16017v1 null
2023-12-25 Training Convolutional Neural Networks with the Forward-Forward algorithm Riccardo Scodellaro et.al. 2312.14924v2 null
2023-12-22 DRStageNet: Deep Learning for Diabetic Retinopathy Staging from Fundus Images Yevgeniy Men et.al. 2312.14891v1 null
2023-12-22 On rate-optimal classification from non-private and from private data Balázs Csanád Csáji et.al. 2312.14889v1 null
2023-12-22 Classification of cubic tricirculant nut graphs Ivan Damnjanović et.al. 2312.14884v1 null
2023-12-22 Neural-network-based regularization methods for inverse problems in imaging Andreas Habring et.al. 2312.14849v1 null
2023-12-22 Classification of 3-GNDB Graphs Amir Hosseini et.al. 2312.14835v1 null
2023-12-22 Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models Tanish Baranwal et.al. 2312.14830v1 null
2023-12-22 Classification of generalised higher-order Einstein-Maxwell Lagrangians Aimeric Colléaux et.al. 2312.14814v1 null
2023-12-22 On support vector machines under a multiple-cost scenario Sandra Benítez-Peña et.al. 2312.14795v1 null
2023-12-22 The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs Junli Fang et.al. 2312.14792v1 null
2023-12-21 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera Christen Millerdurai et.al. 2312.14157v1 null
2023-12-21 Virtual Pets: Animatable Animal Generation in 3D Scenes Yen-Chi Cheng et.al. 2312.14154v1 null
2023-12-21 TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification Qinying Liu et.al. 2312.14149v1 link
2023-12-21 HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs Artem Sevastopolsky et.al. 2312.14140v1 null
2023-12-21 Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach Qinying Liu et.al. 2312.14138v1 link
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-21 WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data Dongjin Choi et.al. 2312.14129v1 null
2023-12-21 VideoPoet: A Large Language Model for Zero-Shot Video Generation Dan Kondratyuk et.al. 2312.14125v1 null
2023-12-21 LingoQA: Video Question Answering for Autonomous Driving Ana-Maria Marcu et.al. 2312.14115v1 link
2023-12-21 LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding Senqiao Yang et.al. 2312.14074v1 null
2023-12-20 Deep Learning on 3D Neural Fields Pierluigi Zama Ramirez et.al. 2312.13277v1 null
2023-12-20 The 1/4-BPS building blocks of brane interactions Ben Eckardt et.al. 2312.13269v1 null
2023-12-20 ClassLIE: Structure- and Illumination-Adaptive Classification for Low-Light Image Enhancement Zixiang Wei et.al. 2312.13265v1 null
2023-12-20 Putting the p back in Prym Jeff Achter et.al. 2312.13263v1 null
2023-12-20 The role of data embedding in equivariant quantum convolutional neural networks Sreetama Das et.al. 2312.13250v1 null
2023-12-20 Enhancing Neural Training via a Correlated Dynamics Model Jonathan Brokman et.al. 2312.13247v1 null
2023-12-20 SISMIK for brain MRI: Deep-learning-based motion estimation and model-based motion correction in k-space Oscar Dabrowski et.al. 2312.13220v1 null
2023-12-20 Boost recall in QSO selection from highly imbalanced photometric datasets Giorgio Calderone et.al. 2312.13194v1 null
2023-12-20 Ergodic measures for periodic type $\mathbb{Z}^m$-skew-products over Interval Exchange Transformations Yuriy Tumarkin et.al. 2312.13165v1 null
2023-12-20 Underwater Acoustic Signal Recognition Based on Salient Features Minghao Chen et.al. 2312.13143v1 null
2023-12-19 Tracking Any Object Amodally Cheng-Yen Hsieh et.al. 2312.12433v1 null
2023-12-19 The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark Aditya Murali et.al. 2312.12429v1 null
2023-12-19 Chasing Fairness in Graphs: A GNN Architecture Perspective Zhimeng Jiang et.al. 2312.12369v1 link
2023-12-19 Easy quantum groups Teo Banica et.al. 2312.12368v1 null
2023-12-19 SMC-NCA: Semantic-guided Multi-level Contrast for Semi-supervised Action Segmentation Feixiang Zhou et.al. 2312.12347v1 null
2023-12-19 On the Effectiveness of Retrieval, Alignment, and Replay in Manipulation Norman Di Palo et.al. 2312.12345v1 null
2023-12-19 Full-reference Video Quality Assessment for User Generated Content Transcoding Zihao Qi et.al. 2312.12317v1 null
2023-12-19 First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria Stefan Schoder et.al. 2312.12314v1 null
2023-12-19 Holography of New Conformal Higher Spin Gravities in 3d I. Lovrekovic et.al. 2312.12301v1 null
2023-12-19 Prompt-based Domain Discrimination for Multi-source Time Series Domain Adaptation Junxiang Wang et.al. 2312.12276v1 null
2023-12-18 Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models Reek Majumder et.al. 2312.10879v1 null
2023-12-18 Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation Hui Fu et.al. 2312.10877v1 null
2023-12-17 Global relaxation-based LP-Newton method for multiple hyperparameter selection in support vector classification with feature selection Qingna Li et.al. 2312.10848v1 null
2023-12-17 Online Boosting Adaptive Learning under Concept Drift for Multistream Classification En Yu et.al. 2312.10841v1 null
2023-12-17 Learning to Act without Actions Dominik Schmidt et.al. 2312.10812v1 null
2023-12-17 Land use/land cover classification of fused Sentinel-1 and Sentinel-2 imageries using ensembles of Random Forests Shivam Pande et.al. 2312.10798v1 null
2023-12-17 Learning to Learn in Interactive Constraint Acquisition Dimos Tsouros et.al. 2312.10795v1 null
2023-12-17 Identification of Knowledge Neurons in Protein Language Models Divya Nori et.al. 2312.10770v1 null
2023-12-17 Multi-Label Classification of COVID-Tweets Using Large Language Models Aniket Deroy et.al. 2312.10748v1 link
2023-12-17 Unmasking Deepfake Faces from Videos Using An Explainable Cost-Sensitive Deep Learning Approach Faysal Mahmud et.al. 2312.10740v1 link
2023-12-15 Understanding Probe Behaviors through Variational Bounds of Mutual Information Kwanghee Choi et.al. 2312.10019v1 link
2023-12-15 Wearable Coaxially-shielded Metamaterial for Magnetic Resonance Imaging Xia Zhu et.al. 2312.10018v1 null
2023-12-15 On the Invertibility of Euler Integral Transforms with Hyperplanes and Quadric Hypersurfaces Mattie Ji et.al. 2312.10002v1 null
2023-12-15 Towards Architecture-Insensitive Untrained Network Priors for Accelerated MRI Reconstruction Yilin Liu et.al. 2312.09988v1 null
2023-12-15 DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing Abdul Wasi et.al. 2312.09955v1 null
2023-12-15 Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction Yuanbo Hou et.al. 2312.09952v1 null
2023-12-15 LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer Yuxin Cao et.al. 2312.09935v1 link
2023-12-15 RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding Yuxin Zi et.al. 2312.09932v1 null
2023-12-15 Reliable Probabilistic Classification with Neural Networks Harris Papadopoulos et.al. 2312.09912v1 null
2023-12-15 TMP: Temporal Motion Propagation for Online Video Super-Resolution Zhengqiang Zhang et.al. 2312.09909v1 null
2023-12-14 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting Zhiyin Qian et.al. 2312.09228v1 null
2023-12-14 Efficient Online Learning of Contact Force Models for Connector Insertion Kevin Tracy et.al. 2312.09190v1 null
2023-12-14 General Object Foundation Model for Images and Videos at Scale Junfeng Wu et.al. 2312.09158v1 null
2023-12-14 Evaluating Augmented Reality Communication: How Can We Teach Procedural Skill in AR? Manuel Rebol et.al. 2312.09152v1 null
2023-12-14 Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting Anthony Chen et.al. 2312.09148v1 null
2023-12-14 Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy Junsu Kim et.al. 2312.09139v1 null
2023-12-14 Less is more -- the Dispatcher/ Executor principle for multi-task Reinforcement Learning Martin Riedmiller et.al. 2312.09120v1 null
2023-12-14 VideoLCM: Video Latent Consistency Model Xiang Wang et.al. 2312.09109v1 null
2023-12-14 FastInject: Injecting Unpaired Text Data into CTC-based ASR training Keqi Deng et.al. 2312.09100v1 null
2023-12-14 Agent Attention: On the Integration of Softmax and Linear Attention Dongchen Han et.al. 2312.08874v1 link
2023-12-13 VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering Xijun Wang et.al. 2312.08367v1 null
2023-12-13 Challenges and Opportunities in Implementing Negative Differential Resistance Mode Reconfigurable Field Effect Transistors Lephe S et.al. 2312.08351v1 null
2023-12-13 Ehancing CT Image synthesis from multi-modal MRI data based on a multi-task neural network framework Zhuoyao Xin et.al. 2312.08343v1 null
2023-12-13 Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach Yiqun Liu et.al. 2312.08330v1 null
2023-12-13 Affine monoids of corank one Yulia Zaitseva et.al. 2312.08316v1 null
2023-12-13 VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space Guénolé Fiche et.al. 2312.08291v1 null
2023-12-13 PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models Anis Bourou et.al. 2312.08290v1 link
2023-12-13 On the verification of Embeddings using Hybrid Markov Logic Anup Shakya et.al. 2312.08287v1 null
2023-12-14 High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models Songchi Zhou et.al. 2312.08274v2 null
2023-12-13 Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation Arul Selvam Periyasamy et.al. 2312.08268v1 null
2023-12-12 diff History for Long-Context Language Agents Ulyana Piterbarg et.al. 2312.07540v1 null
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion Soyong Shin et.al. 2312.07531v1 null
2023-12-12 RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation Peng Lu et.al. 2312.07526v1 link
2023-12-12 PEEKABOO: Interactive Video Generation via Masked-Diffusion Yash Jain et.al. 2312.07509v1 null
2023-12-12 NAC-TCN: Temporal Convolutional Networks with Causal Dilated Neighborhood Attention for Emotion Understanding Alexander Mehta et.al. 2312.07507v1 link
2023-12-12 COLMAP-Free 3D Gaussian Splatting Yang Fu et.al. 2312.07504v1 null
2023-12-12 NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images Gia-Bao Le et.al. 2312.07489v1 null
2023-12-12 MinD-3D: Reconstruct High-quality 3D objects in Human Brain Jianxiong Gao et.al. 2312.07485v1 null
2023-12-12 Classification of retail products: From probabilistic ranking to neural networks Manar Mohamed Hafez et.al. 2312.07482v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654v1 null
2023-12-11 Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection Teodora Popordanoska et.al. 2312.06645v1 null
2023-12-11 Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Shangchen Zhou et.al. 2312.06640v1 null
2023-12-12 TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation Rongkun Zheng et.al. 2312.06630v2 link
2023-12-11 Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism Georgios Milis et.al. 2312.06613v1 link
2023-12-11 Early Action Recognition with Action Prototypes Guglielmo Camporese et.al. 2312.06598v1 null
2023-12-11 Flexible visual prompts for in-context learning in computer vision Thomas Foster et.al. 2312.06592v1 link
2023-12-11 QuickQuakeBuildings: Post-earthquake SAR-Optical Dataset for Quick Damaged-building Detection Yao Sun et.al. 2312.06587v1 null
2023-12-12 ESO/HARPS Radial Velocities Catalog Mauro Barbieri et.al. 2312.06586v2 null
2023-12-08 The Long Secondary Period (LSP) Variables: Overview and Some Analysis John R. Percy et.al. 2312.05255v1 null
2023-12-08 Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration Qi-Wei Wang et.al. 2312.05229v1 null
2023-12-08 Shape Matters: Detecting Vertebral Fractures Using Differentiable Point-Based Shape Decoding Hellena Hempe et.al. 2312.05220v1 link
2023-12-08 Enhancing Facial Classification and Recognition using 3D Facial Models and Deep Learning Houting Li et.al. 2312.05219v1 null
2023-12-08 IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing Shaofei Wang et.al. 2312.05210v1 null
2023-12-08 Embedding theory in ML toward real-time tracking of structural dynamics through hyperspectral datasets Jonathan D Hollenbach et.al. 2312.05201v1 null
2023-12-08 Video-Based Rendering Techniques: A Survey Rafael Kuffner dos Anjos et.al. 2312.05179v1 null
2023-12-08 Enhancing Single-Frame Supervision for Better Temporal Action Localization Changjian Chen et.al. 2312.05178v1 null
2023-12-08 MRI Scan Synthesis Methods based on Clustering and Pix2Pix Giulia Baldini et.al. 2312.05176v1 null
2023-12-08 TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis Heming Zhu et.al. 2312.05161v1 null
2023-12-07 GenDeF: Learning Generative Deformation Field for Video Generation Wen Wang et.al. 2312.04561v1 null
2023-12-07 MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar Yufan Chen et.al. 2312.04558v1 null
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing Tomoki Ichikawa et.al. 2312.04553v1 null
2023-12-07 PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play Lili Chen et.al. 2312.04549v1 null
2023-12-07 Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception? Aritra Dutta et.al. 2312.04548v1 null
2023-12-07 Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models Ivan Kapelyukh et.al. 2312.04533v1 null
2023-12-07 Camera Height Doesn't Change: Unsupervised Monocular Scale-Aware Road-Scene Depth Estimation Genki Kinoshita et.al. 2312.04530v1 null
2023-12-07 RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models Ozgur Kara et.al. 2312.04524v1 link
2023-12-07 Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation Zhiwu Qing et.al. 2312.04483v1 null
2023-12-06 OneLLM: One Framework to Align All Modalities with Language Jiaming Han et.al. 2312.03700v1 link
2023-12-07 Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Umberto Cappellazzo et.al. 2312.03694v2 null
2023-12-06 Direct Exoplanet Detection Using Deep Convolutional Image Reconstruction (ConStruct): A New Algorithm for Post-Processing High-Contrast Images Trevor N. Wolf et.al. 2312.03671v1 null
2023-12-06 Annihilating branching Brownian motion Daniel Ahlberg et.al. 2312.03669v1 null
2023-12-06 Towards small and accurate convolutional neural networks for acoustic biodiversity monitoring Serge Zaugg et.al. 2312.03666v1 null
2023-12-06 Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving Ming Nie et.al. 2312.03661v1 link
2023-12-06 Editable Stain Transformation Of Histological Images Using Unpaired GANs Tibor Sloboda et.al. 2312.03647v1 link
2023-12-06 MotionCtrl: A Unified and Flexible Motion Controller for Video Generation Zhouxia Wang et.al. 2312.03641v1 null
2023-12-06 Training Neural Networks on RAW and HDR Images for Restoration Tasks Lei Luo et.al. 2312.03640v1 link
2023-12-07 Evaluation of Active Feature Acquisition Methods for Static Feature Settings Henrik von Kleist et.al. 2312.03619v2 null
2023-12-05 Dexterous Functional Grasping Ananye Agarwal et.al. 2312.02975v1 null
2023-12-05 Describing Differences in Image Sets with Natural Language Lisa Dunlap et.al. 2312.02974v1 link
2023-12-05 GauHuman: Articulated Gaussian Splatting from Monocular Human Videos Shoukang Hu et.al. 2312.02973v1 link
2023-12-05 Detecting algorithmic bias in medical AI-models Jeffrey Smith et.al. 2312.02959v1 null
2023-12-05 Classification for everyone : Building geography agnostic models for fairer recognition Akshat Jindal et.al. 2312.02957v1 null
2023-12-05 Choroidalyzer: An open-source, end-to-end pipeline for choroidal analysis in optical coherence tomography Justin Engelmann et.al. 2312.02956v1 null
2023-12-05 An alternating peak-optimization method for optimal trajectory generation of quadrotor drones Wytze A. B. de Vries et.al. 2312.02944v1 null
2023-12-05 Fast CT anatomic localization algorithm Amit Oved et.al. 2312.02941v1 null
2023-12-05 Drag-A-Video: Non-rigid Video Editing with Point-based Interaction Yao Teng et.al. 2312.02936v1 null
2023-12-06 WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation Jiachen Lu et.al. 2312.02934v2 link
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141v1 null
2023-12-04 Fast View Synthesis of Casual Videos Yao-Chih Lee et.al. 2312.02135v1 null
2023-12-04 GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians Liangxiao Hu et.al. 2312.02134v1 null
2023-12-04 Hot PATE: Private Aggregation of Distributions for Diverse Task Edith Cohen et.al. 2312.02132v1 null
2023-12-04 Can we truly transfer an actor's genuine happiness to avatars? An investigation into virtual, real, posed and spontaneous faces Vitor Miguel Xavier Peres et.al. 2312.02128v1 null
2023-12-04 Cosmic star-formation history and black hole accretion history inferred from the JWST mid-infrared source counts Seong Jin Kim et.al. 2312.02090v1 null
2023-12-05 VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence Yuchao Gu et.al. 2312.02087v2 null
2023-12-04 Integrating AI into CCTV Systems: A Comprehensive Evaluation of Smart Video Surveillance in Community Space Shanle Yao et.al. 2312.02078v1 null
2023-12-04 GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians Shenhan Qian et.al. 2312.02069v1 null
2023-12-04 TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding Shuhuai Ren et.al. 2312.02051v1 null
2023-12-01 Dense Optical Tracking: Connecting the Dots Guillaume Le Moing et.al. 2312.00786v1 null
2023-12-01 Sequential Modeling Enables Scalable Learning for Large Vision Models Yutong Bai et.al. 2312.00785v1 null
2023-12-01 MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video Hengyi Wang et.al. 2312.00778v1 null
2023-12-01 VideoBooth: Diffusion-based Video Generation with Image Prompts Yuming Jiang et.al. 2312.00777v1 null
2023-12-01 Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans Homanga Bharadhwaj et.al. 2312.00775v1 null
2023-12-01 Explaining Knock-on Effects of Bias Mitigation Svetoslav Nizhnichenkov et.al. 2312.00765v1 null
2023-12-04 Deep Unlearning: Fast and Efficient Training-free Approach to Controlled Forgetting Sangamesh Kodge et.al. 2312.00761v2 null
2023-12-01 Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals Tam Nguyen et.al. 2312.00751v1 null
2023-12-01 Tight-minimal dichotomies in Banach spaces Alejandra C. Cáceres-Rigo et.al. 2312.00721v1 null
2023-12-01 GIFT: Generative Interpretable Fine-Tuning Transformers Chinmay Savadikar et.al. 2312.00700v1 link
2023-11-30 Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living Dominick Reilly et.al. 2311.18840v1 null
2023-11-30 TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios Lihao Liu et.al. 2311.18839v1 null
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-11-30 Motion-Conditioned Image Animation for Video Editing Wilson Yan et.al. 2311.18827v1 null
2023-11-30 CAST: Cross-Attention in Space and Time for Video Action Recognition Dongho Lee et.al. 2311.18825v1 link
2023-11-30 Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking Kaifeng Lyu et.al. 2311.18817v1 link
2023-11-30 BIOCLIP: A Vision Foundation Model for the Tree of Life Samuel Stevens et.al. 2311.18803v1 null
2023-11-30 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v2 null
2023-11-29 Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving Yuqi Wang et.al. 2311.17918v1 link
2023-11-29 HUGS: Human Gaussian Splats Muhammed Kocabas et.al. 2311.17910v1 null
2023-11-29 SODA: Bottleneck Diffusion Models for Representation Learning Drew A. Hudson et.al. 2311.17901v1 null
2023-11-30 Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis Jinqi Luo et.al. 2311.17898v2 null
2023-11-29 On the geometry of tensor products over finite fields Stefano Lia et.al. 2311.17896v1 null
2023-11-29 Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation Shuangrui Ding et.al. 2311.17893v1 null
2023-11-29 TSDF-Sampling: Efficient Sampling for Neural Surface Field using Truncated Signed Distance Field Chaerin Min et.al. 2311.17878v1 null
2023-11-29 Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification Tristan Gomez et.al. 2311.17876v1 null
2023-11-29 On the Adversarial Robustness of Graph Contrastive Learning Methods Filippo Guerranti et.al. 2311.17853v1 null
2023-11-28 Panoptic Video Scene Graph Generation Jingkang Yang et.al. 2311.17058v1 link
2023-11-28 Self-Supervised Motion Magnification by Backpropagating Through Optical Flow Zhaoying Pan et.al. 2311.17056v1 null
2023-11-28 MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training Pavan Kumar Anasosalu Vasu et.al. 2311.17049v1 null
2023-11-28 Jets of foliations and $b^k$-algebroids Francis Bischoff et.al. 2311.17045v1 null
2023-11-28 LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models Yanwei Li et.al. 2311.17043v1 link
2023-11-29 Efficient In-Context Learning in Vision-Language Models for Egocentric Videos Keunwoo Peter Yu et.al. 2311.17041v2 null
2023-11-28 Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer Danah Yatim et.al. 2311.17009v1 null
2023-11-28 MVBench: A Comprehensive Multi-modal Video Understanding Benchmark Kunchang Li et.al. 2311.17005v1 link
2023-11-28 Mirković-Vilonen Polytopes from Combinatorics Mario Sanchez et.al. 2311.16979v1 null
2023-11-28 Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis Aman Yadav et.al. 2311.16965v1 null
2023-11-28 Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models Munan Ning et.al. 2311.16103v2 link
2023-11-27 GART: Gaussian Articulated Template Models Jiahui Lei et.al. 2311.16099v1 null
2023-11-27 On Bringing Robots Home Nur Muhammad Mahi Shafiullah et.al. 2311.16098v1 link
2023-11-27 CG-HOI: Contact-Guided 3D Human-Object Interaction Generation Christian Diller et.al. 2311.16097v1 null
2023-11-27 Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling Zhe Li et.al. 2311.16096v1 link
2023-11-27 Three-dimensional $\mathbb{Z}$ topological insulators without reflection symmetry Alexander C. Tyner et.al. 2311.16092v1 null
2023-11-27 BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification Dmitri Roussinov et.al. 2311.16083v1 link
2023-11-27 ViT-Lens-2: Gateway to Omni-modal Intelligence Weixian Lei et.al. 2311.16081v1 link
2023-11-27 Correlated Spectral and Recurrence Variations of Cygnus X-1 E. M. Broadbent et.al. 2311.16070v1 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060v1 link
2023-11-24 SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation Lingchen Meng et.al. 2311.14671v1 link
2023-11-24 JetLOV: Enhancing Jet Tree Tagging through Neural Network Learning of Optimal LundNet Variables Mauricio A. Diaz et.al. 2311.14654v1 link
2023-11-24 Learning in Deep Factor Graphs with Gaussian Belief Propagation Seth Nabarro et.al. 2311.14649v1 null
2023-11-24 Continuous football player tracking from discrete broadcast data Matthew J. Penn et.al. 2311.14642v1 null
2023-11-24 Emergent Topology in Many-Body Dissipative Quantum Chaos Antonio M. García-García et.al. 2311.14640v1 null
2023-11-24 Unsupervised high-throughput segmentation of cells and cell nuclei in quantitative phase images Julia Sistermanns et.al. 2311.14639v1 null
2023-11-24 ARIA: On the interaction between Architectures, Aggregation methods and Initializations in federated visual classification Vasilis Siomos et.al. 2311.14625v1 null
2023-11-24 Neural Style Transfer for Computer Games Eleftherios Ioannou et.al. 2311.14617v1 null
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603v1 null
2023-11-24 A Metalearned Neural Circuit for Nonparametric Bayesian Inference Jake C. Snell et.al. 2311.14601v1 link
2023-11-22 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570v1 null
2023-11-22 Belted sum decompositions of fully augmented links Porter Morgan et.al. 2311.13540v1 null
2023-11-22 Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression Tam Thuc Do et.al. 2311.13539v1 null
2023-11-22 Leveraging CNNs and Ensemble Learning for Automated Disaster Image Classification Archit Rathod et.al. 2311.13531v1 null
2023-11-22 Applying Dimensionality Reduction as Precursor to LSTM-CNN Models for Classifying Imagery and Motor Signals in ECoG-Based BCIs Soham Bafana et.al. 2311.13507v1 link
2023-11-22 Current Topological and Machine Learning Applications for Bias Detection in Text Colleen Farrelly et.al. 2311.13495v1 null
2023-11-22 Benchmarking Toxic Molecule Classification using Graph Neural Networks and Few Shot Learning Bhavya Mehta et.al. 2311.13490v1 null
2023-11-22 Deep-learning-based acceleration of MRI for radiotherapy planning of pediatric patients with brain tumors Shahinur Alam et.al. 2311.13485v1 link
2023-11-22 Solution discovery via reconfiguration for problems in P Mario Grobler et.al. 2311.13478v1 null
2023-11-22 Experimentation in Early-Stage Video Game Startups: Practices and Challenges Henry Edison et.al. 2311.13462v1 null
2023-11-21 Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models David Stotko et.al. 2311.12796v1 null
2023-11-21 Quantifying Impairment and Disease Severity Using AI Models Trained on Healthy Subjects Boyang Yu et.al. 2311.12781v1 link
2023-11-21 Swift Parameter-free Attention Network for Efficient Super-Resolution Cheng Wan et.al. 2311.12770v1 link
2023-11-22 Investigating Weight-Perturbed Deep Neural Networks With Application in Iris Presentation Attack Detection Renu Sharma et.al. 2311.12764v2 link
2023-11-21 High-resolution Image-based Malware Classification using Multiple Instance Learning Tim Peters et.al. 2311.12760v1 link
2023-11-21 SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction Yuanhui Huang et.al. 2311.12754v1 link
2023-11-21 Image Transformation for IoT Time-Series Data: A Review Duygu Altunkaya et.al. 2311.12742v1 null
2023-11-21 Exploring Graph Classification Techniques Under Low Data Constraints: A Comprehensive Study Kush Kothari et.al. 2311.12737v1 null
2023-11-21 Not Just Training, Also Testing: High School Youths' Perspective-Taking through Peer Testing Machine Learning-Powered Applications L. Morales-Navarro et.al. 2311.12733v1 null
2023-11-21 Cascade Learning Localises Discriminant Features in Visual Scene Classification Junwen Wang et.al. 2311.12704v1 null
2023-11-20 Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation Wenhao Li et.al. 2311.12028v1 null
2023-11-20 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration Naoki Wake et.al. 2311.12015v1 null
2023-11-20 Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting David Latortue et.al. 2311.11974v1 null
2023-11-20 SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks Jin Ye et.al. 2311.11969v1 link
2023-11-20 Correlated Attention in Transformers for Multivariate Time Series Quang Minh Nguyen et.al. 2311.11959v1 null
2023-11-20 Tubular Curvature Filter: Implicit Pointwise Curvature Calculation Method for Tubular Objects Elifnur Sunger et.al. 2311.11931v1 null
2023-11-20 LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions Songhao Han et.al. 2311.11904v1 null
2023-11-20 Multimodal Characterization of Emotion within Multimedia Space Dayo Samuel Banjo et.al. 2311.11892v1 null
2023-11-20 SniffyArt: The Dataset of Smelling Persons Mathias Zinnen et.al. 2311.11888v1 null
2023-11-20 Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks Rami Haffar et.al. 2311.11882v1 link
2023-11-17 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar et.al. 2311.10709v1 null
2023-11-17 SpACNN-LDVAE: Spatial Attention Convolutional Latent Dirichlet Variational Autoencoder for Hyperspectral Pixel Unmixing Soham Chitnis et.al. 2311.10701v1 null
2023-11-17 A note on the convergence of the Bayesian entropy estimator for exchangeable partitions Servet Martinez et.al. 2311.10698v1 null
2023-11-17 Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections Lihan Zha et.al. 2311.10678v1 link
2023-11-17 3D-TexSeg: Unsupervised Segmentation of 3D Texture using Mutual Transformer Learning Iyyakutti Iyappan Ganapathi et.al. 2311.10651v1 null
2023-11-17 User Dynamics-Aware Edge Caching and Computing for Mobile Virtual Reality Mushu Li et.al. 2311.10645v1 null
2023-11-17 Image-Domain Material Decomposition for Dual-energy CT using Unsupervised Learning with Data-fidelity Loss Junbo Peng et.al. 2311.10641v1 null
2023-11-17 Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks Benjamin Feuer et.al. 2311.10609v1 null
2023-11-17 Designing Reconfigurable Intelligent Systems with Markov Blankets Boris Sedlak et.al. 2311.10597v1 null
2023-11-17 FOCAL: A Cost-Aware Video Dataset for Active Learning Kiran Kokilepersaud et.al. 2311.10591v1 link
2023-11-16 Traffic Video Object Detection using Motion Prior Lihao Liu et.al. 2311.10092v1 null
2023-11-16 Moduli space of rank three logarithmic connections on the projective line with three poles Takafumi Matsumoto et.al. 2311.10071v1 null
2023-11-16 Inherently Interpretable Time Series Classification via Multiple Instance Learning Joseph Early et.al. 2311.10049v1 link
2023-11-16 On the potential of Carbon-Enhanced Metal-Poor stars for Galactic Archaeology Aruna Goswami et.al. 2311.10043v1 null
2023-11-16 Match and Locate: low-frequency monocular odometry based on deep feature matching Stepan Konev et.al. 2311.10034v1 null
2023-11-16 Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs Feriel Khennouche et.al. 2311.09976v1 null
2023-11-16 From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning Jiansong Zhang et.al. 2311.09974v1 null
2023-11-16 VertDetect: Fully End-to-End 3D Vertebral Instance Segmentation Model Geoff Klein et.al. 2311.09958v1 null
2023-11-16 Harnessing Transformers: A Leap Forward in Lung Cancer Image Detection Amine Bechar et.al. 2311.09942v1 null
2023-11-17 A Framework for Monitoring and Retraining Language Models in Real-World Applications Jaykumar Kasundra et.al. 2311.09930v2 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221v1 null
2023-11-15 ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy Kirill Vishniakov et.al. 2311.09215v1 link
2023-11-15 Topology of Pulsar Profiles (ToPP). I. Graph theory method and classification of the EPN D. Vohl et.al. 2311.09201v1 null
2023-11-15 ExpM+NF: Differentially Private Machine Learning that Surpasses DPSGD Robert A. Bridges et.al. 2311.09200v1 null
2023-11-15 Domain Aligned CLIP for Few-shot Classification Muhammad Waleed Gondal et.al. 2311.09191v1 null
2023-11-15 ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models Jierui Li et.al. 2311.09182v1 null
2023-11-15 RBPGAN: Recurrent Back-Projection GAN for Video Super Resolution Dareen Hussein et.al. 2311.09178v1 null
2023-11-15 Model Agnostic Explainable Selective Regression via Uncertainty Estimation Andrea Pugnana et.al. 2311.09145v1 null
2023-11-15 Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets Christian Mahoney et.al. 2311.09133v1 null
2023-11-15 Cross-view and Cross-pose Completion for 3D Human Understanding Matthieu Armando et.al. 2311.09104v1 null
2023-11-14 MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation Ehsan Asali et.al. 2311.08393v1 null
2023-11-14 USLR: an open-source tool for unbiased and smooth longitudinal registration of brain MR Adrià Casamitjana et.al. 2311.08371v1 link
2023-11-14 Inverse Learning with Extremely Sparse Feedback for Recommendation Guanyu Lin et.al. 2311.08302v1 null
2023-11-14 Level Set KSVD Omer Sapir et.al. 2311.08284v1 null
2023-11-14 TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition Yunjiao Zhou et.al. 2311.08245v1 null
2023-11-14 MCMC to address model misspecification in Deep Learning classification of Radio Galaxies Devina Mohan et.al. 2311.08243v1 null
2023-11-14 Learning Physics-Inspired Regularization for Medical Image Registration with Hypernetworks Anna Reithmeir et.al. 2311.08239v1 link
2023-11-14 Counterfactual Explanation for Regression via Disentanglement in Latent Space Xuan Zhao et.al. 2311.08228v1 null
2023-11-14 Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images Zhiyun Song et.al. 2311.08225v1 null
2023-11-14 Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction Kunting Li et.al. 2311.08219v1 link
2023-11-13 GPT-4V(ision) as A Social Media Analysis Engine Hanjia Lyu et.al. 2311.07547v1 link
2023-11-13 mlscorecheck: Testing the consistency of reported performance scores and experiments in machine learning György Kovács et.al. 2311.07541v1 null
2023-11-13 FEMDA: a unified framework for discriminant analysis Pierre Houdouin et.al. 2311.07518v1 null
2023-11-13 Reducing the Need for Backpropagation and Discovering Better Optima With Explicit Optimizations of Neural Networks Jake Ryland Williams et.al. 2311.07498v1 null
2023-11-13 Towards Robotic Tree Manipulation: Leveraging Graph Representations Chung Hee Kim et.al. 2311.07479v1 null
2023-11-13 Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks Laura Fieback et.al. 2311.07477v1 null
2023-11-13 Masked Face Dataset Generation and Masked Face Recognition Rui Cai et.al. 2311.07475v1 link
2023-11-13 A Bayesian Approach to Strong Lens Finding in the Era of Wide-area Surveys Philip Holloway et.al. 2311.07455v1 null
2023-11-13 On the Robustness of Neural Collapse and the Neural Collapse of Robustness Jingtong Su et.al. 2311.07444v1 null
2023-11-13 Optimising Human-AI Collaboration by Learning Convincing Explanations Alex J. Chan et.al. 2311.07426v1 null
2023-11-10 Learning Human Action Recognition Representations Without Real Humans Howard Zhong et.al. 2311.06231v1 link
2023-11-10 Semantic-aware Video Representation for Few-shot Action Recognition Yutao Tang et.al. 2311.06218v1 null
2023-11-10 MultiIoT: Towards Large-scale Multisensory Learning for the Internet of Things Shentong Mo et.al. 2311.06217v1 null
2023-11-10 Deep learning segmentation of fibrous cap in intravascular optical coherence tomography images Juhwan Lee et.al. 2311.06202v1 null
2023-11-10 An Automated Pipeline for Tumour-Infiltrating Lymphocyte Scoring in Breast Cancer Adam J Shephard et.al. 2311.06185v1 link
2023-11-10 Automatic Report Generation for Histopathology images using pre-trained Vision Transformers Saurav Sengupta et.al. 2311.06176v1 null
2023-11-10 Two vertex geometrically irreducible algebras Grzegorz Bobinski et.al. 2311.06173v1 null
2023-11-10 Time Scale Network: A Shallow Neural Network For Time Series Data Trevor Meyer et.al. 2311.06170v1 null
2023-11-10 Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping Fabi Prezja et.al. 2311.06169v1 link
2023-11-10 Going beyond persistent homology using persistent homology Johanna Immonen et.al. 2311.06152v1 null
2023-11-09 FogROS2-Sky: Optimizing Latency and Cost for Multi-Cloud Robot Applications Kaiyuan Chen et.al. 2311.05600v1 null
2023-11-09 A Coefficient Makes SVRG Effective Yida Yin et.al. 2311.05589v1 link
2023-11-09 Outlier-Robust Wasserstein DRO Sloan Nietert et.al. 2311.05573v1 link
2023-11-09 Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach Cristina Palmero et.al. 2311.05567v1 null
2023-11-09 Disentangling Quantum and Classical Contributions in Hybrid Quantum Machine Learning Architectures Michael Kölle et.al. 2311.05559v1 null
2023-11-09 L-WaveBlock: A Novel Feature Extractor Leveraging Wavelets for Generative Adversarial Networks Mirat Shah et.al. 2311.05548v1 null
2023-11-09 BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis Hao-Bin Duan et.al. 2311.05521v1 null
2023-11-09 Dirichlet Active Learning Kevin Miller et.al. 2311.05501v1 null
2023-11-09 Retinal OCT Synthesis with Denoising Diffusion Probabilistic Models for Layer Segmentation Yuli Wu et.al. 2311.05479v1 null
2023-11-09 Robust Retraining-free GAN Fingerprinting via Personalized Normalization Jianwei Fei et.al. 2311.05478v1 null
2023-11-08 Towards Few-Annotation Learning in Computer Vision: Application to Image Classification and Object Detection tasks Quentin Bouniot et.al. 2311.04888v1 null
2023-11-08 Are foundation models efficient for medical image segmentation? Danielle Ferreira et.al. 2311.04847v1 null
2023-11-08 Bayesian multi-band fitting of alerts for kilonovae detection Biswajit Biswas et.al. 2311.04845v1 null
2023-11-08 Hierarchically Gated Recurrent Neural Network for Sequence Modeling Zhen Qin et.al. 2311.04823v1 link
2023-11-08 A Lightweight Architecture for Real-Time Neuronal-Spike Classification Muhammad Ali Siddiqi et.al. 2311.04808v1 null
2023-11-08 Determination of toxic comments and unintended model bias minimization using Deep learning approach Md Azim Khan et.al. 2311.04789v1 null
2023-11-08 VioLA: Aligning Videos to 2D LiDAR Scans Jun-Jee Chao et.al. 2311.04783v1 null
2023-11-08 FetMRQC: an open-source machine learning framework for multi-centric fetal brain MRI quality control Thomas Sanchez et.al. 2311.04780v1 link
2023-11-08 GCS-ICHNet: Assessment of Intracerebral Hemorrhage Prognosis using Self-Attention with Domain Knowledge Integration Xuhao Shan et.al. 2311.04772v1 link
2023-11-08 An attention-based deep learning network for predicting Platinum resistance in ovarian cancer Haoming Zhuang et.al. 2311.04769v1 null
2023-11-08 Video Instance Matting Jiachen Li et.al. 2311.04212v2 link
2023-11-07 JPAVE: A Generation and Classification-based Model for Joint Product Attribute Prediction and Value Extraction Zhongfen Deng et.al. 2311.04196v1 link
2023-11-07 Linear to circular conversion in the polarized radio emission of a magnetar Marcus E. Lower et.al. 2311.04195v1 null
2023-11-07 SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish Yevhen Kostiuk et.al. 2311.04189v1 null
2023-11-07 A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis Dipanjyoti Paul et.al. 2311.04157v1 link
2023-11-07 Galaxy Spectra neural Network (GaSNet). II. Using Deep Learning for Spectral Classification and Redshift Predictions Fucheng Zhong et.al. 2311.04146v1 null
2023-11-07 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models Shiwei Zhang et.al. 2311.04145v1 null
2023-11-07 Modelling Sentiment Analysis: LLMs and data augmentation techniques Guillem Senabre Prades et.al. 2311.04139v1 null
2023-11-07 Improved Topological Preservation in 3D Axon Segmentation and Centerline Detection using Geometric Assessment-driven Topological Smoothing (GATS) Nina I. Shamsi et.al. 2311.04116v1 null
2023-11-07 Joint modelling of recurrent and terminal events with discretely-distributed non-parametric frailty: application on re-hospitalizations and death in heart failure patients Chiara Masci et.al. 2311.04103v1 null
2023-11-06 A Classification of Graphs through Quadratic Embedding Constants and Clique Graph Insights Edy Tri Baskoro et.al. 2311.03342v1 null
2023-11-06 Tackling Concept Shift in Text Classification using Entailment-style Modeling Sumegh Roychowdhury et.al. 2311.03320v1 null
2023-11-06 A Foundation Model for Music Informatics Minz Won et.al. 2311.03318v1 link
2023-11-06 FATE: Feature-Agnostic Transformer-based Encoder for learning generalized embedding spaces in flow cytometry data Lisa Weijler et.al. 2311.03314v1 link
2023-11-06 A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation Qitao Zhao et.al. 2311.03312v1 null
2023-11-06 Advancing Post Hoc Case Based Explanation with Feature Highlighting Eoin Kenny et.al. 2311.03246v1 null
2023-11-06 Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive Review Faruk Ahmed et.al. 2311.03240v1 null
2023-11-06 Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources Haotian Zheng et.al. 2311.03236v1 null
2023-11-06 Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet Hector Arroyo et.al. 2311.03221v1 null
2023-11-06 Leveraging Transformers to Improve Breast Cancer Classification and Risk Assessment with Multi-modal and Longitudinal Data Yiqiu Shen et.al. 2311.03217v1 null
2023-11-03 LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery Weikang Wan et.al. 2311.02058v1 null
2023-11-03 MetaFast: Enabling Fast Metagenomic Classification via Seed Counting and Edit Distance Approximation Arvid E. Gollwitzer et.al. 2311.02029v1 null
2023-11-03 A Structured Pruning Algorithm for Model-based Deep Learning Chicago Park et.al. 2311.02003v1 null
2023-11-03 Detection of keratoconus Diseases using deep Learning AKM Enzam-Ul Haque et.al. 2311.01996v1 null
2023-11-03 Obtaining Explainable Classification Models using Distributionally Robust Optimization Sanjeeb Dash et.al. 2311.01994v1 null
2023-11-03 Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation Shichao Dong et.al. 2311.01989v1 null
2023-11-06 RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches Jiayuan Gu et.al. 2311.01977v2 null
2023-11-03 Welded graphs, Wirtinger groups and knotted punctured spheres Benjamin Audoux et.al. 2311.01922v1 null
2023-11-03 Contrast-Agnostic Groupwise Registration by Robust PCA for Quantitative Cardiac MRI Xinqi Li et.al. 2311.01916v1 null
2023-11-03 VQPy: An Object-Oriented Approach to Modern Video Analytics Shan Yu et.al. 2311.01623v1 null
2023-11-02 Tailoring Mixup to Data using Kernel Warping functions Quentin Bouniot et.al. 2311.01434v1 link
2023-11-02 Identifying Alzheimer Disease Dementia Levels Using Machine Learning Methods Md Gulzar Hussain et.al. 2311.01428v1 null
2023-11-02 Exploring Deep Learning Techniques for Glaucoma Detection: A Comprehensive Review Aized Amin Soofi et.al. 2311.01425v1 null
2023-11-02 Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data Cheng-Hao Tu et.al. 2311.01420v1 null
2023-11-02 Learning to See Physical Properties with Active Sensing Motor Policies Gabriel B. Margolis et.al. 2311.01405v1 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380v1 link
2023-11-02 Deep learning based Image Compression for Microscopy Images: An Empirical Study Yu Zhou et.al. 2311.01352v1 null
2023-11-02 Unreading Race: Purging Protected Features from Chest X-ray Embeddings Tobias Weber et.al. 2311.01349v1 null
2023-11-02 Scattering Vision Transformer: Spectral Mixing Matters Badri N. Patro et.al. 2311.01310v1 null
2023-11-02 Hybrid-Fusion Transformer for Multisequence MRI Jihoon Cho et.al. 2311.01308v1 null
2023-11-01 Software Repositories and Machine Learning Research in Cyber Security Mounika Vanamala et.al. 2311.00691v1 null
2023-11-01 What User Behaviors Make the Differences During the Process of Visual Analytics? Shahin Doroudian et.al. 2311.00690v1 null
2023-11-01 Deep Learning-Based Classification of Gamma Photon Interactions in Room-Temperature Semiconductor Radiation Detectors Sandeep K. Chaudhuri et.al. 2311.00682v1 null
2023-11-01 Latent Space Translation via Semantic Alignment Valentino Maiorca et.al. 2311.00664v1 link
2023-11-01 Rediscussion of eclipsing binaries. Paper XV. The B-type supergiant system V1765 Cygni John Southworth et.al. 2311.00655v1 null
2023-11-02 Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning Richard Bornemann et.al. 2311.00651v2 null
2023-11-01 Understanding the Issues and Causes in WebAssembly Application Development: A Mining-based Study Muhammad Waseem et.al. 2311.00646v1 null
2023-11-01 A Bi-level Framework for Traffic Accident Duration Prediction: Leveraging Weather and Road Condition Data within a Practical Optimum Pipeline Rafat Tabassum Sukonna et.al. 2311.00634v1 null
2023-11-01 Controllable Music Production with Diffusion Models and Guidance Gradients Mark Levy et.al. 2311.00613v1 null
2023-11-01 A Robust Deep Learning Method with Uncertainty Estimation for the Pathological Classification of Renal Cell Carcinoma based on CT Images Ni Yao et.al. 2311.00567v1 null
2023-10-31 Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders Srijan Das et.al. 2310.20704v1 null
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700v1 null
2023-10-31 StairNet: Visual Recognition of Stairs for Human-Robot Locomotion Andrew Garrett Kurbis et.al. 2310.20666v1 null
2023-10-31 Performance Improvement in Multi-class Classification via Automated Hierarchy Generation and Exploitation through Extended LCPN Schemes Celal Alagoz et.al. 2310.20641v1 null
2023-10-31 Deepfake detection by exploiting surface anomalies: the SurFake approach Andrea Ciamarra et.al. 2310.20621v1 null
2023-10-31 Enhanced Synthetic MRI Generation from CT Scans Using CycleGAN with Feature Extraction Saba Nikbakhsh et.al. 2310.20604v1 null
2023-10-31 Finiteness properties for Shimura curves and modified diagonal cycles Congling Qiu et.al. 2310.20600v1 null
2023-10-31 Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward Alignment Tahereh Toosi et.al. 2310.20599v1 link
2023-10-31 Tracially Complete C-Algebras* José R. Carrión et.al. 2310.20594v1 null
2023-10-31 Strongly Magnetized Tidal Disruption Event Disks via Stream Injection in GRMHD Brandon Curd et.al. 2310.20592v1 null
2023-10-29 Improved Motor Imagery Classification Using Adaptive Spatial Filters Based on Particle Swarm Optimization Algorithm Xiong Xiong et.al. 2310.19202v1 null
2023-10-29 Enhancing Motor Imagery Decoding in Brain Computer Interfaces using Riemann Tangent Space Mapping and Cross Frequency Coupling Xiong Xiong et.al. 2310.19198v1 null
2023-10-29 A Survey on Watching Social Issue Videos among YouTube and TikTok Users Shuo Niu et.al. 2310.19193v1 null
2023-10-29 Subjective Quality Evaluation of Point Clouds Using a Head Mounted Display Joao Prazeres et.al. 2310.19179v1 null
2023-10-29 Robustifying Language Models with Test-Time Adaptation Noah Thomas McDermott et.al. 2310.19177v1 null
2023-10-29 Predicting recovery following stroke: deep learning, multimodal data and feature selection using explainable AI Adam White et.al. 2310.19174v1 null
2023-10-29 BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping Srikumar Sastry et.al. 2310.19168v1 link
2023-10-29 Unified Representation for Non-compositional and Compositional Expressions Ziheng Zeng et.al. 2310.19127v1 null
2023-10-29 Efficient IoT Inference via Context-Awareness Mohammad Mehdi Rastikerdar et.al. 2310.19112v1 null
2023-10-29 Pushdown Layers: Encoding Recursive Structure in Transformer Language Models Shikhar Murty et.al. 2310.19089v1 null
2023-10-27 Addressing GAN Training Instabilities via Tunable Classification Losses Monica Welfert et.al. 2310.18291v1 null
2023-10-27 PlantPlotGAN: A Physics-Informed Generative Adversarial Network for Plant Disease Prediction Felipe A. Lopes et.al. 2310.18268v1 null
2023-10-27 MalFake: A Multimodal Fake News Identification for Malayalam using Recurrent Neural Networks and VGG-16 Adhish S. Sujan et.al. 2310.18263v1 null
2023-10-27 Edge AI-Based Vein Detector for Efficient Venipuncture in the Antecubital Fossa Edwin Salcedo et.al. 2310.18234v1 null
2023-10-27 TBDLNet: a network for classifying multidrug-resistant and drug-sensitive tuberculosis Ziquan Zhu et.al. 2310.18222v1 null
2023-10-27 ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models Benjamin Feuer et.al. 2310.18208v1 link
2023-10-27 Artifact-Robust Graph-Based Learning in Digital Pathology Saba Heidari Gheshlaghi et.al. 2310.18192v1 null
2023-10-27 Globular clusters and bar: captured or not captured? Anton A. Smirnov et.al. 2310.18172v1 null
2023-10-27 Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN Neeraj Kumar et.al. 2310.18169v1 null
2023-10-27 DESiRED -- Dynamic, Enhanced, and Smart iRED: A P4-AQM with Deep Reinforcement Learning and In-band Network Telemetry Leandro C. de Almeida et.al. 2310.18159v1 null
2023-10-26 A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection Anas Al-lahham et.al. 2310.17650v1 null
2023-10-26 torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP Yoshitomo Matsubara et.al. 2310.17644v1 link
2023-10-26 Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models Tsun-Hsuan Wang et.al. 2310.17642v1 null
2023-10-26 Skew Products on the Berkovich Projective Line Richard A. P. Birkett et.al. 2310.17628v1 null
2023-10-26 A Survey on Transferability of Adversarial Examples across Deep Neural Networks Jindong Gu et.al. 2310.17626v1 link
2023-10-26 MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations Ajay Mandlekar et.al. 2310.17596v1 null
2023-10-26 Linear $x$-coordinate relations of triples on elliptic curves Jerson Caro et.al. 2310.17592v1 null
2023-10-26 A minimax optimal control approach for robust neural ODEs Cristina Cipriani et.al. 2310.17584v1 null
2023-10-26 BLIS-Net: Classifying and Analyzing Signals on Graphs Charles Xu et.al. 2310.17579v1 null
2023-10-26 Knots bounding non-isotopic ribbon disks Jeffrey Meier et.al. 2310.17564v1 null
2023-10-25 RDBench: ML Benchmark for Relational Databases Zizhao Zhang et.al. 2310.16837v1 link
2023-10-25 TD-MPC2: Scalable, Robust World Models for Continuous Control Nicklas Hansen et.al. 2310.16828v1 null
2023-10-26 Deep machine learning for meteor monitoring: advances with transfer learning and gradient-weighted class activation mapping Eloy Peña-Asensio et.al. 2310.16826v2 null
2023-10-25 Uncovering a new group of T Tauri stars in the Taurus-Auriga molecular complex from Gaia and GALEX data Ana Inés Gómez de Castro et.al. 2310.16820v1 null
2023-10-25 Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation Daniel Saragih et.al. 2310.16794v1 null
2023-10-25 Navigating Socio-Emotional Risk through Comfort-Building in a Physics Teaching Community of Practice: A Case Study Maggie Mahmood et.al. 2310.16778v1 null
2023-10-25 IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery Bhavuk Singhal et.al. 2310.16761v1 null
2023-10-25 Interferometric Neural Networks Arun Sehrawat et.al. 2310.16742v1 link
2023-10-25 A No-Reference Quality Assessment Method for Digital Human Head Yingjie Zhou et.al. 2310.16732v1 null
2023-10-25 Spherical Wavefront Near-Field DoA Estimation in THz Automotive Radar Ahmet M. Elbir et.al. 2310.16724v1 null
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047v1 null
2023-10-24 Finetuning Offline World Models in the Real World Yunhai Feng et.al. 2310.16029v1 null
2023-10-24 Human-in-the-Loop Task and Motion Planning for Imitation Learning Ajay Mandlekar et.al. 2310.16014v1 null
2023-10-24 CVPR 2023 Text Guided Video Editing Competition Jay Zhangjie Wu et.al. 2310.16003v1 null
2023-10-24 Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning Xin Xing et.al. 2310.15985v1 link
2023-10-24 Geometry-Aware Video Quality Assessment for Dynamic Digital Human Zicheng Zhang et.al. 2310.15984v1 null
2023-10-24 Minimax Forward and Backward Learning of Evolving Tasks with Performance Guarantees Verónica Álvarez et.al. 2310.15974v1 link
2023-10-24 Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection Manyuan Zhang et.al. 2310.15955v1 null
2023-10-25 Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles Xing Shen et.al. 2310.15952v2 null
2023-10-24 ShARc: Shape and Appearance Recognition for Person Identification In-the-wild Haidong Zhu et.al. 2310.15946v1 null
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169v1 null
2023-10-23 Bitrate Ladder Prediction Methods for Adaptive Video Streaming: A Review and Benchmark Ahmed Telili et.al. 2310.15163v1 null
2023-10-23 Linear Representations of Sentiment in Large Language Models Curt Tigges et.al. 2310.15154v1 null
2023-10-23 Unlocking the Transferability of Tokens in Deep Models for Tabular Data Qi-Le Zhou et.al. 2310.15149v1 null
2023-10-23 When Should the FDA Inspect Pharmaceutical Manufacturing Facilities to Better Mitigate Drug Shortages? Daniel Kosmas et.al. 2310.15146v1 null
2023-10-23 Novel-View Acoustic Synthesis from 3D Reconstructed Rooms Byeongjoo Ahn et.al. 2310.15130v1 link
2023-10-23 Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models Gabriel Sarch et.al. 2310.15127v1 null
2023-10-23 SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution Weihao Lin et.al. 2310.15115v1 null
2023-10-23 The Self 2.0: How AI-Enhanced Self-Clones Transform Self-Perception and Improve Presentation Skills Qingxiao Zheng et.al. 2310.15112v1 null
2023-10-23 Matryoshka Diffusion Models Jiatao Gu et.al. 2310.15111v1 null
2023-10-20 Using Human-like Mechanism to Weaken Effect of Pre-training Weight Bias in Face-Recognition Convolutional Neural Network Haojiang Ying et.al. 2310.13674v1 null
2023-10-23 Explainable Depression Symptom Detection in Social Media Eliseo Bao Souto et.al. 2310.13664v2 null
2023-10-20 Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification Amr Keleg et.al. 2310.13661v1 link
2023-10-20 Optimal Transport for Measures with Noisy Tree Metric Tam Le et.al. 2310.13653v1 null
2023-10-20 Principal $2$-blocks with wreathed defect groups up to splendid Morita equivalence Shigeo Koshitani et.al. 2310.13621v1 null
2023-10-20 Skin Lesion Segmentation Improved by Transformer-based Networks with Inter-scale Dependency Modeling Sania Eskandari et.al. 2310.13604v1 link
2023-10-20 Classification of quantum states of light using random measurements through a multimode fiber Saroch Leedumrongwatthanakun et.al. 2310.13599v1 null
2023-10-20 Longer-range Contextualized Masked Autoencoder Taekyung Kim et.al. 2310.13593v1 null
2023-10-20 POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization Elahe Vahdani et.al. 2310.13585v1 null
2023-10-20 Progressive Dual Priori Network for Generalized Breast Tumor Segmentation Li Wang et.al. 2310.13574v1 null
2023-10-19 Putting the Object Back into Video Object Segmentation Ho Kei Cheng et.al. 2310.12982v1 link
2023-10-19 Variational Inference for SDEs Driven by Fractional Noise Rembert Daems et.al. 2310.12975v1 null
2023-10-19 Frozen Transformers in Language Models Are Effective Visual Encoder Layers Ziqi Pang et.al. 2310.12973v1 link
2023-10-19 Bialgebra structures on flat Lie algebras Amine Bahayou et.al. 2310.12966v1 null
2023-10-19 End-to-End Delay Minimization based on Joint Optimization of DNN Partitioning and Resource Allocation for Cooperative Edge Inference Xinrui Ye et.al. 2310.12937v1 null
2023-10-19 Digital Twin-Enabled Intelligent DDoS Detection Mechanism for Autonomous Core Networks Yagmur Yigit et.al. 2310.12924v1 null
2023-10-19 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning Juan Rocamonde et.al. 2310.12921v1 null
2023-10-19 Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey Oriane Siméoni et.al. 2310.12904v1 link
2023-10-19 A Markovian dynamics for $C. elegans$ behavior across scales Antonio C. Costa et.al. 2310.12883v1 link
2023-10-19 Perceptual Assessment and Optimization of High Dynamic Range Image Rendering Peibei Cao et.al. 2310.12877v1 null
2023-10-18 SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks Mohammadreza Salehi et.al. 2310.12126v1 null
2023-10-18 Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture Daniel Y. Fu et.al. 2310.12109v1 null
2023-10-18 HSTR-Net: Reference Based Video Super-resolution for Aerial Surveillance with Dual Cameras H. Umut Suluhan et.al. 2310.12092v1 null
2023-10-18 Chemical Analysis of the Brightest Star of the Cetus II Ultra-Faint Dwarf Galaxy Candidate K. B. Webber et.al. 2310.12090v1 null
2023-10-18 One-Shot Imitation Learning: A Pose Estimation Perspective Pietro Vitiello et.al. 2310.12077v1 null
2023-10-18 Exploring Fairness in Pre-trained Visual Transformer based Natural and GAN Generated Image Detection Systems and Understanding the Impact of Image Compression in Fairness Manjary P. Gangan et.al. 2310.12076v1 null
2023-10-18 Black-Box Training Data Identification in GANs via Detector Networks Lukman Olagoke et.al. 2310.12063v1 null
2023-10-19 Robust Class-Conditional Distribution Alignment for Partial Domain Adaptation Sandipan Choudhuri et.al. 2310.12060v2 null
2023-10-18 Exact and efficient solutions of the LMC Multitask Gaussian Process model Olivier Truffinet et.al. 2310.12032v1 link
2023-10-18 CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation Philipp Borchert et.al. 2310.12024v1 link
2023-10-17 DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis Youngjoong Kwon et.al. 2310.11449v1 null
2023-10-18 4K4D: Real-Time 4D View Synthesis at 4K Resolution Zhen Xu et.al. 2310.11448v2 null
2023-10-18 EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Yaofang Liu et.al. 2310.11440v2 null
2023-10-17 Transitive generalized toggle groups containing a cycle Jonathan S. Bloom et.al. 2310.11387v1 null
2023-10-17 DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations Yazhou Zhang et.al. 2310.11374v1 null
2023-10-17 VECHR: A Dataset for Explainable and Robust Classification of Vulnerability Type in the European Court of Human Rights Shanshan Xu et.al. 2310.11368v1 null
2023-10-17 Lie Group Decompositions for Equivariant Neural Networks Mircea Mironenco et.al. 2310.11366v1 null
2023-10-17 Hybrid quantum-classical graph neural networks for tumor classification in digital pathology Anupama Ray et.al. 2310.11353v1 null
2023-10-17 The effect of stemming and lemmatization on Portuguese fake news text classification Lucca de Freitas Santos et.al. 2310.11344v1 null
2023-10-17 Influencing factors on false positive rates when classifying tumor cell line response to drug treatment Priyanka Vasanthakumari et.al. 2310.11329v1 null
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647v1 link
2023-10-16 Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting Zeyu Yang et.al. 2310.10642v1 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639v1 null
2023-10-16 Efficacy of Dual-Encoders for Extreme Multi-Label Classification Nilesh Gupta et.al. 2310.10636v1 null
2023-10-16 Overcoming the Rayleigh limit in extremely low SNR Hyunsoo Choi et.al. 2310.10633v1 null
2023-10-16 Video Language Planning Yilun Du et.al. 2310.10625v1 null
2023-10-16 DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Jia-Wei Liu et.al. 2310.10624v1 null
2023-10-16 BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation Ji Qi et.al. 2310.10586v1 null
2023-10-16 RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets Zhicheng Cai et.al. 2310.10563v1 link
2023-10-16 Deep learning applied to EEG data with different montages using spatial attention Dung Truong et.al. 2310.10550v1 null
2023-10-13 An Unbiased Look at Datasets for Visuo-Motor Pre-Training Sudeep Dasari et.al. 2310.09289v1 null
2023-10-13 Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning Geri Skenderi et.al. 2310.09278v1 null
2023-10-13 A Hybrid Approach for Depression Classification: Random Forest-ANN Ensemble on Motor Activity Signals Anket Patil et.al. 2310.09277v1 null
2023-10-13 PromptRE: Weakly-Supervised Document-Level Relation Extraction via Prompting-Based Data Programming Chufan Gao et.al. 2310.09265v1 null
2023-10-13 Political claim identification and categorization in a multilingual setting: First experiments Urs Zaberer et.al. 2310.09256v1 null
2023-10-13 It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models Lin Chen et.al. 2310.09250v1 null
2023-10-13 A Multifaceted Look at Starlink Performance Nitinder Mohan et.al. 2310.09242v1 null
2023-10-13 Time CNN and Graph Convolution Network for Epileptic Spike Detection in MEG Data Pauline Mouches et.al. 2310.09236v1 null
2023-10-13 Ultrasound Image Segmentation of Thyroid Nodule via Latent Semantic Feature Co-Registration Xuewei Li et.al. 2310.09221v1 null
2023-10-13 PaLI-3 Vision Language Models: Smaller, Faster, Stronger Xi Chen et.al. 2310.09199v1 null
2023-10-12 Octopus: Embodied Vision-Language Programmer from Environmental Feedback Jingkang Yang et.al. 2310.08588v1 link
2023-10-12 Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today? Xiaoming Zhao et.al. 2310.08587v1 null
2023-10-12 Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes Haotong Lin et.al. 2310.08585v1 null
2023-10-12 Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video Shashanka Venkataramanan et.al. 2310.08584v1 null
2023-10-12 Universal Visual Decomposer: Long-Horizon Manipulation Made Easy Zichen Zhang et.al. 2310.08581v1 null
2023-10-12 Learning to Act from Actionless Videos through Dense Correspondences Po-Chen Ko et.al. 2310.08576v1 null
2023-10-12 Effective isometries of periodic shells Hussein Nassar et.al. 2310.08531v1 null
2023-10-12 LLM-augmented Preference Learning from Natural Language Inwon Kang et.al. 2310.08523v1 null
2023-10-12 Impact of time and note duration tokenizations on deep learning symbolic music modeling Nathan Fradet et.al. 2310.08497v1 link
2023-10-12 GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models Yuanchun Shen et.al. 2310.08487v1 link
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702v1 link
2023-10-11 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Bo Peng et.al. 2310.07697v1 null
2023-10-11 Large-scale photonic computing with nonlinear disordered media Hao Wang et.al. 2310.07690v1 null
2023-10-11 Deep Video Inpainting Guided by Audio-Visual Self-Supervision Kyuyeon Kim et.al. 2310.07663v1 null
2023-10-11 Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals Eleonora Lopez et.al. 2310.07648v1 null
2023-10-11 Attention-Map Augmentation for Hypercomplex Breast Cancer Classification Eleonora Lopez et.al. 2310.07633v1 null
2023-10-11 Differentiable Euler Characteristic Transforms for Shape Classification Ernst Roell et.al. 2310.07630v1 link
2023-10-11 Time-Resolved Reconstruction of Motion, Force, and Stiffness using Spectro-Dynamic MRI Max H. C. van Riel et.al. 2310.07622v1 null
2023-10-11 Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking Gustav Nikopensius et.al. 2310.07613v1 null
2023-10-11 QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking Liangming Pan et.al. 2310.07609v1 link
2023-10-10 Convivial Solipsism as a maximally perspectival interpretation Herve Zwirn et.al. 2310.06815v1 null
2023-10-10 A Supervised Embedding and Clustering Anomaly Detection method for classification of Mobile Network Faults R. Mosayebi et.al. 2310.06779v1 null
2023-10-10 Optical assembly of nanostructures mediated by surface roughness Robert G. Felsted et.al. 2310.06774v1 null
2023-10-10 Uni3D: Exploring Unified 3D Representation at Scale Junsheng Zhou et.al. 2310.06773v1 link
2023-10-10 Improved convergence rates for some kernel random forest algorithms Isidoros Iakovidis et.al. 2310.06760v1 null
2023-10-10 Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks Marc Rußwurm et.al. 2310.06743v1 link
2023-10-10 Multi-domain improves out-of-distribution and data-limited scenarios for medical image analysis Ece Ozkan et.al. 2310.06737v1 null
2023-10-10 S4Sleep: Elucidating the design space of deep-learning-based sleep stage classification models Tiezhi Wang et.al. 2310.06715v1 link
2023-10-10 Tertiary Lymphoid Structures Generation through Graph-based Diffusion Manuel Madeira et.al. 2310.06661v1 null
2023-10-10 Assessing the Impact of a Supervised Classification Filter on Flow-based Hybrid Network Anomaly Detection Dominik Macko et.al. 2310.06656v1 link
2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Yuren Cong et.al. 2310.05922v1 null
2023-10-09 Enumerating Calabi-Yau Manifolds: Placing bounds on the number of diffeomorphism classes in the Kreuzer-Skarke list Aditi Chandra et.al. 2310.05909v1 null
2023-10-09 ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models Kaiwen Zhou et.al. 2310.05872v1 null
2023-10-10 Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models Guangzhi Sun et.al. 2310.05863v2 link
2023-10-09 Latent Wander: an Alternative Interface for Interactive and Serendipitous Discovery of Large AV Archives Yuchen Yang et.al. 2310.05835v1 null
2023-10-09 Write What You Want: Applying Text-to-video Retrieval to Audiovisual Archives Yuchen Yang et.al. 2310.05825v1 null
2023-10-09 Dipole-Spread Function Engineering for 6D Super-Resolution Microscopy Tingting Wu et.al. 2310.05810v1 null
2023-10-09 A Simple Open-Loop Baseline for Reinforcement Learning Locomotion Tasks Antonin Raffin et.al. 2310.05808v1 null
2023-10-09 Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis Haoyu Zhang et.al. 2310.05804v1 null
2023-10-10 Two-timescale Derivative Free Optimization for Performative Prediction with Markovian Data Haitong Liu et.al. 2310.05792v2 null
2023-10-06 Exploiting Transformer Activation Sparsity with Dynamic Inference Mikołaj Piórczyński et.al. 2310.04361v1 null
2023-10-06 SwimXYZ: A large-scale dataset of synthetic swimming motions and videos Fiche Guénolé et.al. 2310.04360v1 null
2023-10-06 Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services Dasol Choi et.al. 2310.04313v1 null
2023-10-06 Convergent ADMM Plug and Play PET Image Reconstruction Florent Sureau et.al. 2310.04299v1 null
2023-10-06 A Plug-and-Play Image Registration Network Junhao Hu et.al. 2310.04297v1 null
2023-10-06 Towards Non-contact 3D Ultrasound for Wrist Imaging Antony Jerald et.al. 2310.04296v1 null
2023-10-06 Spectroscopic variability of massive pre-main-sequence stars in M17 A. R. Derkink et.al. 2310.04287v1 null
2023-10-06 Multi-Industry Simplex : A Probabilistic Extension of GICS Maksim Papenkov et.al. 2310.04280v1 null
2023-10-06 Bringing Quantum Algorithms to Automated Machine Learning: A Systematic Review of AutoML Frameworks Regarding Extensibility for QML Algorithms Dennis Klau et.al. 2310.04238v1 null
2023-10-06 Written and spoken corpus of real and fake social media postings about COVID-19 Ng Bee Chin et.al. 2310.04237v1 null
2023-10-05 The Un-Kidnappable Robot: Acoustic Localization of Sneaking People Mengyu Yang et.al. 2310.03743v1 null
2023-10-05 Agent Instructs Large Language Models to be General Zero-Shot Reasoners Nicholas Crispino et.al. 2310.03710v1 link
2023-10-05 OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks Ofir Bar Tal et.al. 2310.03707v1 null
2023-10-05 Role of Spatial Coherence in Diffractive Optical Neural Networks Matthew J. Filipovich et.al. 2310.03679v1 null
2023-10-05 Certification of Deep Learning Models for Medical Image Segmentation Othmane Laousy et.al. 2310.03664v1 null
2023-10-05 Autoregressive Coefficients based Intelligent Protection of Transmission Lines Connected to Type-3 Wind Farms Pallav Kumar Bera et.al. 2310.03663v1 null
2023-10-05 Robustness-Guided Image Synthesis for Data-Free Quantization Jianhong Bai et.al. 2310.03661v1 null
2023-10-05 Balancing Autonomy and Alignment: A Multi-Dimensional Taxonomy for Autonomous LLM-powered Multi-Agent Architectures Thorsten Händler et.al. 2310.03659v1 null
2023-10-05 Strategic Evaluation: Subjects, Evaluators, and Society Benjamin Laufer et.al. 2310.03655v1 null
2023-10-05 CLEVRER-Humans: Describing Physical and Causal Events the Human Way Jiayuan Mao et.al. 2310.03635v1 null
2023-10-04 SemiReward: A General Reward Model for Semi-supervised Learning Siyuan Li et.al. 2310.03013v1 link
2023-10-04 High-dimensional SGD aligns with emerging outlier eigenspaces Gerard Ben Arous et.al. 2310.03010v1 null
2023-10-05 IBCL: Zero-shot Model Generation for Task Trade-offs in Continual Learning Pengyuan Lu et.al. 2310.02995v2 link
2023-10-04 Multiple Physics Pretraining for Physical Surrogate Models Michael McCabe et.al. 2310.02994v1 null
2023-10-04 UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network Siddhant Arora et.al. 2310.02973v1 null
2023-10-04 Fully Automatic Segmentation of Gross Target Volume and Organs-at-Risk for Radiotherapy Planning of Nasopharyngeal Carcinoma Mehdi Astaraki et.al. 2310.02972v1 null
2023-10-04 Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model Kai-Wei Chang et.al. 2310.02971v1 null
2023-10-05 Co-modeling the Sequential and Graphical Routes for Peptide Representation Learning Zihan Liu et.al. 2310.02964v2 link
2023-10-04 CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection Yang Cao et.al. 2310.02960v1 link
2023-10-04 HappyFeat -- An interactive and efficient BCI framework for clinical applications Arthur Desbois et.al. 2310.02948v1 null
2023-10-03 DREAM: Visual Decoding from Reversing Human Visual System Weihao Xia et.al. 2310.02265v1 null
2023-10-03 RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving Tong Zhao et.al. 2310.02262v1 null
2023-10-03 Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages Ananya Joshi et.al. 2310.02249v1 null
2023-10-04 Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks Greg Yang et.al. 2310.02244v2 null
2023-10-03 MIS-AVioDD: Modality Invariant and Specific Representation for Audio-Visual Deepfake Detection Vinaya Sree Katamneni et.al. 2310.02234v1 null
2023-10-03 HoloNets: Spectral Convolutions do extend to Directed Graphs Christian Koke et.al. 2310.02232v1 null
2023-10-03 Extraction of Medication and Temporal Relation from Clinical Text by Harnessing Different Deep Learning Models Hangyu Tu et.al. 2310.02229v1 null
2023-10-03 Symmetry-based classification of exact flat bands in single and bilayer moiré systems Siddhartha Sarkar et.al. 2310.02218v1 null
2023-10-03 Learnable Data Augmentation for One-Shot Unsupervised Domain Adaptation Julio Ivan Davila Carrazco et.al. 2310.02201v1 null
2023-10-03 CNN photometric redshifts in the SDSS at $r\leq 20$ M. Treyer et.al. 2310.02173v1 null
2023-09-29 A Large Language Model Approach to Educational Survey Feedback Analysis Michael J. Parker et.al. 2309.17447v1 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444v2 null
2023-09-29 Classification of Potholes Based on Surface Area Using Pre-Trained Models of Convolutional Neural Network Chauhdary Fazeel Ahmad et.al. 2309.17426v1 null
2023-09-29 CNN-based automatic segmentation of Lumen & Media boundaries in IVUS images using closed polygonal chains Pavel Sinha et.al. 2309.17406v1 null
2023-09-29 AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition Andrew Rouditchenko et.al. 2309.17395v1 null
2023-09-29 Tree Cross Attention Leo Feng et.al. 2309.17388v1 null
2023-09-29 Adversarial Imitation Learning from Visual Observations using Latent Information Vittorio Giammarino et.al. 2309.17371v1 link
2023-09-29 SpinView: General interactive visual analysis tool for multiscale computational magnetism Qichen Xu et.al. 2309.17367v1 null
2023-09-29 Asynchronous Graph Generators Christopher P. Ley et.al. 2309.17335v1 null
2023-09-29 Multi-Depth Branches Network for Efficient Image Super-Resolution Huiyuan Tian et.al. 2309.17334v1 link
2023-09-29 Demystifying CLIP Data Hu Xu et.al. 2309.16671v2 link
2023-09-28 Decaf: Monocular Deformation Capture for Face and Hand Interactions Soshi Shimada et.al. 2309.16670v1 null
2023-09-28 Training a Large Video Model on a Single Machine in a Day Yue Zhao et.al. 2309.16669v1 link
2023-09-28 Novel Deep Learning Pipeline for Automatic Weapon Detection Haribharathi Sivakumar et.al. 2309.16654v1 null
2023-09-28 ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning Qiao Gu et.al. 2309.16650v1 null
2023-09-29 Mixup Your Own Pairs Yilei Wu et.al. 2309.16633v2 link
2023-09-28 Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images Shreyas H Ramananda et.al. 2309.16627v1 null
2023-09-28 The twisting index in semitoric systems Jaume Alonso et.al. 2309.16614v1 null
2023-09-28 Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein Distance Junjie Yang et.al. 2309.16604v1 null
2023-09-28 Can LLMs Effectively Leverage Structural Information for Graph Learning: When and Why Jin Huang et.al. 2309.16595v1 null
2023-09-27 SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations Sharath Girish et.al. 2309.15848v1 null
2023-09-27 Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing Brian Yan et.al. 2309.15826v1 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818v1 link
2023-09-27 Convolutional Networks with Oriented 1D Kernels Alexandre Kirchmeyer et.al. 2309.15812v1 link
2023-09-27 A Quantum-Classical Hybrid Block-Matching Algorithm in Noisy Environment using Dissimilarity Measure M. Martínez-Felipe et.al. 2309.15792v1 null
2023-09-27 Large Language Model Routing with Benchmark Datasets Tal Shnitzer et.al. 2309.15789v1 null
2023-09-27 One For All: Video Conversation is Feasible Without Video Instruction Tuning Ruyang Liu et.al. 2309.15785v1 null
2023-09-27 Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback Teresa Yeo et.al. 2309.15762v1 null
2023-09-27 Automated CT Lung Cancer Screening Workflow using 3D Camera Brian Teixeira et.al. 2309.15750v1 null
2023-09-27 Data-Driven Latent Space Representation for Robust Bipedal Locomotion Learning Guillermo A. Castillo et.al. 2309.15740v1 null
2023-09-26 Classification of symmetry-enriched topological quantum spin liquids Weicheng Ye et.al. 2309.15118v1 null
2023-09-26 Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow Zhenyu Jiang et.al. 2309.15110v1 null
2023-09-27 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Yaohui Wang et.al. 2309.15103v2 null
2023-09-26 VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning Han Lin et.al. 2309.15091v1 null
2023-09-26 Video-adverb retrieval with compositional adverb-action embeddings Thomas Hummel et.al. 2309.15086v1 null
2023-09-26 Challenges of building medical image datasets for development of deep learning software in stroke Alessandro Fontanella et.al. 2309.15081v1 null
2023-09-26 On Excess Risk Convergence Rates of Neural Network Classifiers Hyunouk Ko et.al. 2309.15075v1 null
2023-09-26 Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding Christina Kassab et.al. 2309.15065v1 null
2023-09-26 QUILT: Effective Multi-Class Classification on Quantum Computers Using an Ensemble of Diverse Quantum Classifiers Daniel Silver et.al. 2309.15056v1 null
2023-09-26 Thalamic nuclei segmentation from T$_1$-weighted MRI: unifying and benchmarking state-of-the-art methods with young and old cohorts Brendan Williams et.al. 2309.15053v1 null
2023-09-25 Extreme Parkour with Legged Robots Xuxin Cheng et.al. 2309.14341v1 null
2023-09-25 Chop & Learn: Recognizing and Generating Object-State Compositions Nirat Saini et.al. 2309.14339v1 null
2023-09-25 Human-Assisted Continual Robot Learning with Foundation Models Meenal Parakh et.al. 2309.14321v1 null
2023-09-25 MUTEX: Learning Unified Policies from Multimodal Task Specifications Rutav Shah et.al. 2309.14320v1 null
2023-09-25 DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning Qingjie Meng et.al. 2309.14306v1 null
2023-09-25 NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields Saeejith Nair et.al. 2309.14293v1 null
2023-09-25 CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free Monika Wysoczańska et.al. 2309.14289v1 null
2023-09-25 Comparison of One- Two- and Three- Dimensional CNN models for Drawing-Test-Based Diagnostics of the Parkinson's Disease Xuechao Wang et.al. 2309.14288v1 null
2023-09-26 Virtual Hyperspectral Images Using Symmetric Autoencoders Archisman Bhattacharjee et.al. 2309.14286v2 null
2023-09-25 OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding Hao Peng et.al. 2309.14258v1 link
2023-09-22 Robotic Offline RL from Internet Videos via Value-Function Pre-Training Chethan Bhateja et.al. 2309.13041v1 null
2023-09-22 Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception?

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages