Showing 1–50 of 337 results for author: Banerjee, S

  1. arXiv:2408.10429  [pdf, other]

    cs.GT

    Price Competition Under A Consider-Then-Choose Model With Lexicographic Choice

    Authors: Siddhartha Banerjee, Chamsi Hssaine, Vijay Kamble

    Abstract: The sorting and filtering capabilities offered by modern e-commerce platforms significantly impact customers' purchase decisions, as well as the resulting prices set by competing sellers on these platforms. Motivated by this practical reality, we study price competition under a flexible choice model: Consider-then-Choose with Lexicographic Choice (CLC). In this model, a customer first forms a cons…

    Submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2408.10090  [pdf, other]

    cs.LG cs.DC

    Federated Frank-Wolfe Algorithm

    Authors: Ali Dadras, Sourasekhar Banerjee, Karthik Prakhya, Alp Yurtsever

    Abstract: Federated learning (FL) has gained a lot of attention in recent years for building privacy-preserving collaborative learning systems. However, FL algorithms for constrained machine learning problems are still limited, particularly when the projection step is costly. To this end, we propose a Federated Frank-Wolfe Algorithm (FedFW). FedFW features data privacy, low per-iteration cost, and communica…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

  3. arXiv:2408.07689  [pdf, other]

    cs.CV

    Detecting Near-Duplicate Face Images

    Authors: Sudipta Banerjee, Arun Ross

    Abstract: Near-duplicate images are often generated when applying repeated photometric and geometric transformations that produce imperceptible variants of the original image. Consequently, a deluge of near-duplicates can be circulated online posing copyright infringement concerns. The concerns are more severe when biometric data is altered through such nuanced transformations. In this work, we address the…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Under review

  4. arXiv:2408.05924  [pdf, other]

    cs.RO cs.AI

    Adapting a Foundation Model for Space-based Tasks

    Authors: Matthew Foutter, Praneet Bhoj, Rohan Sinha, Amine Elhafsi, Somrita Banerjee, Christopher Agia, Justin Kruger, Tommaso Guffanti, Daniele Gammelli, Simone D'Amico, Marco Pavone

    Abstract: Foundation models, e.g., large language models, possess attributes of intelligence which offer promise to endow a robot with the contextual understanding necessary to navigate complex, unstructured tasks in the wild. In the future of space robotics, we see three core challenges which motivate the use of a foundation model adapted to space-based applications: 1) Scalability of ground-in-the-loop op…

    Submitted 12 August, 2024; originally announced August 2024.

  5. arXiv:2408.04870  [pdf, other]

    cs.CR cs.AI

    ConfusedPilot: Confused Deputy Risks in RAG-based LLMs

    Authors: Ayush RoyChowdhury, Mulong Luo, Prateek Sahu, Sarbartha Banerjee, Mohit Tiwari

    Abstract: Retrieval augmented generation (RAG) is a process where a large language model (LLM) retrieves useful information from a database and then generates the responses. It is becoming popular in enterprise settings for daily business operations. For example, Copilot for Microsoft 365 has accumulated millions of businesses. However, the security implications of adopting such RAG-based systems are unclea…

    Submitted 15 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  6. arXiv:2407.21748  [pdf, other]

    cs.RO cs.LG

    Diagnostic Runtime Monitoring with Martingales

    Authors: Ali Hindy, Rachel Luo, Somrita Banerjee, Jonathan Kuck, Edward Schmerling, Marco Pavone

    Abstract: Machine learning systems deployed in safety-critical robotics settings must be robust to distribution shifts. However, system designers must understand the cause of a distribution shift in order to implement the appropriate intervention or mitigation strategy and prevent system failure. In this paper, we present a novel framework for diagnosing distribution shifts in a streaming fashion by deployi…

    Submitted 31 July, 2024; originally announced July 2024.

  7. arXiv:2407.19119  [pdf, other]

    cs.LG cs.AI cs.CR

    Accuracy-Privacy Trade-off in the Mitigation of Membership Inference Attack in Federated Learning

    Authors: Sayyed Farid Ahamed, Soumya Banerjee, Sandip Roy, Devin Quinn, Marc Vucovich, Kevin Choi, Abdul Rahman, Alison Hu, Edward Bowen, Sachin Shetty

    Abstract: Over the last few years, federated learning (FL) has emerged as a prominent method in machine learning, emphasizing privacy preservation by allowing multiple clients to collaboratively build a model while keeping their training data private. Despite this focus on privacy, FL models are susceptible to various attacks, including membership inference attacks (MIAs), posing a serious threat to data co…

    Submitted 26 July, 2024; originally announced July 2024.

  8. arXiv:2407.17777  [pdf, other]

    eess.SP cs.AI

    Advancing Multi-Modal Sensing Through Expandable Modality Alignment

    Authors: Shenghong Dai, Shiqi Jiang, Yifan Yang, Ting Cao, Mo Li, Suman Banerjee, Lili Qiu

    Abstract: Sensing technology is widely used for comprehending the physical world, with numerous modalities explored in past decades. While there has been considerable work on multi-modality learning, existing approaches all require that data of all modalities be paired. How to leverage multi-modality data with only partial pairings remains an open problem. To tackle this challenge, we introduce the Babel framework, encompassing t…

    Submitted 25 July, 2024; originally announced July 2024.

  9. arXiv:2407.14251  [pdf, other]

    cs.LG cs.AI math.OC

    Personalized Multi-tier Federated Learning

    Authors: Sourasekhar Banerjee, Ali Dadras, Alp Yurtsever, Monowar Bhuyan

    Abstract: The key challenge of personalized federated learning (PerFL) is to capture the statistical heterogeneity properties of data with inexpensive communications and gain customized performance for participating devices. To address these, we introduced personalized federated learning in multi-tier architecture (PerMFL) to obtain optimized and personalized local models when there are known team structure…

    Submitted 19 July, 2024; originally announced July 2024.

  10. arXiv:2407.00091  [pdf, other]

    cs.IR cs.HC cs.LG

    Learning to Rank for Maps at Airbnb

    Authors: Malay Haldar, Hongwei Zhang, Kedar Bellare, Sherry Chen, Soumyadip Banerjee, Xiaotang Wang, Mustafa Abdool, Huiji Gao, Pavan Tapadia, Liwei He, Sanjeev Katariya

    Abstract: As a two-sided marketplace, Airbnb brings together hosts who own listings for rent with prospective guests from around the globe. Results from a guest's search for listings are displayed primarily through two interfaces: (1) as a list of rectangular cards that contain on them the listing image, price, rating, and other details, referred to as list-results (2) as oval pins on a map showing the list…

    Submitted 25 June, 2024; originally announced July 2024.

  11. arXiv:2406.12274  [pdf, other]

    cs.CL

    SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

    Authors: Somnath Banerjee, Soham Tripathy, Sayan Layek, Shanu Kumar, Animesh Mukherjee, Rima Hazra

    Abstract: Safety-aligned language models often exhibit fragile and imbalanced safety mechanisms, increasing the likelihood of generating unsafe content. In addition, incorporating new knowledge through editing techniques to language models can further compromise safety. To address these issues, we propose SafeInfer, a context-adaptive, decoding-time safety alignment strategy for generating safe responses to…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review

  12. arXiv:2406.11801  [pdf, other]

    cs.CL

    Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: Ensuring the safe alignment of large language models (LLMs) with human values is critical as they become integral to applications like translation and question answering. Current alignment methods struggle with dynamic user intentions and complex objectives, making models vulnerable to generating harmful content. We propose Safety Arithmetic, a training-free framework enhancing LLM safety across d…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under Review. Codes are available at: https://github.com/declare-lab/safety-arithmetic

  13. arXiv:2406.11139  [pdf, other]

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll…

    Submitted 17 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: Under review

  14. arXiv:2406.10886  [pdf, other]

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…

    Submitted 16 June, 2024; originally announced June 2024.

  15. arXiv:2406.08307  [pdf, other]

    stat.ML cs.LG

    Measuring model variability using robust non-parametric testing

    Authors: Sinjini Banerjee, Tim Marrinan, Reilly Cannon, Tony Chiang, Anand D. Sarwate

    Abstract: Training a deep neural network often involves stochastic optimization, meaning each run will produce a different model. The seed used to initialize random elements of the optimization procedure heavily influences the quality of a trained model, which may be obscure from many commonly reported summary statistics, like accuracy. However, random seed is often not included in hyper-parameter optimizat…

    Submitted 12 June, 2024; originally announced June 2024.

  16. arXiv:2406.02402  [pdf, other]

    math.OC cs.GT stat.ML

    Online Fair Allocation of Perishable Resources

    Authors: Siddhartha Banerjee, Chamsi Hssaine, Sean R. Sinclair

    Abstract: We consider a practically motivated variant of the canonical online fair allocation problem: a decision-maker has a budget of perishable resources to allocate over a fixed number of rounds. Each round sees a random number of arrivals, and the decision-maker must commit to an allocation for these individuals before moving on to the next round. The goal is to construct a sequence of allocations that…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 51 pages, 8 figures

    MSC Class: 91B32

  17. arXiv:2406.00375  [pdf, other]

    cs.RO

    Teledrive: An Embodied AI based Telepresence System

    Authors: Snehasis Banerjee, Sayan Paul, Ruddradev Roychoudhury, Abhijan Bhattacharya, Chayan Sarkar, Ashis Sau, Pradip Pramanick, Brojeshwar Bhowmick

    Abstract: This article presents Teledrive, a telepresence robotic system with embodied AI features that empowers an operator to navigate the telerobot in any unknown remote place with minimal human intervention. We conceive Teledrive in the context of democratizing remote care-giving for elderly citizens as well as for isolated patients, affected by contagious diseases. In particular, this paper focuses on…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted in Journal of Intelligent Robotic System

    Journal ref: Journal of Intelligent Robotic System 2024

  18. arXiv:2404.12913  [pdf, other]

    cs.DB

    Influential Billboard Slot Selection under Zonal Influence Constraint

    Authors: Dildar Ali, Suman Banerjee, Yamuna Prasad

    Abstract: Given billboard and trajectory database, finding a limited number of billboard slots for maximizing the influence is an important problem in the context of billboard advertisement. Most of the existing literature focused on the influential slot selection problem without considering any specific zonal influence constraint. To bridge this gap in this paper, we introduce and study the Influential Bil…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 14 Pages

  19. arXiv:2404.05243  [pdf, other]

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…

    Submitted 8 April, 2024; originally announced April 2024.

  20. arXiv:2404.03587  [pdf, other]

    cs.RO cs.AI

    Anticipate & Collab: Data-driven Task Anticipation and Knowledge-driven Planning for Human-robot Collaboration

    Authors: Shivam Singh, Karthik Swaminathan, Raghav Arora, Ramandeep Singh, Ahana Datta, Dipanjan Das, Snehasis Banerjee, Mohan Sridharan, Madhava Krishna

    Abstract: An agent assisting humans in daily living activities can collaborate more effectively by anticipating upcoming tasks. Data-driven methods represent the state of the art in task anticipation, planning, and related problems, but these methods are resource-hungry and opaque. Our prior work introduced a proof of concept framework that used an LLM to anticipate 3 high-level tasks that served as goals f…

    Submitted 4 April, 2024; originally announced April 2024.

  21. arXiv:2404.01632  [pdf, other]

    cs.LG eess.SY

    Enhancing Functional Safety in Automotive AMS Circuits through Unsupervised Machine Learning

    Authors: Ayush Arunachalam, Ian Kintz, Suvadeep Banerjee, Arnab Raha, Xiankun Jin, Fei Su, Viswanathan Pillai Prasanth, Rubin A. Parekhji, Suriyaprakash Natarajan, Kanad Basu

    Abstract: Given the widespread use of safety-critical applications in the automotive field, it is crucial to ensure the Functional Safety (FuSa) of circuits and components within automotive systems. The Analog and Mixed-Signal (AMS) circuits prevalent in these systems are more vulnerable to faults induced by parametric perturbations, noise, environmental stress, and other factors, in comparison to their dig…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 12 figures

  22. arXiv:2403.19717  [pdf, other]

    cs.LG cs.CR cs.CY

    A Picture is Worth 500 Labels: A Case Study of Demographic Disparities in Local Machine Learning Models for Instagram and TikTok

    Authors: Jack West, Lea Thiemt, Shimaa Ahmed, Maggie Bartig, Kassem Fawaz, Suman Banerjee

    Abstract: Mobile apps have embraced user privacy by moving their data processing to the user's smartphone. Advanced machine learning (ML) models, such as vision models, can now locally analyze user images to extract insights that drive several functionalities. Capitalizing on this new processing model of locally analyzing user images, we analyze two popular social media apps, TikTok and Instagram, to reveal…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 18 pages, 13 figures, to appear at IEEE Symposium on Security and Privacy 2024

    ACM Class: K.4.2; C.4; D.2.2

  23. arXiv:2403.12047  [pdf, other]

    cs.CV

    Alpha-wolves and Alpha-mammals: Exploring Dictionary Attacks on Iris Recognition Systems

    Authors: Sudipta Banerjee, Anubhav Jain, Zehua Jiang, Nasir Memon, Julian Togelius, Arun Ross

    Abstract: A dictionary attack in a biometric system entails the use of a small number of strategically generated images or templates to successfully match with a large number of identities, thereby compromising security. We focus on dictionary attacks at the template level, specifically the IrisCodes used in iris recognition systems. We present a hitherto unknown vulnerability wherein we mix IrisCodes usin…

    Submitted 20 November, 2023; originally announced March 2024.

    Comments: 8 pages, 5 figures, 13 tables, Workshop on Manipulation, Adversarial, and Presentation Attacks in Biometrics, Winter Conference on Applications of Computer Vision

  24. arXiv:2403.08092  [pdf, other]

    cs.CV

    Mitigating the Impact of Attribute Editing on Face Recognition

    Authors: Sudipta Banerjee, Sai Pranaswi Mullangi, Shruti Wagle, Chinmay Hegde, Nasir Memon

    Abstract: Through a large-scale study over diverse face images, we show that facial attribute editing using modern generative AI models can severely degrade automated face recognition systems. This degradation persists even with identity-preserving generative models. To mitigate this issue, we propose two novel techniques for local and global attribute editing. We empirically ablate twenty-six facial semant…

    Submitted 9 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Under review

  25. arXiv:2403.04660  [pdf, other]

    cs.HC

    Exploring the Design Space of Optical See-through AR Head-Mounted Displays to Support First Responders in the Field

    Authors: Kexin Zhang, Brianna Cochran, Ruijia Chen, Lance Hartung, Bryce Sprecher, Ross Tredinnick, Kevin Ponto, Suman Banerjee, Yuhang Zhao

    Abstract: First responders (FRs) navigate hazardous, unfamiliar environments in the field (e.g., mass-casualty incidents), making life-changing decisions in a split second. AR head-mounted displays (HMDs) have shown promise in supporting them due to their capability of recognizing and augmenting the challenging environments in a hands-free manner. However, the design space has not been thoroughly explored by…

    Submitted 7 March, 2024; originally announced March 2024.

    Journal ref: CHI 2024

  26. arXiv:2403.02688  [pdf, other]

    cs.ET cs.AI cs.LG

    DOCTOR: Dynamic On-Chip Temporal Variation Remediation Toward Self-Corrected Photonic Tensor Accelerators

    Authors: Haotian Lu, Sanmitra Banerjee, Jiaqi Gu

    Abstract: Photonic computing has emerged as a promising solution for accelerating computation-intensive artificial intelligence (AI) workloads, offering unparalleled speed and energy efficiency, especially in resource-limited, latency-sensitive edge computing environments. However, the deployment of analog photonic tensor accelerators encounters reliability challenges due to hardware noise and environmental…

    Submitted 31 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages. Accepted to IEEE JLT 2024

  27. arXiv:2402.17720  [pdf, other]

    cs.LG cs.DS cs.IT

    The SMART approach to instance-optimal online learning

    Authors: Siddhartha Banerjee, Alankrita Bhatt, Christina Lee Yu

    Abstract: We devise an online learning algorithm -- titled Switching via Monotone Adapted Regret Traces (SMART) -- that adapts to the data and achieves regret that is instance optimal, i.e., simultaneously competitive on every input sequence compared to the performance of the follow-the-leader (FTL) policy and the worst case guarantee of any other input policy. We show that the regret of the SMART policy on…

    Submitted 27 February, 2024; originally announced February 2024.

  28. arXiv:2402.16342  [pdf, other]

    cs.AI cs.RO

    Contingency Planning Using Bi-level Markov Decision Processes for Space Missions

    Authors: Somrita Banerjee, Edward Balaban, Mark Shirley, Kevin Bradner, Marco Pavone

    Abstract: This work focuses on autonomous contingency planning for scientific missions by enabling rapid policy computation from any off-nominal point in the state space in the event of a delay or deviation from the nominal mission plan. Successful contingency planning involves managing risks and rewards, often probabilistically associated with actions, in stochastic scenarios. Markov Decision Processes (MD…

    Submitted 26 February, 2024; originally announced February 2024.

  29. arXiv:2402.16159  [pdf, other]

    cs.CL

    DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others has become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which…

    Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at ECML-PKDD 2024 (Long Paper)

  30. arXiv:2402.15473  [pdf, other]

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  31. arXiv:2402.15302  [pdf, other]

    cs.CL cs.CR

    How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

    Authors: Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: In this study, we tackle a growing concern around the safety and ethical use of large language models (LLMs). Despite their potential, these models can be tricked into producing harmful or unethical content through various sophisticated methods, including 'jailbreaking' techniques and targeted manipulation. Our work zeroes in on a specific issue: to what extent LLMs can be led astray by asking the…

    Submitted 15 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review. https://huggingface.co/datasets/SoftMINER-Group/TechHazardQA

  32. arXiv:2402.14702  [pdf, other]

    cs.CL

    InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

    Authors: Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, influence functions present an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are twofold. First we incorporate influence functions as a feedback into the model to improve its performance. Second, in a dataset extension exercise, using influence f…

    Submitted 9 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  33. arXiv:2402.11683  [pdf, other]

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  34. Publicly auditable privacy-preserving electoral rolls

    Authors: Prashant Agrawal, Mahabir Prasad Jhanwar, Subodh Vishnu Sharma, Subhashis Banerjee

    Abstract: While existing literature on electronic voting has extensively addressed verifiability of voting protocols, the vulnerability of electoral rolls in large public elections remains a critical concern. To ensure integrity of electoral rolls, the current practice is to either make electoral rolls public or share them with the political parties. However, this enables construction of detailed voter prof…

    Submitted 2 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Report number: CSF 2024

    Journal ref: 2024 IEEE 37th Computer Security Foundations Symposium (CSF)

  35. arXiv:2402.03507  [pdf, other]

    cs.AI cs.CL cs.LG

    Neural networks for abstraction and reasoning: Towards broad generalization in machines

    Authors: Mikel Bober-Irizar, Soumya Banerjee

    Abstract: For half a century, artificial intelligence research has attempted to reproduce the human qualities of abstraction and reasoning - creating computer systems that can learn new concepts from a minimal set of examples, in settings where humans find this easy. While specific neural networks are able to solve an impressive range of problems, broad generalisation to situations outside their training da…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 32 pages main text, 17 pages

  36. arXiv:2402.01294  [pdf, other]

    cs.DB cs.IR cs.MA

    Minimizing Regret in Billboard Advertisement under Zonal Influence Constraint

    Authors: Dildar Ali, Suman Banerjee, Yamuna Prasad

    Abstract: In a typical billboard advertisement technique, a number of digital billboards are owned by an influence provider, and many advertisers approach the influence provider for a specific number of views of their advertisement content on a payment basis. If the influence provider provides the demanded or more influence, then he will receive the full payment or else a partial payment. In the context of…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 32 Pages

  37. arXiv:2402.00845  [pdf, ps, other]

    cs.IT cs.GT cs.NI eess.SP eess.SY

    When to Preempt in a Status Update System?

    Authors: Subhankar Banerjee, Sennur Ulukus

    Abstract: We consider a time-slotted status update system with an error-free preemptive queue. The goal of the sampler-scheduler pair is to minimize the age of information at the monitor by sampling and transmitting the freshly sampled update packets to the monitor. The sampler-scheduler pair also has a choice to preempt an old update packet from the server and transmit a new update packet to the server. We…

    Submitted 20 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  38. arXiv:2401.16649  [pdf, other]

    cs.LG cs.CR

    Using Motion Forecasting for Behavior-Based Virtual Reality (VR) Authentication

    Authors: Mingjun Li, Natasha Kholgade Banerjee, Sean Banerjee

    Abstract: Task-based behavioral biometric authentication of users interacting in virtual reality (VR) environments enables seamless continuous authentication by using only the motion trajectories of the person's body as a unique signature. Deep learning-based approaches for behavioral biometrics show high accuracy when using complete or near complete portions of the user trajectory, but show lower performan…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: AIxVR 2024 Best Paper Award

  39. arXiv:2401.16464  [pdf, other]

    cs.IR cs.DB cs.LG cs.MA

    Towards Regret Free Slot Allocation in Billboard Advertisement

    Authors: Dildar Ali, Suman Banerjee, Yamuna Prasad

    Abstract: Creating and maximizing influence among the customers is one of the central goals of an advertiser, and hence, remains an active area of research in recent times. In this advertisement technique, the advertisers approach an influence provider for a specific number of views of their content on a payment basis. Now, if the influence provider can provide the required number of views or more, he will…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 37 Pages

  40. arXiv:2401.16443  [pdf, other]

    cs.HC cs.AI cs.LG

    Evaluating Deep Networks for Detecting User Familiarity with VR from Hand Interactions

    Authors: Mingjun Li, Numan Zafar, Natasha Kholgade Banerjee, Sean Banerjee

    Abstract: As VR devices become more prevalent in the consumer space, VR applications are likely to be increasingly used by users unfamiliar with VR. Detecting the familiarity level of a user with VR as an interaction medium provides the potential of providing on-demand training for acclimatization and prevents the user from being burdened by the VR environment in accomplishing their tasks. In this work, we…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: AIxVR 2024 poster paper

  41. arXiv:2401.12671  [pdf, other]

    cs.CL

    Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

    Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

    Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d…

    Submitted 5 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  42. Automatic Recognition of Learning Resource Category in a Digital Library

    Authors: Soumya Banerjee, Debarshi Kumar Sanyal, Samiran Chattopadhyay, Plaban Kumar Bhowmick, Partha Pratim Das

    Abstract: Digital libraries often face the challenge of processing a large volume of diverse document types. The manual collection and tagging of metadata can be a time-consuming and error-prone task. To address this, we aim to develop an automatic metadata extractor for digital libraries. In this work, we introduce the Heterogeneous Learning Resources (HLR) dataset designed for document image classificatio…

    Submitted 28 November, 2023; originally announced January 2024.

    Comments: 2 pages, 3 figures, Published in JCDL 21

  43. arXiv:2401.10647  [pdf, other]

    cs.CL

    Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: In the rapidly advancing field of artificial intelligence, the concept of Red-Teaming or Jailbreaking large language models (LLMs) has emerged as a crucial area of study. This approach is especially significant in terms of assessing and enhancing the safety and robustness of these models. This paper investigates the intricate consequences of such modifications through model editing, uncovering a c…

    Submitted 16 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024

  44. arXiv:2401.10601  [pdf, other]

    cs.DS cs.DB

    Influential Slot and Tag Selection in Billboard Advertisement

    Authors: Dildar Ali, Tejash Gupta, Suman Banerjee, Yamuna Prasad

    Abstract: The selection of influential billboard slots remains an important problem in billboard advertisements. Existing studies on this problem have not considered the case of context-specific influence probability. To bridge this gap, in this paper, we introduce the Context Dependent Influential Billboard Slot Selection Problem. First, we show that the problem is NP-hard. We also show that the influence…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages

  45. arXiv:2312.16071  [pdf, other]

    cs.NE cs.AI cs.GR cs.LG

    Event-based Shape from Polarization with Spiking Neural Networks

    Authors: Peng Kang, Srutarshi Banerjee, Henry Chopp, Aggelos Katsaggelos, Oliver Cossairt

    Abstract: Recent advances in event-based shape determination from polarization offer a transformative approach that tackles the trade-off between speed and accuracy in capturing surface geometries. In this paper, we investigate event-based shape from polarization using Spiking Neural Networks (SNNs), introducing the Single-Timestep and Multi-Timestep Spiking UNets for effective and efficient surface normal…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 25 pages

  46. Performance Analysis of Fixed Broadband Wireless Access in mmWave Band in 5G

    Authors: Soumya Banerjee, Sarada Prasad Gochhayat, Sachin Shetty

    Abstract: An end-to-end fiber-based network holds the potential to provide multi-gigabit fixed access to end-users. However, deploying fiber access, especially in areas where fiber is non-existent, can be time-consuming and costly, resulting in delayed returns for Operators. This work investigates transmission data from fixed broadband wireless access in the mmWave band in 5G. Given the growing interest in…

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 6 pages, 16 figures, Published in ICNC 22

  47. arXiv:2312.05626  [pdf, other]

    cs.SE cs.AI

    Redefining Developer Assistance: Through Large Language Models in Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Sayan Layek, Amruit Sahoo, Sam Conrad Joyce, Rima Hazra

    Abstract: In this paper, we delve into the advancement of domain-specific Large Language Models (LLMs) with a focus on their application in software development. We introduce DevAssistLlama, a model developed through instruction tuning, to assist developers in processing software-related natural language queries. This model, a variant of instruction tuned LLM, is particularly adept at handling intricate tec…

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: Under review

  48. arXiv:2312.00507  [pdf, other]

    cs.PL cs.CR cs.LG

    VEXIR2Vec: An Architecture-Neutral Embedding Framework for Binary Similarity

    Authors: S. VenkataKeerthy, Soumya Banerjee, Sayan Dey, Yashas Andaluri, Raghul PS, Subrahmanyam Kalyanasundaram, Fernando Magno Quintão Pereira, Ramakrishna Upadrasta

    Abstract: Binary similarity involves determining whether two binary programs exhibit similar functionality, often originating from the same source code. In this work, we propose VexIR2Vec, an approach for binary similarity using VEX-IR, an architecture-neutral Intermediate Representation (IR). We extract the embeddings from sequences of basic blocks, termed peepholes, derived by random walks on the control-…

    Submitted 9 July, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  49. arXiv:2312.00051  [pdf, other]

    cs.CR cs.AI cs.LG

    MIA-BAD: An Approach for Enhancing Membership Inference Attack and its Mitigation with Federated Learning

    Authors: Soumya Banerjee, Sandip Roy, Sayyed Farid Ahamed, Devin Quinn, Marc Vucovich, Dhruv Nandakumar, Kevin Choi, Abdul Rahman, Edward Bowen, Sachin Shetty

    Abstract: The membership inference attack (MIA) is a popular paradigm for compromising the privacy of a machine learning (ML) model. MIA exploits the natural inclination of ML models to overfit upon the training data. MIAs are trained to distinguish between training and testing prediction confidence to infer membership information. Federated Learning (FL) is a privacy-preserving ML paradigm that enables mul…

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures, Accepted to be published in ICNC 23

  50. arXiv:2311.17097  [pdf, other]

    cs.LG cs.AI cs.CR cs.NI

    Anonymous Jamming Detection in 5G with Bayesian Network Model Based Inference Analysis

    Authors: Ying Wang, Shashank Jere, Soumya Banerjee, Lingjia Liu, Sachin Shetty, Shehadi Dayekh

    Abstract: Jamming and intrusion detection are critical in 5G research, aiming to maintain reliability, prevent user experience degradation, and avoid infrastructure failure. This paper introduces an anonymous jamming detection model for 5G based on signal parameters from the protocol stacks. The system uses supervised and unsupervised learning for real-time, high-accuracy detection of jamming, including unk…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 6 pages, 9 figures, Published in HPSR22. arXiv admin note: text overlap with arXiv:2304.13660