Showing 1–50 of 73 results for author: Chow, Y

  1. arXiv:2408.01562  [pdf]

    cs.CY

    Welfare, sustainability, and equity evaluation of the New York City Interborough Express using spatially heterogeneous mode choice models

    Authors: Hai Yang, Hongying Wu, Lauren Whang, Xiyuan Ren, Joseph Y. J. Chow

    Abstract: The Metropolitan Transit Authority (MTA) proposed building a new light rail route called the Interborough Express (IBX) to provide a direct, fast transit linkage between Queens and Brooklyn. An open-access synthetic citywide trip agenda dataset and a block-group-level mode choice model are used to assess the potential impact IBX could bring to New York City (NYC). IBX could save 28.1 minutes to po…

    Submitted 2 August, 2024; originally announced August 2024.

  2. arXiv:2406.00024  [pdf, other]

    cs.CL cs.AI cs.ET cs.LG

    Embedding-Aligned Language Models

    Authors: Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Lior Shani, Ethan Liang, Craig Boutilier

    Abstract: We propose a novel approach for training large language models (LLMs) to adhere to objectives defined within a latent embedding space. Our method leverages reinforcement learning (RL), treating a pre-trained LLM as an environment. Our embedding-aligned guided language (EAGLE) agent is trained to iteratively steer the LLM's generation towards optimal regions of the latent embedding space, w.r.t. so…

    Submitted 24 May, 2024; originally announced June 2024.

  3. arXiv:2404.05053  [pdf, ps, other]

    math.CO cs.GT

    Cooking Poisons: Thinking Laterally with Game Theory

    Authors: Timothy Y. Chow

    Abstract: We revive an old lateral-thinking puzzle by Michael Rabin, involving poisons with strange properties. We show that the puzzle admits several unintended solutions that are just as interesting as the intended solution. Analyzing these alternative solutions using game theory yields surprisingly subtle results and several unanswered questions.

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 7 pages, to be published in Mathematics Magazine

    MSC Class: 91A05

  4. arXiv:2402.15957  [pdf, other]

    cs.LG

    DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning

    Authors: Anthony Liang, Guy Tennenholtz, Chih-wei Hsu, Yinlam Chow, Erdem Bıyık, Craig Boutilier

    Abstract: We introduce DynaMITE-RL, a meta-reinforcement learning (meta-RL) approach to approximate inference in environments where the latent state evolves at varying rates. We model episode sessions - parts of the episode where the latent state is fixed - and propose three key modifications to existing meta-RL methods: consistency of latent information within sessions, session masking, and prior latent co…

    Submitted 24 February, 2024; originally announced February 2024.

  5. arXiv:2402.14925  [pdf, other]

    cs.IT cs.LG math.ST

    Efficient Unbiased Sparsification

    Authors: Leighton Barnes, Stephen Cameron, Timothy Chow, Emma Cohen, Keith Frankston, Benjamin Howard, Fred Kochman, Daniel Scheinerman, Jeffrey VanderKam

    Abstract: An unbiased $m$-sparsification of a vector $p\in \mathbb{R}^n$ is a random vector $Q\in \mathbb{R}^n$ with mean $p$ that has at most $m<n$ nonzero coordinates. Unbiased sparsification compresses the original vector without introducing bias; it arises in various contexts, such as in federated learning and sampling sparse probability distributions. Ideally, unbiased sparsification should also minimi…

    Submitted 24 July, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  6. PyTy: Repairing Static Type Errors in Python

    Authors: Yiu Wai Chow, Luca Di Grazia, Michael Pradel

    Abstract: Gradual typing enables developers to annotate types of their own choosing, offering a flexible middle ground between no type annotations and a fully statically typed language. As more and more code bases get type-annotated, static type checkers detect an increasingly large number of type errors. Unfortunately, fixing these errors requires manual effort, hampering the adoption of gradual typing in…

    Submitted 12 January, 2024; originally announced January 2024.

    Journal ref: ICSE 2024

  7. arXiv:2311.02085  [pdf, other]

    cs.IR cs.AI

    Preference Elicitation with Soft Attributes in Interactive Recommendation

    Authors: Erdem Biyik, Fan Yao, Yinlam Chow, Alex Haig, Chih-wei Hsu, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Preference elicitation plays a central role in interactive recommender systems. Most preference elicitation approaches use either item queries that ask users to select preferred items from a slate, or attribute queries that ask them to express their preferences for item characteristics. Unfortunately, users often wish to describe their preferences using soft attributes for which no ground-truth se…

    Submitted 22 October, 2023; originally announced November 2023.

  8. arXiv:2310.17475  [pdf]

    cs.CY

    Analytical model for large-scale design of sidewalk delivery robot systems

    Authors: Hai Yang, Yuchen Du, Tho V. Le, Joseph Y. J. Chow

    Abstract: With the rise in demand for local deliveries and e-commerce, robotic deliveries are being considered as efficient and sustainable solutions. However, the deployment of such systems can be highly complex due to numerous factors involving stochastic demand, stochastic charging and maintenance needs, complex routing, etc. We propose a model that uses continuous approximation methods for evaluating se…

    Submitted 26 October, 2023; originally announced October 2023.

  9. arXiv:2310.06176  [pdf, other]

    cs.AI

    Factual and Personalized Recommendations using Language Models and Reinforcement Learning

    Authors: Jihwan Jeong, Yinlam Chow, Guy Tennenholtz, Chih-Wei Hsu, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Recommender systems (RSs) play a central role in connecting users to content, products, and services, matching candidate items to users based on their preferences. While traditional RSs rely on implicit user feedback signals, conversational RSs interact with users in natural language. In this work, we develop a comPelling, Precise, Personalized, Preference-relevant language model (P4LM) that recom…

    Submitted 9 October, 2023; originally announced October 2023.

  10. arXiv:2310.04475  [pdf, other]

    cs.CL cs.AI cs.LG

    Demystifying Embedding Spaces using Large Language Models

    Authors: Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier

    Abstract: Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machin…

    Submitted 13 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  11. arXiv:2305.09452  [pdf]

    cs.AI cs.CY

    A sequential transit network design algorithm with optimal learning under correlated beliefs

    Authors: Gyugeun Yoon, Joseph Y. J. Chow

    Abstract: Mobility service route design requires demand information to operate in a service region. Transit planners and operators can access various data sources including household travel survey data and mobile device location logs. However, when implementing a mobility system with emerging technologies, estimating demand becomes harder because of limited data resulting in uncertainty. This study proposes…

    Submitted 26 January, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

  12. arXiv:2305.04324  [pdf]

    cs.CY eess.SY

    A generalized network level disruption strategy selection model for urban public transport systems

    Authors: Qi Liu, Joseph Y. J. Chow

    Abstract: A fast recovery from disruptions is of vital importance for the reliability of transit systems. This study presents a new attempt to tackle the transit disruption mitigation problem in a comprehensive and hierarchical way. A network level strategy selection optimization model is formulated as a joint routing and resource allocation (nJRRA) problem. By constraining the problem further into an epsil…

    Submitted 7 May, 2023; originally announced May 2023.

  13. arXiv:2305.00818  [pdf]

    cs.GT cs.CY

    On-demand Mobility-as-a-Service platform assignment games with guaranteed stable outcomes

    Authors: Bingqing Liu, Joseph Y. J. Chow

    Abstract: Mobility-as-a-Service (MaaS) systems are two-sided markets, with two mutually exclusive sets of agents, i.e., travelers/users and operators, forming a mobility ecosystem in which multiple operators compete or cooperate to serve customers under a governing platform provider. This study proposes a MaaS platform equilibrium model based on many-to-many assignment games incorporating both fixed-route t…

    Submitted 21 June, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

  14. arXiv:2303.05126  [pdf, other]

    eess.IV cs.CV

    Hybrid Dual Mean-Teacher Network With Double-Uncertainty Guidance for Semi-Supervised Segmentation of MRI Scans

    Authors: Jiayi Zhu, Bart Bolsterlee, Brian V. Y. Chow, Yang Song, Erik Meijering

    Abstract: Semi-supervised learning has made significant progress in medical image segmentation. However, existing methods primarily utilize information acquired from a single dimensionality (2D/3D), resulting in sub-optimal performance on challenging data, such as magnetic resonance imaging (MRI) scans with multiple objects and highly anisotropic resolution. To address this issue, we present a Hybrid Dual M…

    Submitted 9 March, 2023; originally announced March 2023.

  15. arXiv:2302.10850  [pdf, other]

    cs.LG cs.AI cs.CL

    Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

    Authors: Dhawal Gupta, Yinlam Chow, Aza Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Reinforcement learning (RL) has shown great promise for developing dialogue management (DM) agents that are non-myopic, conduct rich conversations, and maximize overall user satisfaction. Despite recent developments in RL and language models (LMs), using RL to power conversational chatbots remains challenging, in part because RL requires online exploration to learn effectively, whereas collecting…

    Submitted 29 October, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  16. arXiv:2301.10545  [pdf, other]

    cs.SE cs.CR cs.PL

    Beware of the Unexpected: Bimodal Taint Analysis

    Authors: Yiu Wai Chow, Max Schäfer, Michael Pradel

    Abstract: Static analysis is a powerful tool for detecting security vulnerabilities and other programming problems. Global taint tracking, in particular, can spot vulnerabilities arising from complicated data flow across multiple functions. However, precisely identifying which flows are problematic is challenging, and sometimes depends on factors beyond the reach of pure program analysis, such as convention…

    Submitted 25 January, 2023; originally announced January 2023.

    Journal ref: International Symposium on Software Testing and Analysis (ISSTA), 2023

  17. arXiv:2212.14800  [pdf, other]

    cs.LG cs.AI

    A deep real options policy for sequential service region design and timing

    Authors: Srushti Rath, Joseph Y. J. Chow

    Abstract: As various city agencies and mobility operators navigate toward innovative mobility solutions, there is a need for strategic flexibility in well-timed investment decisions in the design and timing of mobility service regions, i.e. cast as "real options" (RO). This problem becomes increasingly challenging with multiple interacting RO in such investments. We propose a scalable machine learning based…

    Submitted 30 December, 2022; originally announced December 2022.

  18. arXiv:2212.00289  [pdf]

    cs.CY

    Dial-a-ride problem with modular platooning and en-route transfers

    Authors: Zhexi Fu, Joseph Y. J. Chow

    Abstract: Modular vehicles (MV) possess the ability to physically connect/disconnect with each other and travel in platoon with less energy consumption. A fleet of demand-responsive transit vehicles with such technology can serve passengers door to door or have vehicles deviate to platoon with each other to travel at lower cost and allow for en-route passenger transfers before splitting. A mixed integer lin…

    Submitted 23 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  19. arXiv:2208.02294  [pdf, other]

    cs.CL cs.LG

    Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

    Authors: Deborah Cohen, Moonkyung Ryu, Yinlam Chow, Orgad Keller, Ido Greenberg, Avinatan Hassidim, Michael Fink, Yossi Matias, Idan Szpektor, Craig Boutilier, Gal Elidan

    Abstract: Despite recent advances in natural language understanding and generation, and decades of research on the development of conversational bots, building automated agents that can carry on rich open-ended conversations with humans "in the wild" remains a formidable challenge. In this work we develop a real-time, open-ended dialogue system that uses reinforcement learning (RL) to power a bot's conversa…

    Submitted 25 July, 2022; originally announced August 2022.

  20. EMVLight: a Multi-agent Reinforcement Learning Framework for an Emergency Vehicle Decentralized Routing and Traffic Signal Control System

    Authors: Haoran Su, Yaofeng D. Zhong, Joseph Y. J. Chow, Biswadip Dey, Li Jin

    Abstract: Emergency vehicles (EMVs) play a crucial role in responding to time-critical calls such as medical emergencies and fire outbreaks in urban areas. Existing methods for EMV dispatch typically optimize routes based on historical traffic-flow data and design traffic signal pre-emption accordingly; however, we still lack a systematic methodology to address the coupling between EMV routing and traffic s…

    Submitted 29 June, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 19 figures, 10 tables. Manuscript extended on previous work arXiv:2109.05429, arXiv:2111.00278

    Journal ref: Transportation Research Part C: Emerging Technologies Volume 146, January 2023, 103955

  21. arXiv:2206.00059  [pdf, other]

    cs.CL cs.AI

    A Mixture-of-Expert Approach to RL-based Dialogue Management

    Authors: Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, MoonKyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Despite recent advancements in language models (LMs), their application to dialogue management (DM) problems and ability to carry on rich conversations remain a challenge. We use reinforcement learning (RL) to develop a dialogue agent that avoids being short-sighted (outputting generic utterances) and maximizes overall user satisfaction. Most existing RL approaches to DM train the agent at the wor…

    Submitted 31 May, 2022; originally announced June 2022.

  22. arXiv:2205.05138  [pdf, other]

    cs.LG

    Efficient Risk-Averse Reinforcement Learning

    Authors: Ido Greenberg, Yinlam Chow, Mohammad Ghavamzadeh, Shie Mannor

    Abstract: In risk-averse reinforcement learning (RL), the goal is to optimize some risk measure of the returns. A risk measure often focuses on the worst returns out of the agent's experience. As a result, standard methods for risk-averse RL often ignore high-return strategies. We prove that under certain conditions this inevitably leads to a local-optimum barrier, and propose a soft risk mechanism to bypas…

    Submitted 12 October, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS 2022

  23. arXiv:2204.05193  [pdf, other]

    cs.CL cs.LG

    Worldwide city transport typology prediction with sentence-BERT based supervised learning via Wikipedia

    Authors: Srushti Rath, Joseph Y. J. Chow

    Abstract: An overwhelming majority of the world's human population lives in urban areas and cities. Understanding a city's transportation typology is immensely valuable for planners and policy makers whose decisions can potentially impact millions of city residents. Despite the value of understanding a city's typology, labeled data (city and its typology) is scarce, and spans at most a few hundred cities i…

    Submitted 28 March, 2022; originally announced April 2022.

  24. arXiv:2202.04849  [pdf, other]

    cs.LG

    SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition

    Authors: Dylan Slack, Yinlam Chow, Bo Dai, Nevan Wichers

    Abstract: Methods that extract policy primitives from offline demonstrations using deep generative models have shown promise at accelerating reinforcement learning (RL) for new tasks. Intuitively, these methods should also help to train safe RL agents because they enforce useful skills. However, we identify these techniques are not well equipped for safe policy learning because they ignore negative experiences (…

    Submitted 30 June, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  25. arXiv:2202.02830  [pdf, other]

    cs.IR cs.AI cs.LG

    Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors

    Authors: Christina Göpfert, Alex Haig, Yinlam Chow, Chih-wei Hsu, Ivan Vendrov, Tyler Lu, Deepak Ramachandran, Hubert Pham, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Interactive recommender systems have emerged as a promising paradigm to overcome the limitations of the primitive user feedback used by traditional recommender systems (e.g., clicks, item consumption, ratings). They allow users to express intent, preferences, constraints, and contexts in a richer fashion, often using natural language (including faceted search and dialogue). Yet more research is ne…

    Submitted 2 June, 2023; v1 submitted 6 February, 2022; originally announced February 2022.

  26. A simulation sandbox to compare fixed-route, semi-flexible-transit, and on-demand microtransit system designs

    Authors: Gyugeun Yoon, Joseph Y. J. Chow, Srushti Rath

    Abstract: With advances in emerging technologies, options for operating public transit services have broadened from conventional fixed-route service through semi-flexible service to on-demand microtransit. Nevertheless, guidelines for deciding between these services remain limited in the real implementation. An open-source simulation sandbox is developed that can compare state-of-the-practice methods for ev…

    Submitted 19 January, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Journal ref: KSCE Journal of Civil Engineering 26, 3043-3062 (2022)

  27. A congested schedule-based dynamic transit passenger flow estimator using stop count data

    Authors: Qi Liu, Joseph Y. J. Chow

    Abstract: A dynamic transit flow estimation model based on congested schedule-based transit equilibrium assignment is proposed using observations from stop count data. A solution algorithm is proposed for the mathematical program with schedule-based transit equilibrium constraints (MPEC) with polynomial computational complexity. The equilibrium constraints corresponding to the schedule-based hyperpath flow…

    Submitted 16 August, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Journal ref: Transportmetrica B: Transport Dynamics (2022)

  28. An electric vehicle charging station access equilibrium model with M/D/C queueing

    Authors: Bingqing Liu, Theodoros P. Pantelidis, Stephanie Tam, Joseph Y. J. Chow

    Abstract: Despite the dependency of electric vehicle (EV) fleets on charging station availability, charging infrastructure remains limited in many cities. Three contributions are made. First, we propose an EV-to-charging station user equilibrium (UE) assignment model with a M/D/C queue approximation as a nondifferentiable nonlinear program. Second, to address the non-differentiability of the queue delay fun…

    Submitted 3 September, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Journal ref: International Journal of Sustainable Transportation (2022)

  29. arXiv:2012.00386  [pdf, other]

    cs.LG cs.AI

    Non-Stationary Latent Bandits

    Authors: Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Users of recommender systems often behave in a non-stationary fashion, due to their evolving preferences and tastes over time. In this work, we propose a practical approach for fast personalization to non-stationary users. The key idea is to frame this problem as a latent bandit, where the prototypical models of user behavior are learned offline and the latent state of the user is inferred online…

    Submitted 1 December, 2020; originally announced December 2020.

    Comments: 15 pages, 4 figures

  30. arXiv:2010.11652  [pdf, other]

    cs.LG stat.ML

    CoinDICE: Off-Policy Confidence Interval Estimation

    Authors: Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans

    Abstract: We study high-confidence behavior-agnostic off-policy evaluation in reinforcement learning, where the goal is to estimate a confidence interval on a target policy's value, given only access to a static experience dataset collected by unknown behavior policies. Starting from a function space embedding of the linear program formulation of the $Q$-function, we obtain an optimization problem with gene…

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020 as spotlight

  31. arXiv:2010.09648  [pdf]

    cs.MA cs.CV eess.IV physics.soc-ph

    Agent-based Simulation Model and Deep Learning Techniques to Evaluate and Predict Transportation Trends around COVID-19

    Authors: Ding Wang, Fan Zuo, Jingqin Gao, Yueshuai He, Zilin Bian, Suzana Duran Bernardes, Chaekuk Na, Jingxing Wang, John Petinos, Kaan Ozbay, Joseph Y. J. Chow, Shri Iyer, Hani Nassif, Xuegang Jeff Ban

    Abstract: The COVID-19 pandemic has affected travel behaviors and transportation system operations, and cities are grappling with what policies can be effective for a phased reopening shaped by social distancing. This edition of the white paper updates travel trends and highlights an agent-based simulation model's results to predict the impact of proposed phased reopening strategies. It also introduces a re…

    Submitted 23 September, 2020; originally announced October 2020.

  32. arXiv:2010.05150  [pdf, other]

    cs.CL cs.AI cs.LG cs.RO

    Safe Reinforcement Learning with Natural Language Constraints

    Authors: Tsung-Yen Yang, Michael Hu, Yinlam Chow, Peter J. Ramadge, Karthik Narasimhan

    Abstract: While safe reinforcement learning (RL) holds great promise for many practical applications like robotics or autonomous cars, current approaches require specifying constraints in mathematical form. Such specifications demand domain expertise, limiting the adoption of safe RL. In this paper, we propose learning to interpret natural language constraints for safe RL. To this end, we first introduce Ha…

    Submitted 3 August, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: The first two authors contributed equally

  33. arXiv:2009.14018  [pdf]

    physics.soc-ph cs.SI

    Toward the "New Normal": A Surge in Speeding, New Volume Patterns, and Recent Trends in Taxis/For-Hire Vehicles

    Authors: Jingqin Gao, Abhinav Bhattacharyya, Ding Wang, Nick Hudanich, Siva Sooryaa, Muruga Thambiran, Suzana Duran Bernardes, Chaekuk Na, Fan Zuo, Zilin Bian, Kaan Ozbay, Shri Iyer, Hani Nassif, Joseph Y. J. Chow

    Abstract: Six months into the pandemic and one month after the phase four reopening in New York City (NYC), restrictions are lifting, businesses and schools are reopening, but global infections are still rising. This white paper updates travel trends observed in the aftermath of the COVID-19 outbreak in NYC and highlights some findings toward the "new normal."

    Submitted 23 September, 2020; originally announced September 2020.

  34. arXiv:2008.04762  [pdf]

    physics.soc-ph cs.CY

    A validated multi-agent simulation test bed to evaluate congestion pricing policies on population segments by time-of-day in New York City

    Authors: Brian Yueshuai He, Jinkai Zhou, Ziyi Ma, Ding Wang, Di Sha, Mina Lee, Joseph Y. J. Chow, Kaan Ozbay

    Abstract: Evaluation of the demand for emerging transportation technologies and policies can vary by time of day due to spillbacks on roadways, rescheduling of travelers' activity patterns, and shifting to other modes that affect the level of congestion. These effects are not well-captured with static travel demand models. We calibrate and validate the first open-source multi-agent simulation model for New…

    Submitted 21 December, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Journal ref: Transport Policy 101 (2021) 145-161

  35. arXiv:2008.00335  [pdf, other]

    cs.AI cs.LG eess.SY

    V2I Connectivity-Based Dynamic Queue-Jump Lane for Emergency Vehicles: A Deep Reinforcement Learning Approach

    Authors: Haoran Su, Kejian Shi, Li Jin, Joseph Y. J. Chow

    Abstract: Emergency vehicle (EMV) service is a key function of cities and is exceedingly challenging due to urban traffic congestion. A main reason behind EMV service delay is the lack of communication and cooperation between vehicles blocking EMVs. In this paper, we study the improvement of EMV service under V2I connectivity. We consider the establishment of dynamic queue jump lanes (DQJLs) based on real-t…

    Submitted 29 May, 2021; v1 submitted 1 August, 2020; originally announced August 2020.

    Comments: 20 pages, 6 figures

  36. arXiv:2006.14518  [pdf]

    cs.GT cs.CY

    Mobility operator service capacity sharing contract design to risk-pool against network disruptions

    Authors: Theodoros P. Pantelidis, Joseph Y. J. Chow, Oded Cats

    Abstract: We propose a new mechanism to design risk-pooling contracts between operators to facilitate horizontal cooperation to mitigate those costs and improve service resilience during disruptions. We formulate a novel two-stage stochastic multicommodity flow model to determine the cost savings of a coalition under different disruption scenarios and solve it using L-shaped method along with sample average…

    Submitted 1 May, 2023; v1 submitted 25 June, 2020; originally announced June 2020.

  37. arXiv:2006.13408  [pdf, other]

    cs.LG cs.AI stat.ML

    Control-Aware Representations for Model-based Reinforcement Learning

    Authors: Brandon Cui, Yinlam Chow, Mohammad Ghavamzadeh

    Abstract: A major challenge in modern reinforcement learning (RL) is efficient control of dynamical systems from high-dimensional sensory observations. Learning controllable embedding (LCE) is a promising approach that addresses this challenge by embedding the observations into a lower-dimensional latent space, estimating the latent dynamics, and utilizing it to perform control in the latent space. Two impo…

    Submitted 23 June, 2020; originally announced June 2020.

  38. arXiv:2006.13368  [pdf]

    econ.GN cs.MA physics.soc-ph

    Impact of COVID-19 behavioral inertia on reopening strategies for New York City Transit

    Authors: Ding Wang, Brian Yueshuai He, Jingqin Gao, Joseph Y. J. Chow, Kaan Ozbay, Shri Iyer

    Abstract: The COVID-19 pandemic has affected travel behaviors and transportation system operations, and cities are grappling with what policies can be effective for a phased reopening shaped by social distancing. A baseline model was previously developed and calibrated for pre-COVID conditions as MATSim-NYC. A new COVID model is calibrated that represents travel behavior during the COVID-19 pandemic by reca…

    Submitted 11 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Journal ref: International Journal of Transportation Science & Technology 10(2) 197-211 (2021)

  39. arXiv:2006.08714  [pdf, other]

    cs.LG cs.AI stat.ML

    Latent Bandits Revisited

    Authors: Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier

    Abstract: A latent bandit problem is one in which the learning agent knows the arm reward distributions conditioned on an unknown discrete latent state. The primary goal of the agent is to identify the latent state, after which it can act optimally. This setting is a natural midpoint between online and offline learning---complex models can be learned offline with the agent identifying latent state online---…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 16 pages, 2 figures

  40. arXiv:2006.08236  [pdf, other]

    cs.LG cs.AI stat.ML

    Non-Stationary Off-Policy Optimization

    Authors: Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed

    Abstract: Off-policy learning is a framework for evaluating and optimizing policies without deploying them, from data collected by another policy. Real-world environments are typically non-stationary and the offline learned policies should adapt to these changes. To address this challenge, we study the novel problem of off-policy optimization in piecewise-stationary contextual bandits. Our proposed solution…

    Submitted 4 April, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: AISTATS 2021; 16 pages, 2 figures

  41. arXiv:2006.05443  [pdf, other]

    cs.LG cs.AI stat.ML

    Variational Model-based Policy Optimization

    Authors: Yinlam Chow, Brandon Cui, MoonKyung Ryu, Mohammad Ghavamzadeh

    Abstract: Model-based reinforcement learning (RL) algorithms allow us to combine model-generated data with those collected from interaction with the real system in order to alleviate the data efficiency problem in RL. However, designing such algorithms is often challenging because the bias in simulated data may overshadow the ease of data generation. A potential solution to this challenge is to jointly lear…

    Submitted 23 June, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  42. arXiv:2005.03465  [pdf]

    physics.soc-ph cs.CE stat.AP

    A stochastic user-operator assignment game for microtransit service evaluation: A case study of Kussbus in Luxembourg

    Authors: Tai-Yu Ma, Joseph Y. J. Chow, Sylvain Klein, Ziyi Ma

    Abstract: This paper proposes a stochastic variant of the stable matching model from Rasulkhani and Chow [1] which allows microtransit operators to evaluate their operation policy and resource allocations. The proposed model takes into account the stochastic nature of users' travel utility perception, resulting in a probabilistic stable operation cost allocation outcome to design ticket price and ridership…

    Submitted 8 April, 2020; originally announced May 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1912.01984

  43. arXiv:2003.01086  [pdf, other]

    cs.LG eess.SY stat.ML

    Predictive Coding for Locally-Linear Control

    Authors: Rui Shu, Tung Nguyen, Yinlam Chow, Tuan Pham, Khoat Than, Mohammad Ghavamzadeh, Stefano Ermon, Hung H. Bui

    Abstract: High-dimensional observations and unknown dynamics are major challenges when applying optimal control to many real-world decision making tasks. The Learning Controllable Embedding (LCE) framework addresses these challenges by embedding the observations into a lower dimensional latent space, estimating the latent dynamics, and then performing control directly in the latent space. To ensure the lear…

    Submitted 2 March, 2020; originally announced March 2020.

  44. arXiv:2003.01025  [pdf, other]

    cs.AI eess.SY

    Dynamic Queue-Jump Lane for Emergency Vehicles under Partially Connected Settings: A Multi-Agent Deep Reinforcement Learning Approach

    Authors: Haoran Su, Kejian Shi, Joseph Y. J. Chow, Li Jin

    Abstract: Emergency vehicle (EMV) service is a key function of cities and is exceedingly challenging due to urban traffic congestion. The main reason behind EMV service delay is the lack of communication and cooperation between vehicles blocking EMVs. In this paper, we study the improvement of EMV service under V2X connectivity. We consider the establishment of dynamic queue jump lanes (DQJLs) based on real…

    Submitted 15 January, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 42 pages, 13 figures, 7 tables

  45. arXiv:2002.05522  [pdf, other]

    cs.LG cs.AI stat.ML

    BRPO: Batch Residual Policy Optimization

    Authors: Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier

    Abstract: In batch reinforcement learning (RL), one often constrains a learned policy to be close to the behavior (data-generating) policy, e.g., by constraining the learned action distribution to differ from the behavior policy by some maximum degree that is the same at each state. This can cause batch RL to be overly conservative, unable to exploit large policy changes at frequently-visited, high-confiden…

    Submitted 28 March, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  46. arXiv:2001.07282  [pdf]

    cs.DS cs.CY math.OC

    A node-charge graph-based online carshare rebalancing policy with capacitated electric charging

    Authors: Theodoros P. Pantelidis, Li Li, Tai-Yu Ma, Joseph Y. J. Chow, Saif Eddin G. Jabari

    Abstract: Viability of electric car-sharing operations depends on rebalancing algorithms. Earlier methods in the literature suggest a trend toward non-myopic algorithms using queueing principles. We propose a new rebalancing policy using cost function approximation. The cost function is modeled as a p-median relocation problem with minimum cost flow conservation and path-based charging station capacities on…

    Submitted 14 March, 2021; v1 submitted 20 January, 2020; originally announced January 2020.

    Journal ref: Transportation Science (2021)

  47. Online route choice modeling for Mobility-as-a-Service networks with non-separable, congestible link capacity effects

    Authors: Susan Jia Xu, Joseph Y. J. Chow

    Abstract: With the prevalence of MaaS systems, route choice models need to consider characteristics unique to them. MaaS systems tend to involve service systems with fleets of vehicles; as a result, the available service capacity depends on the choices of other travelers in different parts of the system. We model this with a new concept of "congestible capacity"; that is, link capacities are a function of f…

    Submitted 9 July, 2021; v1 submitted 18 December, 2019; originally announced December 2019.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2021

  48. arXiv:1912.02074  [pdf, other]

    cs.LG cs.AI

    AlgaeDICE: Policy Gradient from Arbitrary Experience

    Authors: Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans

    Abstract: In many real-world applications of reinforcement learning (RL), interactions with the environment are limited due to cost or feasibility. This presents a challenge to traditional RL algorithms since the max-return objective involves an expectation over on-policy samples. We introduce a new formulation of max-return optimization that allows the problem to be re-expressed by an expectation over an a…

    Submitted 4 December, 2019; originally announced December 2019.

  49. A many-to-many assignment game and stable outcome algorithm to evaluate collaborative Mobility-as-a-Service platforms

    Authors: Theodoros P. Pantelidis, Joseph Y. J. Chow, Saeid Rasulkhani

    Abstract: As Mobility as a Service (MaaS) systems become increasingly popular, travel is changing from unimodal trips to personalized services offered by a platform of mobility operators. Evaluation of MaaS platforms depends on modeling both user route decisions as well as operator service and pricing decisions. We adopt a new paradigm for traffic assignment in a MaaS network of multiple operators using the…

    Submitted 28 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Journal ref: Transportation Research Part B 104 (2020) 79-100

  50. arXiv:1911.03779  [pdf]

    physics.soc-ph cs.LG stat.ML

    Empirical validation of network learning with taxi GPS data from Wuhan, China

    Authors: Susan Jia Xu, Qian Xie, Joseph Y. J. Chow, Xintao Liu

    Abstract: In prior research, a statistically cheap method was developed to monitor transportation network performance by using only a few groups of agents without having to forecast the population flows. The current study validates this "multi-agent inverse optimization" method using taxi GPS probe data from the city of Wuhan, China. Using a controlled 2062-link network environment and different GPS data pr…

    Submitted 17 August, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

    Journal ref: IEEE Intelligent Transportation Systems Magazine 13(1) (2021) 42-58