Skip to main content

Showing 1–50 of 265 results for author: Alvarez, J

  1. arXiv:2407.07276  [pdf, other

    cs.CV cs.AI

    Exploring Camera Encoder Designs for Autonomous Driving Perception

    Authors: Barath Lakshmanan, Joshua Chen, Shiyi Lan, Maying Shen, Zhiding Yu, Jose M. Alvarez

    Abstract: The cornerstone of autonomous vehicles (AV) is a solid perception system, where camera encoders play a crucial role. Existing works usually leverage pre-trained Convolutional Neural Networks (CNN) or Vision Transformers (ViTs) designed for general vision tasks, such as image classification, segmentation, and 2D detection. Although those well-known architectures have achieved state-of-the-art accur… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2406.12079  [pdf, other

    cs.CV cs.AI cs.LG

    Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint

    Authors: Xinglong Sun, Barath Lakshmanan, Maying Shen, Shiyi Lan, Jingde Chen, Jose Alvarez

    Abstract: As we push the boundaries of performance in various vision tasks, the models grow in size correspondingly. To keep up with this growth, we need very aggressive pruning techniques for efficient inference and deployment on edge devices. Existing pruning approaches are limited to channel pruning and struggle with aggressive parameter reductions. In this paper, we propose a novel multi-dimensional pru… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under Review

  3. arXiv:2406.11556  [pdf

    astro-ph.EP astro-ph.IM

    PLATO's signal and noise budget

    Authors: Anko Börner, Carsten Paproth, Juan Cabrera, Martin Pertenais, Heike Rauer, J. Miguel Mas-Hesse, Isabella Pagano, Jose Lorenzo Alvarez, Anders Erikson, Denis Grießbach, Yves Levillain, Demetrio Magrin, Valery Mogulsky, Sami-Matias Niemi, Thibaut Prod'homme, Sara Regibo, Joris De Ridder, Steve Rockstein, Reza Samadi, Dimitri Serrano-Velarde, Alan Smith, Peter Verhoeve, Dave Walton

    Abstract: ESA's PLATO mission aims the detection and characterization of terrestrial planets around solar-type stars as well as the study of host star properties. The noise-to-signal ratio (NSR) is the main performance parameter of the PLATO instrument, which consists of 24 Normal Cameras and 2 Fast Cameras. In order to justify, verify and breakdown NSR-relevant requirements the software simulator PINE was… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 17 pages, 8 figures, 3 tables

  4. arXiv:2406.11218  [pdf

    cs.CL cs.AI

    Building another Spanish dictionary, this time with GPT-4

    Authors: Miguel Ortega-Martín, Óscar García-Sierra, Alfonso Ardoiz, Juan Carlos Armenteros, Ignacio Garrido, Jorge Álvarez, Camilo Torrón, Iñigo Galdeano, Ignacio Arranz, Oleg Vorontsov, Adrián Alonso

    Abstract: We present the "Spanish Built Factual Freectianary 2.0" (Spanish-BFF-2) as the second iteration of an AI-generated Spanish dictionary. Previously, we developed the inaugural version of this unique free dictionary employing GPT-3. In this study, we aim to improve the dictionary by using GPT-4-turbo instead. Furthermore, we explore improvements made to the initial version and compare the performance… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.06978  [pdf, other

    cs.CV

    Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

    Authors: Zhenxin Li, Kailin Li, Shihao Wang, Shiyi Lan, Zhiding Yu, Yishen Ji, Zhiqi Li, Ziyue Zhu, Jan Kautz, Zuxuan Wu, Yu-Gang Jiang, Jose M. Alvarez

    Abstract: We propose Hydra-MDP, a novel paradigm employing multiple teachers in a teacher-student model. This approach uses knowledge distillation from both human and rule-based teachers to train the student model, which features a multi-head decoder to learn diverse trajectory candidates tailored to various evaluation metrics. With the knowledge of rule-based teachers, Hydra-MDP learns how the environment… ▽ More

    Submitted 19 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: The 1st place solution of End-to-end Driving at Scale at the CVPR 2024 Autonomous Grand Challenge

  6. arXiv:2406.04484  [pdf, ps, other

    cs.CV

    Step Out and Seek Around: On Warm-Start Training with Incremental Data

    Authors: Maying Shen, Hongxu Yin, Pavlo Molchanov, Lei Mao, Jose M. Alvarez

    Abstract: Data often arrives in sequence over time in real-world deep learning applications such as autonomous driving. When new training data is available, training the model from scratch undermines the benefit of leveraging the learned knowledge, leading to significant training costs. Warm-starting from a previously trained checkpoint is the most intuitive way to retain knowledge and advance learning. How… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2405.20153  [pdf, other

    quant-ph

    Decoherence-assisted quantum key distribution

    Authors: Daniel R. Sabogal, Daniel F. Urrego, Juan Rafael Álvarez, Andrés F. Herrera, Juan P. Torres, Alejandra Valencia

    Abstract: We present a theoretical and experimental study of a controllable decoherence-assisted quantum key distribution scheme. Our method is based on the possibility of introducing controllable decoherence to polarization qubits using the spatial degree of freedom of light. We show that our method reduces the amount of information that an eavesdropper can obtain in the BB84 protocol under the entangling… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  8. arXiv:2405.18902  [pdf, other

    cs.LG cs.AI stat.ML

    A Causal Framework for Evaluating Deferring Systems

    Authors: Filippo Palomba, Andrea Pugnana, José Manuel Alvarez, Salvatore Ruggieri

    Abstract: Deferring systems extend supervised Machine Learning (ML) models with the possibility to defer predictions to human experts. However, evaluating the impact of a deferring strategy on system accuracy is still an overlooked area. This paper fills this gap by evaluating deferring systems through a causal lens. We link the potential outcomes framework for causal inference with deferring systems. This… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  9. arXiv:2405.17187  [pdf, other

    cs.CV cs.AI cs.RO

    Memorize What Matters: Emergent Scene Decomposition from Multitraverse

    Authors: Yiming Li, Zehong Wang, Yue Wang, Zhiding Yu, Zan Gojcic, Marco Pavone, Chen Feng, Jose M. Alvarez

    Abstract: Humans naturally retain memories of permanent elements, while ephemeral moments often slip through the cracks of memory. This selective retention is crucial for robotic perception, localization, and mapping. To endow robots with this capability, we introduce 3D Gaussian Mapping (3DGM), a self-supervised, camera-only offline mapping framework grounded in 3D Gaussian Splatting. 3DGM converts multitr… ▽ More

    Submitted 29 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Project page: https://3d-gaussian-mapping.github.io; Code and data: https://github.com/NVlabs/3DGM

  10. arXiv:2405.13693  [pdf, ps, other

    cs.LG

    Uncovering Algorithmic Discrimination: An Opportunity to Revisit the Comparator

    Authors: Jose M. Alvarez, Salvatore Ruggieri

    Abstract: Causal reasoning, in particular, counterfactual reasoning plays a central role in testing for discrimination. Counterfactual reasoning materializes when testing for discrimination, what is known as the counterfactual model of discrimination, when we compare the discrimination comparator with the discrimination complainant, where the comparator is a similar (or similarly situated) profile to that o… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  11. arXiv:2405.13493  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.GA

    Euclid. III. The NISP Instrument

    Authors: Euclid Collaboration, K. Jahnke, W. Gillard, M. Schirmer, A. Ealet, T. Maciaszek, E. Prieto, R. Barbier, C. Bonoli, L. Corcione, S. Dusini, F. Grupp, F. Hormuth, S. Ligori, L. Martin, G. Morgante, C. Padilla, R. Toledo-Moreo, M. Trifoglio, L. Valenziano, R. Bender, F. J. Castander, B. Garilli, P. B. Lilje, H. -W. Rix , et al. (412 additional authors not shown)

    Abstract: The Near-Infrared Spectrometer and Photometer (NISP) on board the Euclid satellite provides multiband photometry and R>=450 slitless grism spectroscopy in the 950-2020nm wavelength range. In this reference article we illuminate the background of NISP's functional and calibration requirements, describe the instrument's integral components, and provide all its key properties. We also sketch the proc… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Paper submitted as part of the A&A special issue 'Euclid on Sky', which contains Euclid key reference papers and first results from the Euclid Early Release Observations

  12. arXiv:2405.13492  [pdf, other

    astro-ph.IM astro-ph.CO

    Euclid. II. The VIS Instrument

    Authors: Euclid Collaboration, M. Cropper, A. Al-Bahlawan, J. Amiaux, S. Awan, R. Azzollini, K. Benson, M. Berthe, J. Boucher, E. Bozzo, C. Brockley-Blatt, G. P. Candini, C. Cara, R. A. Chaudery, R. E. Cole, P. Danto, J. Denniston, A. M. Di Giorgio, B. Dryer, J. Endicott, J. -P. Dubois, M. Farina, E. Galli, L. Genolet, J. P. D. Gow , et al. (403 additional authors not shown)

    Abstract: This paper presents the specification, design, and development of the Visible Camera (VIS) on the ESA Euclid mission. VIS is a large optical-band imager with a field of view of 0.54 deg^2 sampled at 0.1" with an array of 609 Megapixels and spatial resolution of 0.18". It will be used to survey approximately 14,000 deg^2 of extragalactic sky to measure the distortion of galaxies in the redshift ran… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Paper submitted as part of the A&A special issue `Euclid on Sky', which contains Euclid key reference papers and first results from the Euclid Early Release Observations

  13. arXiv:2405.13491  [pdf, other

    astro-ph.CO astro-ph.GA astro-ph.IM

    Euclid. I. Overview of the Euclid mission

    Authors: Euclid Collaboration, Y. Mellier, Abdurro'uf, J. A. Acevedo Barroso, A. Achúcarro, J. Adamek, R. Adam, G. E. Addison, N. Aghanim, M. Aguena, V. Ajani, Y. Akrami, A. Al-Bahlawan, A. Alavi, I. S. Albuquerque, G. Alestas, G. Alguero, A. Allaoui, S. W. Allen, V. Allevato, A. V. Alonso-Tetilla, B. Altieri, A. Alvarez-Candal, A. Amara, L. Amendola , et al. (1086 additional authors not shown)

    Abstract: The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Paper submitted as part of the A&A special issue`Euclid on Sky'

  14. arXiv:2405.08975  [pdf, other

    stat.ML cs.LG math.ST

    A distribution-free valid p-value for finite samples of bounded random variables

    Authors: Joaquin Alvarez

    Abstract: We build a valid p-value based on a concentration inequality for bounded random variables introduced by Pelekis, Ramon and Wang. The motivation behind this work is the calibration of predictive algorithms in a distribution-free setting. The super-uniform p-value is tighter than Hoeffding and Bentkus alternatives in certain regions. Even though we are motivated by a calibration setting in a machine… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: -

  15. arXiv:2405.01542  [pdf

    physics.ins-det

    Materials research for hiper laser fusion facilities: chamber wall, structural material and final optics

    Authors: J. Alvarez, A. Rivera, R. Gonzalez-Arrabal, D. Garoz, E. Del Rio, J. M. Perlado

    Abstract: The European HiPER project aims to demonstrate commercial viability of inertial fusion energy within the following two decades. This goal requires an extensive Research & Development program on materials for different applications (e.g., first wall, structural components and final optics). In this paper we will discuss our activities in the framework of HiPER to develop materials studies for the d… ▽ More

    Submitted 11 February, 2024; originally announced May 2024.

    Journal ref: Fusion Science and Technology, vol. 60, n. 2, pp. 565-569, 2011

  16. arXiv:2405.01533  [pdf, other

    cs.CV

    OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

    Authors: Shihao Wang, Zhiding Yu, Xiaohui Jiang, Shiyi Lan, Min Shi, Nadine Chang, Jan Kautz, Ying Li, Jose M. Alvarez

    Abstract: The advances in multimodal large language models (MLLMs) have led to growing interests in LLM-based autonomous driving agents to leverage their strong reasoning capabilities. However, capitalizing on MLLMs' strong reasoning capabilities for improved planning behavior is challenging since planning requires full 3D situational awareness beyond 2D reasoning. To address this challenge, our work propos… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  17. arXiv:2404.14908  [pdf, other

    cs.CV

    Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

    Authors: Hoang Chuong Nguyen, Tianyu Wang, Jose M. Alvarez, Miaomiao Liu

    Abstract: This paper focuses on self-supervised monocular depth estimation in dynamic scenes trained on monocular videos. Existing methods jointly estimate pixel-wise depth and motion, relying mainly on an image reconstruction loss. Dynamic regions1 remain a critical challenge for these methods due to the inherent ambiguity in depth and motion estimation, resulting in inaccurate depth estimation. This paper… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  18. arXiv:2404.01990  [pdf, other

    cs.CV

    What is Point Supervision Worth in Video Instance Segmentation?

    Authors: Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, Jose M. Alvarez, Abhinav Shrivastava, Anima Anandkumar

    Abstract: Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos. Conventional VIS methods rely on densely-annotated object masks which are expensive. We reduce the human annotations to only one point for each object in a video frame during training, and obtain high-quality mask predictions close to fully supervised models. Our proposed train… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  19. arXiv:2403.09230  [pdf, other

    cs.CV

    Improving Distant 3D Object Detection Using 2D Box Supervision

    Authors: Zetong Yang, Zhiding Yu, Chris Choy, Renhao Wang, Anima Anandkumar, Jose M. Alvarez

    Abstract: Improving the detection of distant 3d objects is an important yet challenging task. For camera-based 3D perception, the annotation of 3d bounding relies heavily on LiDAR for accurate depth information. As such, the distance of annotation is often limited due to the sparsity of LiDAR points on distant objects, which hampers the capability of existing detectors for long-range scenarios. We address t… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  20. arXiv:2403.08708  [pdf, other

    cond-mat.soft

    The importance of stretching rate in achieving true stress relaxation in the elasto-capillary thinning of dilute solutions

    Authors: Ann Aisling, Renee Saraka, Nicolas J. Alvarez

    Abstract: This work focuses on inferring the molecular state of the polymer chain required to induce elasto-capillary stress relaxation and the accurate measure of the polymer relaxation time in uniaxial stretching of dilute polymer solutions. This work is facilitated by the discovery that constant velocity applied at early times leads to initial constant extension rate before reaching the Rayleigh-Plateau… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 27 pages, 9 figures

  21. arXiv:2403.03538  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    RADIA -- Radio Advertisement Detection with Intelligent Analytics

    Authors: Jorge Álvarez, Juan Carlos Armenteros, Camilo Torrón, Miguel Ortega-Martín, Alfonso Ardoiz, Óscar García, Ignacio Arranz, Íñigo Galdeano, Ignacio Garrido, Adrián Alonso, Fernando Bayón, Oleg Vorontsov

    Abstract: Radio advertising remains an integral part of modern marketing strategies, with its appeal and potential for targeted reach undeniably effective. However, the dynamic nature of radio airtime and the rising trend of multiple radio spots necessitates an efficient system for monitoring advertisement broadcasts. This study investigates a novel automated radio advertisement detection technique incorpor… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  22. Silica final lens performance in laser fusion facilities: HiPER and LIFE

    Authors: David Garoz, R. González-Arrabal, R. Juárez, J. Álvarez, J. Sanz, J. M. Perlado, A. Rivera

    Abstract: Nowadays, the projects LIFE (Laser Inertial Fusion Energy) in USA and HiPER (High Power Laser Energy Research) in Europe are the most advanced ones to demonstrate laser fusion energy viability. One of the main points of concern to properly achieve ignition is the performance of the final optics (lenses) under the severe irradiation conditions that take place in fusion facilities. In this paper, we… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Journal ref: Nuclear Fusion, vol. 53, no. 1, p. 013010, Jan. 2013

  23. Evaluation of Zadoff-Chu, Kasami and Chirp based encoding schemes for Acoustic Local Positioning Systems

    Authors: Santiago Murano, Carmen Perez-Rubio, David Gualda, Fernando J. Alvarez, Teodoro Aguilera, Carlos de Marziani

    Abstract: The task of determining the physical coordinates of a target in indoor environments is still a key factor for many applications including people and robot navigation, user tracking, location-based advertising, augmented reality, gaming, emergency response or ambient assisted living environments. Among the different possibilities for indoor positioning, Acoustic Local Positioning Systems (ALPS) hav… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Journal ref: IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 8, pp. 5356-5368, Aug. 2020

  24. Multipath Compensation Algorithm for TDMA-Based Ultrasonic Local Positioning Systems

    Authors: Teodoro Aguilera, Fernando J. Alvarez, David Gualda, Jose M. Villadangos, Alvaro Hernandez, Jesus Urena

    Abstract: This paper proposes a multipath compensation algorithm (MCA) to enhance the performance of an ultrasonic local positioning system under adverse multipath conditions. The proposed algorithm is based on the accurate estimation of the environment impulse response from which the corresponding line of sight for each channel is obtained. Experimental results in two different environments and with differ… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Journal ref: IEEE Transactions on Instrumentation and Measurement, vol. 67, no. 5, pp. 984-991, 2018

  25. arXiv:2402.02384  [pdf

    eess.SP cs.AR cs.SD eess.AS

    Acoustic Local Positioning With Encoded Emission Beacons

    Authors: Jesus Urena, Alvaro Hernandez, Juan Jesus Garcia, Jose Manuel Villadangos, Maria del Carmen Perez, David Gualda, Fernando J. Alvarez, Teodoro Aguilera

    Abstract: Acoustic local positioning systems (ALPSs) are an interesting alternative for indoor positioning due to certain advantages over other approaches, including their relatively high accuracy, low cost, and room-level signal propagation. Centimeter-level or fine-grained indoor positioning can be an asset for robot navigation, guiding a person to, for instance, a particular piece in a museum or to a spe… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the IEEE, vol. 106, no. 6, pp. 1042-1062, Jun. 2018

  26. arXiv:2401.13408  [pdf, other

    cs.AI cs.CY cs.HC

    Causal Perception

    Authors: Jose M. Alvarez, Salvatore Ruggieri

    Abstract: Perception occurs when two individuals interpret the same information differently. Despite being a known phenomenon with implications for bias in decision-making, as individual experience determines interpretation, perception remains largely overlooked in machine learning (ML) research. Modern decision flows, whether partially or fully automated, involve human experts interacting with ML applicati… ▽ More

    Submitted 22 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.09535 by other authors

  27. arXiv:2401.13378  [pdf

    physics.optics cond-mat.mes-hall

    Tunable circular dichroism through absorption in coupled optical modes of twisted triskelia nanostructures

    Authors: Javier Rodriguez Alvarez, Antonio Garcia Martin, Arantxa Fraile Rodriguez, Xavier Batlle, Amilcar Labarta

    Abstract: We present a system consisting of two stacked chiral plasmonic nanoelements, so-called triskelia, that exhibits a high degree of circular dichroism. The optical modes arising from the interactions between the two elements are the main responsible for the dichroic signal. Their excitation in the absorption cross section is favored when the circular polarization of the light is opposite to the helic… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Journal ref: Scientific Reports 12 (2022) 26

  28. arXiv:2401.12702  [pdf

    physics.optics cond-mat.mes-hall

    Imaging of Antiferroelectric Dark Modes in an Inverted Plasmonic Lattice

    Authors: Javier Rodriguez Alvarez, Amilcar Labarta, Juan Carlos Idrobo, Rossana Dell Anna, Alessandro Cian, Damiano Giubertoni, Xavier Borrise, Albert Guerrero, Francesc Perez Murano, Arantxa Fraile Rodriguez, Xavier Batlle

    Abstract: Plasmonic lattice nanostructures are of technological interest because of their capacity to manipulate light below the diffraction limit. Here, we present a detailed study of dark and bright modes in the visible and near-infrared energy regime of an inverted plasmonic honeycomb lattice by a combination of Au+ focused ion beam lithography with nanometric resolution, optical and electron spectroscop… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Journal ref: ACS Nano 17 (2023) 8123

  29. arXiv:2401.03844  [pdf, other

    cs.CV

    Fully Attentional Networks with Self-emerging Token Labeling

    Authors: Bingyin Zhao, Zhiding Yu, Shiyi Lan, Yutao Cheng, Anima Anandkumar, Yingjie Lao, Jose M. Alvarez

    Abstract: Recent studies indicate that Vision Transformers (ViTs) are robust against out-of-distribution scenarios. In particular, the Fully Attentional Network (FAN) - a family of ViT backbones, has achieved state-of-the-art robustness. In this paper, we revisit the FAN models and improve their pre-training with a self-emerging token labeling (STL) framework. Our method contains a two-stage training framew… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 5585-5595

  30. Dataset of turbulent flow over interacting barchan dunes

    Authors: Jimmy Gabriel Alvarez, Danilo da Silva Borges, Erick de Moraes Franklin

    Abstract: Barchans are dunes commonly found in dune fields on Earth, Mars and other celestial bodies, where they can interact with each other. This article concerns experimental data for the flow over subaqueous barchans that are either isolated or interacting with each other. The experiments were carried out in a transparent channel of rectangular cross section in which turbulent water flows were imposed o… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Journal ref: Data in Brief, 52, 109978 (2024) - invited article

  31. arXiv:2312.03031  [pdf, other

    cs.CV

    Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

    Authors: Zhiqi Li, Zhiding Yu, Shiyi Lan, Jiahan Li, Jan Kautz, Tong Lu, Jose M. Alvarez

    Abstract: End-to-end autonomous driving recently emerged as a promising research direction to target autonomy from a full-stack perspective. Along this line, many of the latest works follow an open-loop evaluation setting on nuScenes to study the planning behavior. In this paper, we delve deeper into the problem by conducting thorough analyses and demystifying more devils in the details. We initially observ… ▽ More

    Submitted 2 June, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Accept to cvpr 2024

  32. arXiv:2312.01696  [pdf, other

    cs.CV

    BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

    Authors: Zhenxin Li, Shiyi Lan, Jose M. Alvarez, Zuxuan Wu

    Abstract: Recently, the rise of query-based Transformer decoders is reshaping camera-based 3D object detection. These query-based decoders are surpassing the traditional dense BEV (Bird's Eye View)-based methods. However, we argue that dense BEV frameworks remain important due to their outstanding abilities in depth estimation and object localization, depicting 3D scenes accurately and comprehensively. This… ▽ More

    Submitted 24 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  33. arXiv:2311.14671  [pdf, other

    cs.CV

    SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

    Authors: Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M. Alvarez, Zuxuan Wu, Yu-Gang Jiang

    Abstract: In-context segmentation aims at segmenting novel images using a few labeled example images, termed as "in-context examples", exploring content similarities between examples and the target. The resulting models can be generalized seamlessly to novel segmentation tasks, significantly reducing the labeling and training costs compared with conventional pipelines. However, in-context segmentation is mo… ▽ More

    Submitted 29 March, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  34. arXiv:2310.20437  [pdf, other

    hep-ph astro-ph.CO astro-ph.IM hep-ex

    A forecast of the sensitivity of the DALI Experiment to Galactic axion dark matter

    Authors: Juan F. Hernández Cabrera, Javier De Miguel, Enrique Joven Álvarez, E. Hernández-Suárez, J. Alberto Rubiiño-Martín, Chiko Otani

    Abstract: The axion is a long-postulated boson that can simultaneously solve two fundamental problems of modern physics: the charge-parity symmetry problem in the strong interaction and the enigma of dark matter. In this work we estimate, by means of Monte Carlo simulations, the sensitivity of the Dark-photons$\&$Axion-Like particles Interferometer (DALI), a new-generation Fabry-Pérot haloscope proposed to… ▽ More

    Submitted 15 January, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: As accepted by Symmetry

  35. arXiv:2310.19731  [pdf, other

    cs.CV cs.AI cs.LG

    ViR: Towards Efficient Vision Retention Backbones

    Authors: Ali Hatamizadeh, Michael Ranzinger, Shiyi Lan, Jose M. Alvarez, Sanja Fidler, Jan Kautz

    Abstract: Vision Transformers (ViTs) have attracted a lot of popularity in recent years, due to their exceptional capabilities in modeling long-range spatial dependencies and scalability for large scale training. Although the training parallelism of self-attention mechanism plays an important role in retaining great performance, its quadratic complexity baffles the application of ViTs in many scenarios whic… ▽ More

    Submitted 26 January, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Introduction of Vision Retention Networks (ViR) for Efficient Visual Modeling

  36. arXiv:2310.17525  [pdf, other

    physics.ed-ph physics.optics quant-ph

    Measuring Wigner functions of quantum states of light in the undergraduate laboratory

    Authors: Juan-Rafael Álvarez, Andrés Martínez Silva, Alejandra Valencia

    Abstract: In this work, we present an educational activity aimed at measuring the Wigner distribution functions of quantum states of light in the undergraduate laboratory. This project was conceived by students from various courses within the physics undergraduate curriculum, and its outcomes were used in an introductory Quantum Optics course at the Universidad de los Andes in Bogotá, Colombia. The activity… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 10 pages, 5 figures

  37. arXiv:2310.05509  [pdf, other

    math.DS math.CA

    Quartic rigid systems in the plane and in the Poincaré sphere

    Authors: M. J. Álvarez, J. L. Bravo, L. A. Calderón

    Abstract: We consider the planar family of rigid systems of the form $x'=-y+xP(x,y), y'=x+yP(x,y)$, where $P$ is any polynomial with monomials of degree one and three. This is the simplest non-trivial family of rigid systems with no rotatory parameters. The family can be compactified to the Poincaré sphere such that the vector field along the equator is not identically null. We study the centers, singular… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 19 pages, 10 figures

  38. arXiv:2309.05192  [pdf, other

    cs.CV

    Towards Viewpoint Robustness in Bird's Eye View Segmentation

    Authors: Tzofi Klinghoffer, Jonah Philion, Wenzheng Chen, Or Litany, Zan Gojcic, Jungseock Joo, Ramesh Raskar, Sanja Fidler, Jose M. Alvarez

    Abstract: Autonomous vehicles (AV) require that neural networks used for perception be robust to different viewpoints if they are to be deployed across many types of vehicles without the repeated cost of data collection and labeling for each. AV companies typically focus on collecting data from diverse scenarios and locations, but not camera rig configurations, due to cost. As a result, only a small number… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: ICCV 2023. Project Page: https://nvlabs.github.io/viewpoint-robustness

  39. arXiv:2308.04556  [pdf, other

    cs.CV

    FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

    Authors: Yilun Chen, Zhiding Yu, Yukang Chen, Shiyi Lan, Animashree Anandkumar, Jiaya Jia, Jose Alvarez

    Abstract: False negatives (FN) in 3D object detection, {\em e.g.}, missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies \textit{FN} in a multi-stage manner… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  40. arXiv:2308.02236  [pdf, other

    cs.CV

    FB-BEV: BEV Representation from Forward-Backward View Transformations

    Authors: Zhiqi Li, Zhiding Yu, Wenhai Wang, Anima Anandkumar, Tong Lu, Jose M. Alvarez

    Abstract: View Transformation Module (VTM), where transformations happen between multi-view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera-based BEV perception systems. Currently, the two most prominent VTM paradigms are forward projection and backward projection. Forward projection, represented by Lift-Splat-Shoot, leads to sparsely projected BEV features without post-pr… ▽ More

    Submitted 17 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: Accept to ICCV 2023, camera-ready version

  41. arXiv:2307.15398  [pdf, other

    cs.LG cs.CY

    The Initial Screening Order Problem

    Authors: Jose M. Alvarez, Antonio Mastropietro, Salvatore Ruggieri

    Abstract: We investigate the role of the initial screening order (ISO) in candidate screening processes, such as employee hiring and academic admissions. The ISO refers to the order in which the screener evaluates the candidate pool. It has been largely overlooked in the literature, despite its potential impact on the optimality and fairness of the chosen set, especially under a human screener. We define tw… ▽ More

    Submitted 24 April, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

  42. arXiv:2307.04106  [pdf, other

    cs.CV

    Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye View

    Authors: Jiayu Yang, Enze Xie, Miaomiao Liu, Jose M. Alvarez

    Abstract: Recent vision-only perception models for autonomous driving achieved promising results by encoding multi-view image features into Bird's-Eye-View (BEV) space. A critical step and the main bottleneck of these methods is transforming image features into the BEV coordinate frame. This paper focuses on leveraging geometry information, such as depth, to model such feature transformation. Existing works… ▽ More

    Submitted 11 July, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

  43. Monitoring Electron Spin Fluctuations with Paramagnetic Relaxation Enhancement

    Authors: Daniel Jardon Alvarez, Tahel Malka, Johan van Tol, Yishay Feldman, Raanan Carmieli, Michal Leskes

    Abstract: The magnetic interactions between the spin of an unpaired electron and the surrounding nuclear spins can be exploited to gain structural information, to reduce nuclear relaxation times as well as to create nuclear hyperpolarization via dynamic nuclear polarization (DNP). A central aspect that determines how these interactions manifest from the point of view of NMR is the timescale of the fluctuati… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Journal ref: Journal of Magnetic Resonance Journal of Magnetic Resonance Volume 336, March 2022, 107143

  44. arXiv:2307.01492  [pdf, other

    cs.CV cs.RO

    FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation

    Authors: Zhiqi Li, Zhiding Yu, David Austin, Mingsheng Fang, Shiyi Lan, Jan Kautz, Jose M. Alvarez

    Abstract: This technical report summarizes the winning solution for the 3D Occupancy Prediction Challenge, which is held in conjunction with the CVPR 2023 Workshop on End-to-End Autonomous Driving and CVPR 23 Workshop on Vision-Centric Autonomous Driving Workshop. Our proposed solution FB-OCC builds upon FB-BEV, a cutting-edge camera-based bird's-eye view perception design using forward-backward projection.… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Outstanding Champion and Innovation Award in the 3D Occupancy Prediction Challenge (CVPR23)

  45. arXiv:2306.14306  [pdf, other

    cs.LG cs.CV

    Adaptive Sharpness-Aware Pruning for Robust Sparse Networks

    Authors: Anna Bair, Hongxu Yin, Maying Shen, Pavlo Molchanov, Jose Alvarez

    Abstract: Robustness and compactness are two essential attributes of deep learning models that are deployed in the real world. The goals of robustness and compactness may seem to be at odds, since robustness requires generalization across domains, while the process of compression exploits specificity in one domain. We introduce Adaptive Sharpness-Aware Pruning (AdaSAP), which unifies these goals through the… ▽ More

    Submitted 13 March, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

  46. arXiv:2306.06189  [pdf, other

    cs.CV cs.AI cs.LG

    FasterViT: Fast Vision Transformers with Hierarchical Attention

    Authors: Ali Hatamizadeh, Greg Heinrich, Hongxu Yin, Andrew Tao, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov

    Abstract: We design a new family of hybrid CNN-ViT neural networks, named FasterViT, with a focus on high image throughput for computer vision (CV) applications. FasterViT combines the benefits of fast local representation learning in CNNs and global modeling properties in ViT. Our newly introduced Hierarchical Attention (HAT) approach decomposes global self-attention with quadratic complexity into a multi-… ▽ More

    Submitted 1 April, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: ICLR'24 Accepted Paper

  47. GHz sample excitation at the ALBA-PEEM

    Authors: Muhammad Waqas Khaliq, José M. Álvarez, Antonio Camps, Nahikari González, José Ferrer, Ana Martinez-Carboneres, Jordi Prat, Sandra Ruiz-Gómez, Miguel Angel Niño, Ferran Macià, Lucia Aballe, Michael Foerster

    Abstract: We describe a setup that is used for high-frequency electrical sample excitation in a cathode lens electron microscope with the sample stage at high voltage as used in many synchrotron light sources. Electrical signals are transmitted by dedicated high-frequency components to the printed circuit board supporting the sample. Sub-miniature push-on connectors (SMP) are used to realize the connection… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Journal ref: Ultramicroscopy 2023

  48. Bursts of polarised single photons from atom-cavity sources

    Authors: Jan Ole Ernst, Juan-Rafael Alvarez, Thomas D. Barrett, Axel Kuhn

    Abstract: Photonic qubits play an instrumental role in the development of advanced quantum technologies, including quantum networking, boson sampling and measurement based quantum computing. A promising framework for the deterministic production of indistinguishable single photons is an atomic emitter coupled to a single mode of a high finesse optical cavity. Polarisation control is an important cornerstone… ▽ More

    Submitted 25 August, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Journal ref: Journal of Physics B: Atomic, Molecular and Optical Physics, Volume 56, Number 20, 2023

  49. arXiv:2304.06271  [pdf

    astro-ph.HE

    A Contribution of the HAWC Observatory to the TeV era in the High Energy Gamma-Ray Astrophysics: The case of the TeV-Halos

    Authors: Ramiro Torres-Escobedo, Hao Zhou, Eduardo de la Fuente, A. U. Abeysekara, A. Albert, R. Alfaro, C. Alvarez, J. D. Álvarez, J. R. Angeles Camacho, J. C. Arteaga-Velázquez, K. P. Arunbabu, D. Avila Rojas, H. A. Ayala Solares, R. Babu, V. Baghmanyan, A. S. Barber, J. Becerra Gonzalez, E. Belmont-Moreno, S. Y. BenZvi, D. Berley, C. Brisbois, K. S. Caballero-Mora, T. Capistrán, A. Carramiñana, S. Casanova , et al. (108 additional authors not shown)

    Abstract: We present a short overview of the TeV-Halos objects as a discovery and a relevant contribution of the High Altitude Water Čerenkov (HAWC) observatory to TeV astrophysics. We discuss history, discovery, knowledge, and the next step through a new and more detailed analysis than the original study in 2017. TeV-Halos will contribute to resolving the problem of the local positron excess observed on th… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Work presented in the 21st International Symposium on Very High Energy Cosmic Ray Interactions(ISVHECRI 2022) as part of the Ph. D. Thesis of Ramiro Torres-Escobedo (SJTU, Shanghai, China). Accepted for publication in SciPost Physics Proceedings (ISSN 2666-4003). 11 pages, 3 Figures. Short overview of HAWC and TeV Halos objects until 2022

  50. The James Webb Space Telescope Mission

    Authors: Jonathan P. Gardner, John C. Mather, Randy Abbott, James S. Abell, Mark Abernathy, Faith E. Abney, John G. Abraham, Roberto Abraham, Yasin M. Abul-Huda, Scott Acton, Cynthia K. Adams, Evan Adams, David S. Adler, Maarten Adriaensen, Jonathan Albert Aguilar, Mansoor Ahmed, Nasif S. Ahmed, Tanjira Ahmed, Rüdeger Albat, Loïc Albert, Stacey Alberts, David Aldridge, Mary Marsha Allen, Shaune S. Allen, Martin Altenburg , et al. (983 additional authors not shown)

    Abstract: Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted by PASP for the special issue on The James Webb Space Telescope Overview, 29 pages, 4 figures