Skip to main content

Showing 1–50 of 397 results for author: Hashimoto, T

  1. arXiv:2407.08351  [pdf, other

    cs.CL cs.LG

    AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models

    Authors: Xiang Lisa Li, Evan Zheran Liu, Percy Liang, Tatsunori Hashimoto

    Abstract: Evaluation is critical for assessing capabilities, tracking scientific progress, and informing model selection. In this paper, we present three desiderata for a good benchmark for language models: (i) salience (e.g., knowledge about World War II is more salient than a random day in history), (ii) novelty (i.e., the benchmark reveals new trends in model rankings not shown by previous benchmarks), a… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: preprint

  2. arXiv:2407.07977  [pdf, other

    physics.atom-ph

    Few-electron highly charged muonic Ar atoms verified by electronic $K$ x rays

    Authors: T. Okumura, T. Azuma, D. A. Bennett, W. B. Doriese, M. S. Durkin, J. W. Fowler, J. D. Gard, T. Hashimoto, R. Hayakawa, Y. Ichinohe, P. Indelicato, T. Isobe, S. Kanda, D. Kato, M. Katsuragawa, N. Kawamura, Y. Kino, N. Kominato, Y. Miyake, K. M. Morgan, H. Noda, G. C. O'Neil, S. Okada, K. Okutsu, N. Paul , et al. (18 additional authors not shown)

    Abstract: Electronic $K$ x rays emitted by muonic Ar atoms in the gas phase were observed using a superconducting transition-edge-sensor microcalorimeter. The high-precision energy spectra provided a clear signature of the presence of muonic atoms accompanied by a few electrons, which have never been observed before. One-, two-, and three-electron bound, i.e., H-like, He-like, and Li-like, muonic Ar atoms w… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2407.04620  [pdf, other

    cs.LG cs.AI cs.CL

    Learning to (Learn at Test Time): RNNs with Expressive Hidden States

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen, Xiaolong Wang, Sanmi Koyejo, Tatsunori Hashimoto, Carlos Guestrin

    Abstract: Self-attention performs well in long context but has quadratic complexity. Existing RNN layers have linear complexity, but their performance in long context is limited by the expressive power of their hidden state. We propose a new class of sequence modeling layers with linear complexity and an expressive hidden state. The key idea is to make the hidden state a machine learning model itself, and t… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2407.01889  [pdf, other

    astro-ph.GA

    ALMA reveals spatially-resolved properties of molecular gas in the host galaxy of FRB 20191001A at z = 0.2340

    Authors: Itsuki Yamanaka, Bunyo Hatsukade, Fumi Egusa, Tetsuya Hashimoto, Yuu Niino, Tzu-Yin Hsu, Hiroyuki Kaneko, Kotaro Kohno

    Abstract: We report the detection of the CO(2-1) emission line with a spatial resolution of 0.9 arcsec ($3.5 \mathrm{kpc}$) from the host galaxy of the fast radio burst (FRB), FRB 20191001A at $z=0.2340$, using the Atacama Large Millimeter/submillimeter Array. This is the first detection of spatially resolved CO emission from the host galaxy of an FRB at a cosmological distance. The inferred molecular gas m… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 10 pages, 7 figures, 3 tables

  5. arXiv:2407.01023  [pdf, other

    cs.LG

    DistML.js: Installation-free Distributed Deep Learning Framework for Web Browsers

    Authors: Masatoshi Hidaka, Tomohiro Hashimoto, Yuto Nishizawa, Tatsuya Harada

    Abstract: We present "DistML.js", a library designed for training and inference of machine learning models within web browsers. Not only does DistML.js facilitate model training on local devices, but it also supports distributed learning through communication with servers. Its design and define-by-run API for deep learning model construction resemble PyTorch, thereby reducing the learning curve for prototyp… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  6. arXiv:2406.19439  [pdf, other

    astro-ph.GA

    Gas conditions of a star-formation selected sample in the first billion years

    Authors: Tom J. L. C. Bakx, Hiddo S. B. Algera, Bram Venemans, Laura Sommovigo, Seiji Fujimoto, Stefano Carniani, Masato Hagimoto, Takuya Hashimoto, Akio K. Inoue, Dragan Salak, Stephen Serjeant, Livia Vallini, Stephen Eales, Andrea Ferrara, Yoshinobu Fudamoto, Chihiro Imamura, Shigeki Inoue, Kirsten K. Knudsen, Hiroshi Matsuo, Yuma Sugahara, Yoichi Tamura, Akio Taniguchi, Satoshi Yamanaka

    Abstract: We present Atacama Large Millimetre/submillimetre Array (ALMA) observations of the [O$_{\rm III}$] 88 $μ$m emission of a sample of thirteen galaxies at $z$ = 6 to 7.6 selected as [C$_{\rm II}$]-emitting companion sources of quasars. To disentangle the origins of the luminous Oxygen line in the $z$ > 6 Universe, we looked at emission-line galaxies that are selected through an excellent star-formati… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 20 pages; 13 figures; accepted for publication in MNRAS

  7. arXiv:2406.14888  [pdf, other

    astro-ph.GA astro-ph.CO

    Finding dusty AGNs from the JWST CEERS survey with mid-infrared photometry

    Authors: Tom C. -C. Chien, Chih-Teng Ling, Tomotsugu Goto, Cossas K. -W. Wu, Seong Jin Kim, Tetsuya Hashimoto, Yu-Wei Lin, Ece Kilerci, Simon C. -C. Ho, Po-Ya Wang, Bjorn Jasper R. Raquel

    Abstract: The nature of the interaction between active galactic nuclei (AGNs) and their host galaxies remains an unsolved question. Therefore, conducting an AGN census is valuable to AGN research. Nevertheless, a significant fraction of AGNs are obscured by their environment, which blocks UV and optical emissions due to the dusty torus surrounding the central supermassive black hole (SMBH). To overcome this… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 15 pages, 20 figures, 4 tables. Accepted for publication in MNRAS. The 3 min summary: https://www.youtube.com/watch?v=mWUebbgUOh8

  8. arXiv:2406.14785  [pdf, other

    cs.CL cs.LG

    Understanding Finetuning for Factual Knowledge Extraction

    Authors: Gaurav Ghosal, Tatsunori Hashimoto, Aditi Raghunathan

    Abstract: In this work, we study the impact of QA fine-tuning data on downstream factuality. We show that fine-tuning on lesser-known facts that are poorly stored during pretraining yields significantly worse factuality than fine-tuning on well-known facts, even when all facts are seen during pretraining. We prove this phenomenon theoretically, showing that training on lesser-known facts can lead the model… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  9. arXiv:2406.07975  [pdf, other

    astro-ph.IM

    FINER: Far-Infrared Nebular Emission Receiver for the Large Millimeter Telescope

    Authors: Yoichi Tamura, Takeshi Sakai, Ryohei Kawabe, Takafumi Kojima, Akio Taniguchi, Tatsuya Takekoshi, Haoran Kang, Wenlei Shan, Masato Hagimoto, Norika Okauchi, Airi Tetsuka, Akio K. Inoue, Kotaro Kohno, Kunihiko Tanaka, Tom J. L. C. Bakx, Yoshinobu Fudamoto, Kazuyuki Fujita, Yuichi Harikane, Takuya Hashimoto, Bunyo Hatsukade, David H. Hughes, Takahiro Iino, Yuki Kimura, Hiroyuki Maezawa, Yuichi Matsuda , et al. (12 additional authors not shown)

    Abstract: Unveiling the emergence and prevalence of massive/bright galaxies during the epoch of reionization and beyond, within the first 600 million years of the Universe, stands as a pivotal pursuit in astronomy. Remarkable progress has been made by JWST in identifying an immense population of bright galaxies, which hints at exceptionally efficient galaxy assembly processes. However, the underlying physic… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 12 pages, 8 figures, and 3 tables. Proceedings paper presented in SPIE Astronomical Telescope and Instrumentation 2024

  10. arXiv:2405.20456  [pdf, other

    cs.LG

    Scaling Laws for the Value of Individual Data Points in Machine Learning

    Authors: Ian Covert, Wenlong Ji, Tatsunori Hashimoto, James Zou

    Abstract: Recent works have shown that machine learning models improve at a predictable rate with the total amount of training data, leading to scaling laws that describe the relationship between error and dataset size. These scaling laws can help design a model's training dataset, but they typically take an aggregate view of the data by only considering the dataset's size. We introduce a new perspective by… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera-ready

  11. arXiv:2405.10938  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Observational Scaling Laws and the Predictability of Language Model Performance

    Authors: Yangjun Ruan, Chris J. Maddison, Tatsunori Hashimoto

    Abstract: Understanding how language model performance varies with scale is critical to benchmark and algorithm development. Scaling laws are one approach to building this understanding, but the requirement of training models across many different scales has limited their use. We propose an alternative, observational approach that bypasses model training and instead builds scaling laws from ~80 publically a… ▽ More

    Submitted 2 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  12. arXiv:2404.10770  [pdf, other

    astro-ph.GA

    Unveiling the Cosmic Gems Arc at $z\sim10.2$ with JWST

    Authors: Larry D. Bradley, Angela Adamo, Eros Vanzella, Keren Sharon, Gabriel Brammer, Dan Coe, Jose M. Diego, Vasily Kokorev, Guillaume Mahler, Masamune Oguri, Abdurro'uf, Rachana Bhatawdekar, Lise Christensen, Seiji Fujimoto, Takuya Hashimoto, Tiger Y. -Y Hsiao, Akio K. Inoue, Yolanda Jiménez-Teja, Matteo Messa, Colin Norman, Massimo Ricotti, Yoichi Tamura, Rogier A. Windhorst, Xinfeng Xu, Adi Zitrin

    Abstract: We present recent JWST NIRCam imaging observations of SPT0615-JD (also known as the Cosmic Gems Arc), lensed by the galaxy cluster SPT-CL J0615-5746. The 5-arcsec-long arc is the most highly magnified $z>10$ galaxy known, straddling the lensing critical curve and revealing five star clusters with radii $\sim 1$ pc or less. We measure the full arc to have F200W 24.5 AB mag, consisting of two mirror… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 22 pages, 8 figures, 4 tables, submitted to ApJ

  13. arXiv:2404.04500  [pdf, other

    cs.CR cs.AI cs.CY cs.LG

    Trustless Audits without Revealing Data or Models

    Authors: Suppakit Waiwitlikhit, Ion Stoica, Yi Sun, Tatsunori Hashimoto, Daniel Kang

    Abstract: There is an increasing conflict between business incentives to hide models and data as trade secrets, and the societal need for algorithmic transparency. For example, a rightsholder wishing to know whether their copyrighted works have been used during training must convince the model provider to allow a third party to audit the model and data. Finding a mutually agreeable third party is difficult,… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  14. arXiv:2404.04475  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

    Authors: Yann Dubois, Balázs Galambosi, Percy Liang, Tatsunori B. Hashimoto

    Abstract: LLM-based auto-annotators have become a key component of the LLM development process due to their cost-effectiveness and scalability compared to human-based evaluation. However, these auto-annotators can introduce complex biases that are hard to remove. Even simple, known confounders such as preference for longer outputs remain in existing automated evaluation metrics. We propose a simple regressi… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  15. arXiv:2404.01773  [pdf, other

    nucl-ex nucl-th

    Measurement of the mesonic decay branch of the $\bar{K}\!N\!N$ quasi-bound state

    Authors: T. Yamaga, S. Ajimura, H. Asano, G. Beer, H. Bhang, M. Bragadireanu, P. Buehler, L. Busso, M. Cargnelli, S. Choi, C. Curceanu, S. Enomoto, H. Fujioka, Y. Fujiwara, T. Fukuda, C. Guaraldo, T. Hashimoto, R. S. Hayano, T. Hiraiwa, M. Iio, M. Iliescu, K. Inoue, Y. Ishiguro, T. Ishikawa, S. Ishimoto , et al. (45 additional authors not shown)

    Abstract: We conducted measurements of $K^- + {^3{\rm He}} \to π\!Y \!N + N'$ reactions using a $1~{\rm GeV}/c$ $K^-$-beam, with the objective of understanding the broad decay width of $\bar{K} \!N \!N$ (approximately twice as broad as that of $Λ(1405)$ considered to be the $\bar{K} \!N$ quasi-bound state). We successfully reproduced distributions of the $π\! Y \! N$ invariant mass and momentum transfer for… ▽ More

    Submitted 2 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  16. arXiv:2404.00474  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Linguistic Calibration of Long-Form Generations

    Authors: Neil Band, Xuechen Li, Tengyu Ma, Tatsunori Hashimoto

    Abstract: Language models (LMs) may lead their users to make suboptimal downstream decisions when they confidently hallucinate. This issue can be mitigated by having the LM verbally convey the probability that its claims are correct, but existing models cannot produce long-form text with calibrated confidence statements. Through the lens of decision-making, we define linguistic calibration for long-form gen… ▽ More

    Submitted 4 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: ICML 2024. Code available at https://github.com/tatsu-lab/linguistic_calibration

  17. arXiv:2403.17133  [pdf, other

    astro-ph.GA

    RIOJA. Complex Dusty Starbursts in a Major Merger B14-65666 at z=7.15

    Authors: Yuma Sugahara, Javier Álvarez-Márquez, Takuya Hashimoto, Luis Colina, Akio K. Inoue, Luca Costantin, Yoshinobu Fudamoto, Ken Mawatari, Yi W. Ren, Santiago Arribas, Tom J. L. C. Bakx, Carmen Blanco-Prieto, Daniel Ceverino, Alejandro Crespo Gómez, Masato Hagimoto, Takeshi Hashigaya, Rui Marques-Chaves, Hiroshi Matsuo, Yurina Nakazato, Miguel Pereira-Santaella, Yoichi Tamura, Mitsutaka Usui, Naoki Yoshida

    Abstract: We present JWST NIRCam imaging of B14-65666 ("Big Three Dragons"), a bright Lyman-break galaxy system ($M_\text{UV}=-22.5$ mag) at $z=7.15$. The high angular resolution of NIRCam reveals the complex morphology of two galaxy components: galaxy E has a compact core (E-core), surrounded by diffuse, extended, rest-frame optical emission, which is likely to be tidal tails; and galaxy W has a clumpy and… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 18 pages, 6 figures, 4 tables. Submitted to ApJ

  18. arXiv:2402.16827  [pdf, other

    cs.CL cs.LG

    A Survey on Data Selection for Language Models

    Authors: Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

    Abstract: A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as the quality of available text data can vary. Filtering out data can also decrease the carbon footprint and financial costs of training models by reducing the am… ▽ More

    Submitted 8 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Paper list available at https://github.com/alon-albalak/data-selection-survey

  19. arXiv:2402.10978  [pdf, other

    cs.LG cs.AI cs.CL

    Language Models with Conformal Factuality Guarantees

    Authors: Christopher Mohri, Tatsunori Hashimoto

    Abstract: Guaranteeing the correctness and factuality of language model (LM) outputs is a major open problem. In this work, we propose conformal factuality, a framework that can ensure high probability correctness guarantees for LMs by connecting language modeling and conformal prediction. We observe that the correctness of an LM output is equivalent to an uncertainty quantification problem, where the uncer… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  20. arXiv:2402.05386  [pdf, other

    astro-ph.GA astro-ph.CO

    Exploring the faintest end of mid-infrared luminosity functions up to $z\simeq 5$ with the JWST CEERS survey

    Authors: Chih-Teng Ling, Tomotsugu Goto, Seong Jin Kim, Cossas K. -W. Wu, Tetsuya Hashimoto, Tom C. -C. Chien, Yu-Wei Lin, Simon C. -C. Ho, Ece Kilerci

    Abstract: Mid-infrared (MIR) light from galaxies is sensitive to dust-obscured star-formation activities because it traces the characteristic emission of dust heated by young, massive stars. By constructing the MIR luminosity functions (LFs), we are able to quantify the overall dusty star formation history and the evolution of galaxies over cosmic time. In this work, we report the first rest-frame MIR LFs a… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 22 pages, 22 figures, 7 tables. Accepted for publication in MNRAS. A summary video can be found at https://youtu.be/TRb6bjmGfOU

  21. arXiv:2401.15866  [pdf, other

    cs.LG

    Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution

    Authors: Ian Covert, Chanwoo Kim, Su-In Lee, James Zou, Tatsunori Hashimoto

    Abstract: Many tasks in explainable machine learning, such as data valuation and feature attribution, perform expensive computation for each data point and can be intractable for large datasets. These methods require efficient approximations, and learning a network that directly predicts the desired output, which is commonly known as amortization, is a promising solution. However, training such models with… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  22. arXiv:2401.10005  [pdf, other

    cs.CV cs.CL

    Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

    Authors: Kohei Uehara, Nabarun Goswami, Hanqin Wang, Toshiaki Baba, Kohtaro Tanaka, Tomohiro Hashimoto, Kai Wang, Rei Ito, Takagi Naoya, Ryo Umagami, Yingyi Wen, Tanachai Anakewat, Tatsuya Harada

    Abstract: The increasing demand for intelligent systems capable of interpreting and reasoning about visual content requires the development of Large Multi-Modal Models (LMMs) that are not only accurate but also have explicit reasoning capabilities. This paper presents a novel approach to imbue an LMM with the ability to conduct explicit reasoning based on visual content and textual instructions. We introduc… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  23. arXiv:2401.03224  [pdf, other

    astro-ph.GA

    Bound star clusters observed in a lensed galaxy 460 Myr after the Big Bang

    Authors: Angela Adamo, Larry D. Bradley, Eros Vanzella, Adélaïde Claeyssens, Brian Welch, Jose M Diego, Guillaume Mahler, Masamune Oguri, Keren Sharon, Abdurro'uf, Tiger Yu-Yang Hsiao, Xinfeng Xu, Matteo Messa, Augusto E. Lassen, Erik Zackrisson, Gabriel Brammer, Dan Coe, Vasily Kokorev, Massimo Ricotti, Adi Zitrin, Seiji Fujimoto, Akio K. Inoue, Tom Resseguier, Jane R. Rigby, Yolanda Jiménez-Teja , et al. (3 additional authors not shown)

    Abstract: The Cosmic Gems arc is among the brightest and highly magnified galaxies observed at redshift $z\sim10.2$. However, it is an intrinsically UV faint galaxy, in the range of those now thought to drive the reionization of the Universe. Hitherto the smallest features resolved in a galaxy at a comparable redshift are between a few hundreds and a few tens of parsecs. Here we report JWST observations of… ▽ More

    Submitted 12 June, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Accepted for publication

  24. arXiv:2401.01087  [pdf

    cond-mat.stat-mech

    Electron transfer channel in the sugar recognition system assembled on nano gold particle

    Authors: Takayuki Goto, Takeshi Hashimoto, Kai Sato, Yukihiro Kitamoto, Takashi Hayashita, Satoshi Iguchi, Takahiko Sasaki, Dita Puspita Sari, Isao Watanabe

    Abstract: Existence of 1D spin diffusion in the electrochemical sugar recognition system consisting of a nano-sized gold particle (GNP), a ruthenium complex and a phenylboronic acid was investigated by NMR and muSR. When sugar molecules are recognized by the phenylboronic site, the response of electrochemical voltammetry of the Ru site changes, enabling the system to work as a sensitive sugar-sensor. In thi… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  25. arXiv:2401.01043  [pdf, other

    astro-ph.GA astro-ph.CO

    Polycyclic aromatic hydrocarbon (PAH) luminous galaxies in JWST CEERS data

    Authors: Yu-Wei Lin, Cossas K. -W. Wu, Chih-Teng Ling, Tomotsugu Goto, Seong Jin Kim, Ece Kilerci, Tetsuya Hashimoto, Po-Ya Wang, Simon C. -C. Ho, Tiger Yu-Yang Hsiao, Bjorn Jasper R. Raquel, Yuri Uno

    Abstract: It has been an unanswered question how many dusty galaxies have been undetected from the state-of-the-art observational surveys. JWST enables us to detect faint IR galaxies that have prominent polycyclic aromatic hydrocarbon (PAH) features in the mid-IR wavelengths. PAH is a valuable tracer of star formation and dust properties in the mid-infrared wavelength. The JWST Cosmic Evolution Early Releas… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 12 pages, 20 figures, 4 tables. Accepted by MNRAS. A summary video is at https://www.youtube.com/watch?v=UtPaVTFM4f8&ab_channel=NTHUCosmology

  26. arXiv:2312.04469  [pdf, other

    cs.LG cs.CL cs.CR

    On the Learnability of Watermarks for Language Models

    Authors: Chenchen Gu, Xiang Lisa Li, Percy Liang, Tatsunori Hashimoto

    Abstract: Watermarking of language model outputs enables statistical detection of model-generated text, which can mitigate harms and misuses of language models. Existing watermarking strategies operate by altering the decoder of an existing language model. In this paper, we ask whether language models can directly learn to generate watermarked text, which would have significant implications for the real-wor… ▽ More

    Submitted 2 May, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted at ICLR 2024

  27. arXiv:2312.02090  [pdf, other

    astro-ph.CO astro-ph.GA

    Cosmic star-formation history and black hole accretion history inferred from the JWST mid-infrared source counts

    Authors: Seong Jin Kim, Tomotsugu Goto, Chih-Teng Ling, Cossas K. -W. Wu, Tetsuya Hashimoto, Ece Kilerci, Simon C. -C. Ho, Yuri Uno, Po-Ya Wang, Yu-Wei Lin

    Abstract: With the advent of the James Webb Space Telescope (JWST), extra-galactic source count studies were conducted down to sub-microJy in the mid-infrared (MIR), which is several tens of times fainter than what the previous-generation infrared (IR) telescopes achieved in the MIR. In this work, we aim to interpret the JWST source counts and constrain cosmic star-formation history (CSFH) and black hole ac… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 15 pages, 12 figures, published in MNRAS, https://doi.org/10.1093/mnras/stad3499. A summary video is https://youtu.be/Md6wragrYyM

  28. arXiv:2312.01707  [pdf, other

    cs.HC

    Perceptual Dimensions of Physical Properties of Handheld Objects Induced by Impedance Changes

    Authors: Takeru Hashimoto, Shigeo Yoshida, Takuji Narumi

    Abstract: Haptics in virtual reality is the emerging dimension after audiovisual experiences. Researchers designed several handheld VR controllers to simulate haptic experiences in virtual reality environments. Some of these devices, equipped to deliver active force, can dynamically alter the timing and intensity of force feedback, potentially offering a wide array of haptic sensations. Past research primar… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  29. arXiv:2312.00782  [pdf, other

    astro-ph.HE astro-ph.SR

    Quantifying chaos and randomness in magnetar bursts

    Authors: Shotaro Yamasaki, Ersin Gogus, Tetsuya Hashimoto

    Abstract: In this study, we explore the dynamical stability of magnetar bursts within the context of the chaos-randomness phase space for the first time, aiming to uncover unique behaviors compared to various astrophysical transients, including fast radio bursts (FRBs). We analyze burst energy time series data from active magnetar sources SGR J1550-5418 and SGR J1935+2154, focusing on burst arrival time and… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figures, accepted for publication in MNRAS Letters

  30. arXiv:2312.00364  [pdf, other

    cs.LG cs.CV

    Benchmarking Multi-Domain Active Learning on Image Classification

    Authors: Jiayi Li, Rohan Taori, Tatsunori B. Hashimoto

    Abstract: Active learning aims to enhance model performance by strategically labeling informative data points. While extensively studied, its effectiveness on large-scale, real-world datasets remains underexplored. Existing research primarily focuses on single-source data, ignoring the multi-domain nature of real-world data. We introduce a multi-domain active learning benchmark to bridge this gap. Our bench… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  31. arXiv:2311.16857  [pdf, other

    astro-ph.GA

    SERENADE II: An ALMA Multi-Band Dust-Continuum Analysis of 28 Galaxies at $5<z<8$ and the Physical Origin of the Dust Temperature Evolution

    Authors: Ikki Mitsuhashi, Yuichi Harikane, Franz E. Bauer, Tom Bakx, Andrea Ferrara, Seiji Fujimoto, Takuya Hashimoto, Akio K. Inoue, Kazushi Iwasawa, Yuri Nishimura, Masatoshi Imanishi, Yoshiaki Ono, Toshiki Saito, Yuma Sugahara, Hideki Umehata, Livia Vallini, Tao Wang

    Abstract: We present an analysis of ALMA multi-band dust-continuum observations for 28 spectroscopically-confirmed bright Lyman-break galaxies at $5<z<8$. Our sample consists of 11 galaxies at $z\sim6$ newly observed in our ALMA program, which substantially increases the number of $5<z<8$ galaxies with both rest-frame 88 and 158 $μ{\rm m}$ continuum observations, allowing us to simultaneously measure the IR… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Submitted to ApJ

  32. arXiv:2311.05553  [pdf, other

    cs.CL cs.AI

    Removing RLHF Protections in GPT-4 via Fine-Tuning

    Authors: Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, Daniel Kang

    Abstract: As large language models (LLMs) have increased in their capabilities, so does their potential for dual use. To reduce harmful outputs, produces and vendors of LLMs have used reinforcement learning with human feedback (RLHF). In tandem, LLM vendors have been increasingly enabling fine-tuning of their most powerful models. However, concurrent work has shown that fine-tuning can remove RLHF protectio… ▽ More

    Submitted 5 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024. (7 pages)

  33. arXiv:2310.19677  [pdf, other

    cs.CL

    MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

    Authors: Allen Nie, Yuhui Zhang, Atharva Amdekar, Chris Piech, Tatsunori Hashimoto, Tobias Gerstenberg

    Abstract: Human commonsense understanding of the physical and social world is organized around intuitive theories. These theories support making causal and moral judgments. When something bad happens, we naturally ask: who did what, and why? A rich literature in cognitive science has studied people's causal and moral intuitions. This work has revealed a number of factors that systematically influence people… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 34 pages, 7 figures. NeurIPS 2023

  34. arXiv:2310.18413  [pdf, other

    cs.LG cs.AI stat.ML

    On the Fairness ROAD: Robust Optimization for Adversarial Debiasing

    Authors: Vincent Grari, Thibault Laugel, Tatsunori Hashimoto, Sylvain Lamprier, Marcin Detyniecki

    Abstract: In the field of algorithmic fairness, significant attention has been put on group fairness criteria, such as Demographic Parity and Equalized Odds. Nevertheless, these objectives, measured as global averages, have raised concerns about persistent local disparities between sensitive groups. In this work, we address the problem of local fairness, which ensures that the predictor is unbiased not only… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 23 pages, 10 figures

  35. arXiv:2310.17623  [pdf, other

    cs.CL cs.LG

    Proving Test Set Contamination in Black Box Language Models

    Authors: Yonatan Oren, Nicole Meister, Niladri Chatterji, Faisal Ladhak, Tatsunori B. Hashimoto

    Abstract: Large language models are trained on vast amounts of internet data, prompting concerns and speculation that they have memorized public benchmarks. Going from speculation to proof of contamination is challenging, as the pretraining data used by proprietary models are often not publicly accessible. We show that it is possible to provide provable guarantees of test set contamination in language model… ▽ More

    Submitted 23 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

  36. arXiv:2310.13807  [pdf, other

    cs.LG

    Learning to (Learn at Test Time)

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Chloe Hsu, Sanmi Koyejo, Carlos Guestrin, Xiaolong Wang, Tatsunori Hashimoto, Xinlei Chen

    Abstract: We reformulate the problem of supervised learning as learning to learn with two nested loops (i.e. learning problems). The inner loop learns on each individual instance with self-supervision before final prediction. The outer loop learns the self-supervised task used by the inner loop, such that its final prediction improves. Our inner loop turns out to be equivalent to linear attention when the i… ▽ More

    Submitted 7 January, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Fixed a few small typos

  37. arXiv:2310.01846  [pdf, other

    cs.CL cs.LG

    Benchmarking and Improving Generator-Validator Consistency of Language Models

    Authors: Xiang Lisa Li, Vaishnavi Shrivastava, Siyan Li, Tatsunori Hashimoto, Percy Liang

    Abstract: As of September 2023, ChatGPT correctly answers "what is 7+8" with 15, but when asked "7+8=15, True or False" it responds with "False". This inconsistency between generating and validating an answer is prevalent in language models (LMs) and erodes trust. In this paper, we propose a framework for measuring the consistency between generation and validation (which we call generator-validator consiste… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: preprint

  38. arXiv:2309.15817  [pdf, other

    cs.AI cs.CL cs.LG

    Identifying the Risks of LM Agents with an LM-Emulated Sandbox

    Authors: Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto

    Abstract: Recent advances in Language Model (LM) agents and tool use, exemplified by applications like ChatGPT Plugins, enable a rich set of capabilities but also amplify potential risks - such as leaking private data or causing financial losses. Identifying these risks is labor-intensive, necessitating implementing the tools, setting up the environment for each test scenario manually, and finding risky cas… ▽ More

    Submitted 17 May, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  39. arXiv:2309.14337  [pdf, other

    astro-ph.HE astro-ph.CO

    The true fraction of repeating fast radio bursts revealed through CHIME source count evolution

    Authors: Shotaro Yamasaki, Tomotsugu Goto, Chih-Teng Ling, Tetsuya Hashimoto

    Abstract: Fast Radio Bursts (FRBs) are classified into repeaters and non-repeaters, with only a few percent of the observed FRB population from the Canadian Hydrogen Intensity Mapping Experiment (CHIME) confirmed as repeaters. However, this figure represents only a lower limit due to the observational biases, and the true fraction of repeaters remains unknown. Correcting for these biases uncovers a notable… ▽ More

    Submitted 12 December, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: 10 pages, 10 figures, MNRAS in press, updated to match the accepted version

  40. arXiv:2309.07875  [pdf, other

    cs.CL

    Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

    Authors: Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou

    Abstract: Training large language models to follow instructions makes them perform better on a wide range of tasks and generally become more helpful. However, a perfectly helpful model will follow even the most malicious instructions and readily generate harmful content. In this paper, we raise concerns over the safety of models that only emphasize helpfulness, not harmlessness, in their instruction-tuning.… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  41. Identifying and Mitigating the Security Risks of Generative AI

    Authors: Clark Barrett, Brad Boyd, Elie Burzstein, Nicholas Carlini, Brad Chen, Jihye Choi, Amrita Roy Chowdhury, Mihai Christodorescu, Anupam Datta, Soheil Feizi, Kathleen Fisher, Tatsunori Hashimoto, Dan Hendrycks, Somesh Jha, Daniel Kang, Florian Kerschbaum, Eric Mitchell, John Mitchell, Zulfikar Ramzan, Khawaja Shams, Dawn Song, Ankur Taly, Diyi Yang

    Abstract: Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well… ▽ More

    Submitted 28 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Journal ref: Foundations and Trends in Privacy and Security 6 (2023) 1-52

  42. Accelerating Aggregation Queries on Unstructured Streams of Data

    Authors: Matthew Russo, Tatsunori Hashimoto, Daniel Kang, Yi Sun, Matei Zaharia

    Abstract: Analysts and scientists are interested in querying streams of video, audio, and text to extract quantitative insights. For example, an urban planner may wish to measure congestion by querying the live feed from a traffic camera. Prior work has used deep neural networks (DNNs) to answer such queries in the batch setting. However, much of this work is not suited for the streaming setting because it… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 14 pages, 11 figures, to be published in Proceedings of the VLDB Endowment, Vol. 16, No. 11

    Journal ref: PVLDB, 16(11): 2897 - 2910, 2023

  43. arXiv:2308.04635  [pdf

    cs.CY cs.AI

    Where's the Liability in Harmful AI Speech?

    Authors: Peter Henderson, Tatsunori Hashimoto, Mark Lemley

    Abstract: Generative AI, in particular text-based "foundation models" (large models trained on a huge variety of information including the internet), can generate speech that could be problematic under a wide range of liability regimes. Machine learning practitioners regularly "red team" models to identify and mitigate such problematic speech: from "hallucinations" falsely accusing people of serious miscond… ▽ More

    Submitted 16 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Published in the Journal of Free Speech Law (2023)

  44. arXiv:2307.15593  [pdf, other

    cs.LG cs.CL cs.CR

    Robust Distortion-free Watermarks for Language Models

    Authors: Rohith Kuditipudi, John Thickstun, Tatsunori Hashimoto, Percy Liang

    Abstract: We propose a methodology for planting watermarks in text from an autoregressive language model that are robust to perturbations without changing the distribution over text up to a certain maximum generation budget. We generate watermarked text by mapping a sequence of random numbers -- which we compute using a randomized watermark key -- to a sample from the language model. To detect watermarked t… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: reformatting of camera-ready version accepted to TMLR, with minor edits to introduction

  45. arXiv:2307.03576  [pdf, ps, other

    cs.LG

    One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention

    Authors: Arvind Mahankali, Tatsunori B. Hashimoto, Tengyu Ma

    Abstract: Recent works have empirically analyzed in-context learning and shown that transformers trained on synthetic linear regression tasks can learn to implement ridge regression, which is the Bayes-optimal predictor, given sufficient capacity [Akyürek et al., 2023], while one-layer transformers with linear self-attention and no MLP layer will learn to implement one step of gradient descent (GD) on a lea… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  46. arXiv:2307.02811  [pdf, other

    astro-ph.HE astro-ph.GA

    Machine Learning Classification of Repeating FRBs from FRB121102

    Authors: Bjorn Jasper R. Raquel, Tetsuya Hashimoto, Tomotsugu Goto, Bo Han Chen, Yuri Uno, Tiger Yu-Yang Hsiao, Seong Jin Kim, Simon C. -C. Ho

    Abstract: Fast Radio Bursts (FRBs) are mysterious bursts in the millisecond timescale at radio wavelengths. Currently, there is little understanding about the classification of repeating FRBs, based on difference in physics, which is of great importance in understanding their origin. Recent works from the literature focus on using specific parameters to classify FRBs to draw inferences on the possible physi… ▽ More

    Submitted 6 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 24 pages, 14 figures, accepted for publication in MNRAS. For summary video, please see https://www.youtube.com/watch?v=wYx6t2G__84&list=PLOpYDs2PkYlYIiKDjDz6r6aKXcXdJZXYb&index=13&ab_channel=NCHUAstronomy

  47. arXiv:2307.02104  [pdf, other

    astro-ph.GA

    Molecular outflow in the reionization-epoch quasar J2054-0005 revealed by OH 119 $μ$m observations

    Authors: Dragan Salak, Takuya Hashimoto, Akio K. Inoue, Tom J. L. C. Bakx, Darko Donevski, Yoichi Tamura, Yuma Sugahara, Nario Kuno, Yusuke Miyamoto, Seiji Fujimoto, Suphakorn Suphapolthaworn

    Abstract: Molecular outflows are expected to play a key role in galaxy evolution at high redshift. To study the impact of outflows on star formation at the epoch of reionization, we performed sensitive ALMA observations of OH 119 $μ$m toward J2054-0005, a luminous quasar at $z=6.04$. The OH line is detected and exhibits a P-Cygni profile that can be fitted with a broad blue-shifted absorption component, pro… ▽ More

    Submitted 17 November, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted to ApJ

  48. arXiv:2307.00874  [pdf, ps, other

    quant-ph math-ph

    Center Preserving Automorphisms of Finite Heisenberg Group over $\mathbb Z_N$

    Authors: T. Hashimoto, M. Horibe, A. Hayashi

    Abstract: We investigate the group structure of center-preserving automorphisms of the finite Heisenberg group over $\mathbb Z_N$ with $U(1)$ extension, which arises in finite-dimensional quantum mechanics on a discrete phase space. Constructing an explicit splitting, it is shown that, for $N=2(2k+1)$, the group is isomorphic to the semidirect product of $Sp_N$ and $\mathbb Z_N^2$. Moreover, when N is divis… ▽ More

    Submitted 2 October, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 23 pages, 1 figure

  49. arXiv:2306.02663  [pdf, other

    astro-ph.SR astro-ph.EP astro-ph.GA

    A T-Dwarf Candidate from JWST Early Release NIRCam data

    Authors: Po-Ya Wang, Tomotsugu Goto, Simon C. -C. Ho, Yu-Wei Lin, Cossas K. -W. Wu, Chih-Teng Ling, Tetsuya Hashimoto, Seong Jin Kim, Tiger Y. -Y. Hsiao

    Abstract: We present a distant T$-$type brown dwarf candidate at $\approx2.55$ kpc discovered in the Cosmic Evolution Early Release Science (CEERS) fields by James Webb Space Telescope (JWST) NIRCam. In addition to the superb sensitivity, we utilised 7 filters from JWST in near-IR and thus is advantageous in finding faint, previously unseen brown dwarfs. From the model spectra in new JWST/NIRCam filter wave… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 5 pages, 6 figures and 1 table; accepted for publication in MNRAS; A summary video is available at https://youtu.be/PQW79tuS0mI

  50. arXiv:2305.18619  [pdf, other

    cs.CL cs.LG

    Likelihood-Based Diffusion Language Models

    Authors: Ishaan Gulrajani, Tatsunori B. Hashimoto

    Abstract: Despite a growing interest in diffusion-based language models, existing work has not shown that these models can attain nontrivial likelihoods on standard language modeling benchmarks. In this work, we take the first steps towards closing the likelihood gap between autoregressive and diffusion-based language models, with the goal of building and releasing a diffusion model which outperforms a smal… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.