RoadTones is a dataset-model-evaluation stack designed for tone-controllable text generation for road event videos. While existing models generate neutral, factual descriptions, they lack control over how events are expressed: their tone, urgency, or style. RoadTones bridges this gap by enabling audience-adaptive communication across mobility, ADAS development, and public engagement. RoadTones highlights:
@misc{parikh2026roadtonestonecontrollabletext,
title={RoadTones: Tone Controllable Text Generation from Road Event Videos},
author={Chirag Parikh and Siddhi Pravin Lipare and Ravi Kiran Sarvadevabhatla},
year={2026},
eprint={2605.21411},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2605.21411},
}