George Danforth
Jonathan Mortensen
https://confident.security
This document proposes a standardized method for individuals to signal their desire to opt out of audio recording and transcription through a sub-audible digital signal embedded in audio streams. The DO NOT RECORD ME (DNRM) signal standard aims to provide users with a technical mechanism to express their privacy preferences regarding automated speech transcription and audio capture systems.
As AI-powered audio transcription technology becomes more powerful and more cost-effective to run, we are seeing a proliferation of tools making use of this technology across a wide range of use-cases. While many such use-cases are seemingly benign (e.g. transcribing work meetings), others are explicitly adversarial (e.g. cheating on job interviews). In any case where an individual does not explicitly consent to being the subject of an AI-powered transcription, there is the potential for an abuse of that individual's privacy.
There currently exists no technical mechanism allowing individuals to opt out of AI-powered transcriptions. Whether or not an individual's privacy is respected in the context of AI transcription software is purely up to the discretion of the individual operating the transcription software.
We propose DO NOT RECORD ME (DNRM), a standardized infrasonic, sub-audible digital signal that individuals can inject into their audio streams to indicate their opt-out preference from audio capture and AI audio transcription. The DNRM signal functions as an audio-domain analog to the HTTP DO NOT TRACK header, establishing a technical mechanism for individuals to assert their privacy preferences in digital spaces where audio capture and transcription may occur. When detected by compliant transcription systems, the DNRM signal triggers the exclusion of the originating speaker's audio stream from transcription or capture.
The DNRM signal operates in the infrasonic range to ensure compatibility with most widely used digital voice and video communication systems. This design choice addresses the constraint that most digital audio streams used for voice transmission use a 16 kHz sample rate, which would severely undersample ultrasonic frequencies, rendering them undetectable. Infrasonic frequencies, by contrast, are oversampled in these systems, ensuring reliable detection.
The DNRM signal consists of three summed sine waves at frequencies of 2 Hz, 3 Hz, and 5 Hz. This combination provides the following advantages:
The signal employs an amplitude control constant of 0.0025, maintaining the entire signal below -50 dB. Although most consumer speakers and headphones filter frequencies outside the audible range before signal reproduction, this constant ensures that the signal will have very low amplitude, providing a safeguard against unintended audibility in cases where audio streams might be connected to high-end equipment with extended low-frequency response.
Detection of the DNRM signal follows a straightforward signal processing approach:
© 2025 Confident Security. All rights reserved.