Audio Transcription & Meeting Notes
AI-Corporate can turn audio recordings into text and create meeting notes from the transcript. Transcription uses the provider from the central model catalog, such as OpenAI or European AI. When you start, choose the recording type: private recording, meeting, or lesson or presentation.
Start screen
On the transcription screen you can start a new recording or upload an existing audio file.
Providing audio
There are two ways to provide audio for transcription.
Record directly in AI-Corporate
Click Start recording to begin. Before recording starts, a dialog opens with the recording settings.
Recording settings
When starting a recording you can set:
- Recording type:
  - Private recording: one person close to the microphone.
  - Meeting: multiple speakers in one room.
  - Lesson or presentation: one main speaker with possible interaction.
- Specialist vocabulary and keywords: add names, abbreviations, product names, or terms that are often recognized incorrectly.
- Language: AI-Corporate uses your account language to guide transcription.
Behavior per recording type
The selected type determines how the recording is processed:
- Private recording with OpenAI uses realtime transcription. Text appears while you speak. Because this is meant for one person, speaker diarization is not applied and no interim audio files are stored.
- Meeting and Lesson or presentation use file-based processing. AI-Corporate processes audio parts during the recording and also processes the complete final recording when you stop. This path is suitable for longer recordings, multiple speakers, and recovery after interruptions.
- European AI/Mistral uses file-based processing for every recording type. For private recordings, diarization is disabled so the transcript is not unnecessarily split into speakers.
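The rules above can be read as a small decision table. The Python sketch below only illustrates that table. The names RecordingType, Provider, ProcessingPlan, and plan_processing are hypothetical and are not part of AI-Corporate. It also assumes that diarization is attempted for lessons and presentations in the same way as for meetings, which the text does not state explicitly.

```python
from dataclasses import dataclass
from enum import Enum


class RecordingType(Enum):
    PRIVATE = "private"
    MEETING = "meeting"
    LESSON = "lesson_or_presentation"


class Provider(Enum):
    OPENAI = "openai"
    EUROPEAN_AI = "european_ai"  # European AI/Mistral


@dataclass(frozen=True)
class ProcessingPlan:
    mode: str          # "realtime" or "file_based"
    diarization: bool  # whether speakers are separated in the transcript


def plan_processing(recording_type: RecordingType, provider: Provider) -> ProcessingPlan:
    """Hypothetical mapping of recording type and provider to a processing path."""
    if recording_type is RecordingType.PRIVATE:
        # Private recordings are meant for one speaker, so diarization is off.
        if provider is Provider.OPENAI:
            return ProcessingPlan(mode="realtime", diarization=False)
        return ProcessingPlan(mode="file_based", diarization=False)
    # Meetings and lessons/presentations use file-based processing.
    # Diarization for lessons is an assumption of this sketch.
    return ProcessingPlan(mode="file_based", diarization=True)
```

For example, plan_processing(RecordingType.PRIVATE, Provider.OPENAI) yields the realtime path without diarization, while any meeting yields file-based processing.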
Use an existing audio file
You can also upload an existing recording. Supported formats include MP3, WAV, M4A, and WebM. After upload, the file is processed with the same transcription approach as a recording of the same type.
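If you prepare files in bulk before uploading, a quick local check of the file extension can save a failed upload. This is a minimal sketch: the function name is_supported_audio is hypothetical, the check looks only at the extension and not at the actual audio encoding, and the set covers just the four formats named above, so AI-Corporate may accept more.

```python
from pathlib import Path

# Only the formats named in this article; the real list may be longer.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".webm"}


def is_supported_audio(filename: str) -> bool:
    """Return True if the file extension is one of the formats listed above."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

The comparison is case-insensitive, so a file named standup.MP3 passes while notes.txt does not.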
Transcription and speakers
The transcript can contain time blocks and speaker labels. For conversations and meetings, the model tries to distinguish speakers. For private recordings this is disabled because the transcript is intended for one speaker. Sometimes generic labels such as Speaker A and Speaker B are used instead of real names. When speakers explicitly introduce themselves in the recording, AI-Corporate takes those introductions into account during post-processing where possible.
Speaker recognition still depends on audio quality, overlapping speech, and the selected model. If names or terms are not recognized correctly, you can improve the transcript with AI.
Improve with AI
After processing, use Improve with AI for targeted corrections, such as renaming speakers, fixing a technical term, or applying a spelling correction consistently. Always check the result when the transcript is used for reporting or decisions.
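To make clear what "applying a correction consistently" means for speaker names, the sketch below does the same kind of replacement locally on an exported transcript. It does not show how Improve with AI works internally. The function rename_speakers and the "Speaker A: text" line layout are assumptions made for illustration only.

```python
import re


def rename_speakers(transcript: str, names: dict[str, str]) -> str:
    """Replace generic speaker labels with real names everywhere in the text."""
    if not names:
        return transcript
    # Match whole labels only, so "Speaker A" does not match inside "Speaker AB".
    labels = sorted(names, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(label) for label in labels) + r")\b")
    return pattern.sub(lambda match: names[match.group(1)], transcript)


transcript = (
    "Speaker A: Hi, I'm Dana.\n"
    "Speaker B: Hello Dana.\n"
    "Speaker A: Let's start."
)
renamed = rename_speakers(transcript, {"Speaker A": "Dana", "Speaker B": "Lee"})
```

Every occurrence of a label is replaced in one pass, which is the consistency you should also verify after an AI-assisted correction.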