Bulk Transcribe Audio to Markdown Online Free

Convert speech to Markdown for Obsidian, Notion & note-taking apps.

Drop up to 50 files at once — no install, no sign-up required.

Drop Audio or Video Files Here

100 MB or 1 hour per file · Up to 50 files · 3 parallel conversions · 2 credits per minute

Encrypted · AI-Powered · Global Servers · Auto-delete 1h

⚡ Median transcription time (last 10k jobs): 732 ms per minute of audio

Outputs: Markdown · Model: Fast

How it works

1 · Drop your files
Drag & drop audio or video files. Supports MP3, WAV, M4A, MP4, and more. No account required.
2 · We create your notes
Transcribed by OpenAI Whisper (choose the Fast or Quality model) with an AI-formatted title. Encrypted in transit & at rest.
3 · Download & auto-delete
Get your Markdown file in seconds. Files delete automatically after 1 hour.

Frequently Asked Questions

How do I import Markdown into Obsidian?

Just save the .md file to your vault folder. Obsidian auto-indexes new files, so your transcript becomes instantly searchable.

Add YAML frontmatter (tags, date, source) for better organization.

The transcript integrates seamlessly with your existing notes.

Why Markdown instead of plain text?

Markdown works natively with PKM apps (Obsidian, Notion, Logseq, Roam).

It supports headers, links, and formatting that plain text doesn't.

Most importantly, Markdown syncs via any cloud service, renders beautifully in these apps, and future-proofs your notes.

Which apps support Markdown?

Popular apps with native Markdown support:

Obsidian, Notion, Roam Research, Logseq
Bear, Craft, Typora, iA Writer
GitHub, GitLab, and documentation tools

It's the lingua franca of knowledge management.

Can I add tags or frontmatter?

The generated file includes an AI-formatted title as an H1 heading.

For Obsidian workflows, add YAML frontmatter (---) at the top with tags, date, or custom metadata.

The transcript body uses clean paragraphs — ready to link to your other notes.
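As a rough illustration of the frontmatter step, the sketch below prepends a YAML block to a downloaded transcript. The function name `add_frontmatter`, the field names (`tags`, `date`, `source`), and the sample file name are assumptions chosen for the example, not something the converter produces; it also assumes the source string contains no double quotes.

```python
from datetime import date


def add_frontmatter(markdown: str, tags: list[str], source: str, day: date) -> str:
    """Prepend a YAML frontmatter block to a Markdown transcript.

    Hypothetical helper: the field names are illustrative, not a format
    emitted by the converter.
    """
    lines = ["---", "tags:"]
    # One YAML list item per tag
    lines.extend(f"  - {tag}" for tag in tags)
    lines.append(f"date: {day.isoformat()}")
    # Quoted so file names with spaces or colons stay valid YAML
    lines.append(f'source: "{source}"')
    lines.append("---")
    # Blank line between the frontmatter and the H1 title
    return "\n".join(lines) + "\n\n" + markdown


transcript = "# Meeting: Q4 Planning\n\nLet's review the quarterly goals.\n"
note = add_frontmatter(
    transcript,
    tags=["meeting", "transcript"],
    source="q4-planning.mp3",
    day=date(2026, 2, 27),
)
print(note)
```

Writing the returned string to a .md file inside the vault folder should be enough for Obsidian to pick up the tags and date as note properties.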

When should I use Markdown vs Word vs EPUB?

Markdown — PKM apps, developer tools, GitHub
Word — Editing, collaboration, Track Changes
EPUB — E-readers for offline reading

Want to read on Kindle or Kobo? Try Audio to EPUB →

What is the difference between Fast and Quality models?

Two OpenAI Whisper models — choose speed or accuracy:

Model	Engine	Speed	Cost
Fast	Whisper V3 Large Turbo (809M)	~216x realtime	2 credits/min
Quality	Whisper V3 Large (1.55B)	~189x realtime	5 credits/min

Fast is the default — great for clear audio, podcasts, and lectures.

Quality uses the full 1.55B-parameter model. Independent benchmarks show ~10% WER for Quality vs ~12% for Fast (Artificial Analysis). Choose Quality for accented speech, noisy recordings, or technical content.

Both models support 99+ languages. Switch in the options panel above.

Sources: Groq docs, Artificial Analysis benchmark, Hugging Face model cards.

Can I use Meeting Intelligence with my notes?

Yes — Meeting Intelligence adds speaker labels to your Markdown transcript, making it easy to track who said what in your knowledge base.

When enabled, speaker attribution appears inline with the text:

# Meeting: Q4 Planning

**John:** Let's review the quarterly goals.

**Sarah:** Revenue is up 15% but we need to address churn.

This is perfect for Obsidian, Notion, or Logseq workflows where you want to:

Link speaker names to their profile notes
Search for everything a specific person said
Track decisions back to who proposed them

Our AI post-processing attempts to identify speakers by name when they're introduced or addressed in the audio. Important: Name detection isn't perfect — it works best when speakers introduce themselves ("Hi, I'm John...") or are called by name. If names aren't detected, you'll see generic labels like "Speaker 1" and "Speaker 2" which you can easily edit in your notes app.

Meeting Intelligence costs extra credits and is ideal for meeting notes, interviews, and any content where speaker attribution matters in your PKM system.
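To make "search for everything a specific person said" concrete, here is a minimal sketch that groups lines by speaker. It assumes each turn sits on a single line in the bold-label form shown in the sample above; `group_by_speaker` is a hypothetical helper, and multi-paragraph turns would need extra handling.

```python
import re
from collections import defaultdict

# Matches lines such as "**John:** Let's review the quarterly goals."
SPEAKER_LINE = re.compile(r"^\*\*(?P<speaker>[^*:]+):\*\*\s*(?P<text>.+)$")


def group_by_speaker(markdown: str) -> dict[str, list[str]]:
    """Collect each speaker's lines, in order of appearance."""
    turns: dict[str, list[str]] = defaultdict(list)
    for line in markdown.splitlines():
        match = SPEAKER_LINE.match(line.strip())
        if match:  # headings and blank lines are skipped
            turns[match["speaker"].strip()].append(match["text"].strip())
    return dict(turns)


sample = """# Meeting: Q4 Planning

**John:** Let's review the quarterly goals.

**Sarah:** Revenue is up 15% but we need to address churn.
"""

by_speaker = group_by_speaker(sample)
print(by_speaker["Sarah"])
```

The same pattern also matches generic labels such as "Speaker 1", so it can be run before or after renaming speakers in the note.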

What are the limits for this converter?

Tier	Max File Size	Max Files/Batch	Parallel Processing
Guest/Free	100 MB	50 files	3 at once
Pro	1024 MB	1000 files	6 at once

Note: File size limits are specific to this converter. Batch and parallel processing limits apply to all audio converters site-wide. See all converter limits →

How are credits calculated for this conversion?

Cost: 2 credits per minute

How it works:

Files up to 1 minute: 2 credits
2 minutes: 4 credits
3 minutes: 6 credits
4 minutes: 8 credits

Example: A 10-minute file = 20 credits. A 180-minute (3h) audiobook = 360 credits.

Why per-minute? Audio conversion time scales with content duration, not file size. Longer audio requires proportionally more processing.
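The pricing above can be sketched as a small function. This is an estimate under one assumption not stated on the page: that partial minutes are rounded up to the next whole minute. The name `credits_for` is hypothetical.

```python
import math


def credits_for(duration_seconds: float, credits_per_minute: int = 2) -> int:
    """Estimate the credit cost of one file.

    Assumption: partial minutes round up, and every file is billed
    for at least 1 minute.
    """
    minutes = max(1, math.ceil(duration_seconds / 60))
    return minutes * credits_per_minute


print(credits_for(10 * 60))       # 10-minute file on the Fast model
print(credits_for(180 * 60))      # 180-minute audiobook on the Fast model
print(credits_for(10 * 60, 5))    # same 10-minute file on the Quality model
```

Passing 5 as `credits_per_minute` gives the Quality model cost listed in the model comparison table.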

What are my daily and monthly credit limits?

Credit allocations vary by account tier:

Tier	Daily Limit	Monthly Limit
Guest	50 credits/day	—
Free	50 credits/day	—
Pro	—	12,000 credits/month

Daily credits (Guest & Free tiers) reset every day at midnight UTC. Monthly credits (Pro) reset on your billing cycle date.

Note: At 2 credits per minute, audio files up to 1 minute long cost 2 credits each, so Pro users can convert up to 6,000 such files per month.
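For planning, the limits in the table translate into minutes of audio with a simple division. The sketch below uses only the figures listed on this page; `minutes_within_budget` is a hypothetical helper and it ignores any rounding of partial minutes per file.

```python
def minutes_within_budget(credit_limit: int, credits_per_minute: int) -> int:
    """Whole minutes of audio that fit inside a credit allowance."""
    return credit_limit // credits_per_minute


# (tier, credit allowance, period) taken from the limits table
budgets = [("Guest/Free", 50, "day"), ("Pro", 12_000, "month")]
# Credits per minute for each model, from the model comparison table
rates = {"Fast": 2, "Quality": 5}

for tier, limit, period in budgets:
    for model, rate in rates.items():
        minutes = minutes_within_budget(limit, rate)
        print(f"{tier} / {model}: {minutes} minutes per {period}")
```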


Other Transcription Formats

Need a different format for your transcript?

EPUB — For e-readers
Word — Editable with collaboration
Plain Text — Raw transcript

What's New in Audio to Markdown

Latest improvements to this converter

Last updated February 27, 2026

Feb 27, 2026

Now available via the Convert.FAST REST API.

Feb 9, 2026

Added Whisper V3 Large as a Quality mode for higher-accuracy transcription.

Jan 22, 2026

Launched Audio to Markdown transcription for notes apps.

Need to get more done? Pro starts from $5.

1 GB files · 1,000 per batch · Priority queue · Web + API

See Pricing →

No subscription required.