Enrich Media with AI

In iconik, you can add transcriptions and extremely detailed metadata, frame by frame, automatically.

MacBook mockup
Transcriptions

AI Video Transcription

Get affordable, fast, and accurate AI transcriptions for 36 languages with one click. 

With AI voice-to-text technology: 

Text is timestamped 

Text is attributed to speakers

Text becomes searchable as part of the asset metadata

At $0.03/minute, AI transcriptions are the most accessible way to enrich your media archive. *

View all supported languages for speech-to-text transcriptions
Arabic
Catalan
Croatian
Czech
Danish
Dutch
English
Farsi
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Indonesian
Italian
Japanese
Korean
Latvian
Lithuanian
Malay
Mandarin
Norwegian
Polish
Portuguese
Romanian
Russian
Slovak
Slovenian
Spanish
Swedish
Tamil
Talegu
Turkish
Languages

Supported languages for speech-to-text transcriptions:

Arabic
Catalan
Croatian
Czech
Danish
Dutch
English
Farsi
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Indonesian
Italian
Japanese
Korean
Latvian
Lithuanian
Malay
Mandarin
Norwegian
Polish
Portuguese
Romanian
Russian
Slovak
Slovenian
Spanish
Swedish
Tamil
Talegu
Turkish
Auto-tagging

Auto-tag Media with AI

Manually tagging media is time-consuming and hard to scale with consistency. AI auto-tagging automatically analyzes and tags media files based on the visual content of the media.

People

Animals

Objects

Colors

Scene changes

Pop culture references

Moods

Each tag is timestamped and instantly searchable.

Iconik can auto-tag your media for around $3/1000 images and $0.14/minute for video. *

Bring your own AI

Out of the box, you can use AI services through iconik. You can also choose to use your own AI license for these services:

Google Vision

Google Video AI

Amazon Rekognition

Rev AI

* Pricing for AI services is an estimate and is subject to change.

Artificial Intelligence

In iconik, you can add transcriptions and extremely detailed metadata, frame by frame, automatically.

Auto-tag your media

Once you open an asset, you can choose to “analyze” it. This will use AI to recognize every detail in your media such as color, subjects, feelings, and environments.

When the analysis is complete, new tags will be added and instantly searchable. Your videos will also have time-based tags so you can search for an exact frame!

For tags with a low confidence score, you will be able to decide if it's accurate or not.

Bring your own AI

Out of the box, you can use AI services through iconik. You can also choose to use your own AI license for these services:

Google Vision

Google Video AI

Amazon Rekognition

Rev AI

Amazon Transcribe

One-click video transcription

Transcriptions can easily be added in iconik using AI to convert voice to text. The text is timestamped, attributed to speakers, and then becomes part of the metadata for that video. This means you can locate a video even if you can only recall part of a spoken phrase.

Speakers are not always clear and AI can occasionally struggle to identify words. Editing text or speaker attributions is a simple and intuitive process.

Once your transcriptions are ready, they can appear as closed captions in the iconik player or be downloaded at text or WebVTT.

We use cookies to enhance your experience. By continuing to visit this site you agree to our use of cookies. More info on our cookie usage.