Roadmap
What we're working on
Planned
Add ability to preview designs.
Users should be able to preview designs before generating a video
Add Image slideshow to videos
Include the ability to include a series of images timed with audio in generated videos.
Allowing CNAME and white labelling of the video landing page.
This means that people could easily display videos on their own websites.
More granular security access for team members
Will need to be scoped out, but essentially allowing or blocking access to individual audio files, designs and videos etc
Lip-Synced Animated Photos for Speakers
Animated Avatars Seamlessly integrate with the current voice separation technology to ensure that each speaker has a unique animated representation, enhancing clarity and engagement.
Advanced transcript formatting options
Ability to break lines for different speakers and to be able to add spacing. The ability to add formatting - bold, italics etc, and the ability to add free form text for sound effects or action notes
In progress
Allow videos to be created and exported via an API
Will require a way of importing audio and exporting video, potentially somewhere like dropbox. Potentially add Zapier integration.
Add more visualisers and support multicoloured visualisers and gradients!
Add more visualisers and progress bars! Possibility of enabling gradients and multicoloured items. Add the ability to "tune" the behaviour of visualisers slightly ie speed and movement. This will be an ongoing task with many incremental improvements, so it will probably stay under "In progress" for some time
Complete
Add pauses in between text to speech paragraphs
Add ability to add pauses of variable lengths in between paragraphs for text to speech so it doesn't sound so jarring.
Voice cloning
Make it possible for users to clone their own voices to be used with our text to speech functionality. Also, make it possible for users to be able to provide their own Elevenlabs keys.
Advanced analytics tracking on video view page
Track video views for video views on the public view page and in embeds and display this in user's video page.
Bulk audio / video file uploads
Enable uploading of multiple audio / video files and have them process in the background.
Remove filler words and pauses
Remove all annoying filler words - ums and ahs etc
Enable teams or organisations so users can share assets/designs and videos.
Some people work with virtual assistants or in large organisations, so this would be useful for them.
Add a simple timeline editor and the ability to create intro and outro cards
We might want to have transitions between the elements of the timeline. Please note that this is a fairly large task and might take some time to implement, but is something We are definitely keen to move towards.
Allow users to upload their own transcription files
These files will need to be in vtt or srt format rather than plain text so we can extract timecodes.
Make background videos available in more colours
Update the UI so it supports selecting a type of video, then drilling down into the colour scheme.
Enable AI generated background videos.
Generate background videos using AI - potentially generate the prompts for these based on the user's audio content and synchronise the video with audio.
Request a feature
Have an idea for a feature you'd like to see in SoundMadeSeen? Let us know!