Skip to content

Latest commit

 

History

History
545 lines (378 loc) · 24.6 KB

CHANGELOG.md

File metadata and controls

545 lines (378 loc) · 24.6 KB

Changelog

All notable changes to this project will be documented in this file, starting with version 0.17.7.

[0.24.0] - 2024-02-22

Project Management: create projects, then link transcriptions, stories and text files to access them easier or perform project-wide searches faster.

Please back up your StoryToolkitAI configuration folder before updating to this version! The project management feature will restructure the project.json files irreversibly.

Added

  • Project Management - create and manage projects with multiple transcripts and stories
  • Project Files List - view all the transcriptions, stories and other files linked to a project
  • Ability to link transcriptions, stories and other files to Projects
  • Ability to open transcriptions, stories and other files from the Project Files List
  • Ability to find transcriptions, stories and other files by typing directly in the Project Files List
  • Ability to search through all project files and synchronized NLE timeline markers using Search -> Advanced Search in current project...
  • Ability to search transcriptions from the Project Files List context menu (right click on transcription)
  • Ability to send full transcriptions to the Assistant from the Project Files List context menu (right click on transcription)
  • Ability to copy and paste from transcription to story and from story to another story
  • Ability to change transcription name and timecode data using Transcription Settings window (Edit -> Transcription Settings...)
  • Metadata tab in Ingest window that allows setting timecode, frame rate, timeline/sequence name and others during ingest
  • Improved navigation to project files from the Project Files List context menu (right click on file)

Changes

  • Ability to save projects independent of their NLE (e.g. Resolve) counterparts
  • Removed Resolve-related buttons from main window (still available in Main Menu -> Integrations)
  • Transcription SRT and TXT are no longer saved automatically when saving transcriptions
  • Segments in transcriptions need to be selected to see the relevant right-click context menu for them
  • Removed key 't' shortcut from main window to avoid conflicts with the find function

Fixes

  • Fixed a bug that was preventing the tool from opening transcriptions that were initially not accessible
  • Added contrast to text window cursors to improve readability for light themes on macOS
  • Optimized Resolve updates using observer patterns
  • Fixed a bug that was opening the same text file in multiple windows
  • Fixed a bug that was pointing to the wrong result in text files when multiple similar results were found in Search
  • Fixed a bug that was triggering a shot removal error during video indexing
  • Updates check no longer locks the app if no internet connection is available
  • Fixed a bug that was causing the playhead to endlessly move backwards when syncing transcription windows with Resolve
  • Transcriptions and Stories are now reloaded from disk when opened in a new window, instead of reusing them from memory
  • Improved file list validation in Ingest window
  • Improved Resolve render and ingest
  • Other bug fixes and optimizations

[0.23.2] - 2024-01-26

Changes

  • Speakers are sent to Story Editor as uppercase plain text to respect Fountain format
  • Speakers segments are excluded from semantic search to avoid biases - will make use of them differently in the future
  • Improved Speaker Detection queue info

Fixes

  • Fixed XML export error when using Join Gaps feature with non-consecutive story lines
  • Fixed buggy behaviour when writing on non-text lines in Story Editor
  • Fixed Speaker Detection on segments with out of bounds approximated end times

[0.23.1] - 2024-01-24

Added

  • Added support for standalone version for Windows machines without CUDA-enabled graphics cards

Changes

  • Simplified installation instructions, see INSTALLATION.md
  • Improved requirements and FFmpeg check
  • Better handling of FFmpeg

Fixes

  • Fixed a bug that prevented using the BACKSPACE and DEL key in the Story Editor on Windows
  • Fixed a bug that didn't allow the user to select gpt-3.5-turbo in the Assistant
  • Fixed some auto-update issues
  • Other bug fixes and optimizations

[0.23.0] - 2024-01-16

Speaker Detection: detect speaker changes in audio either while ingesting or directly from the transcription window.

Meta Transcription Segments: add speakers, notes, etc. to your transcripts using SHIFT+ENTER while editing a transcript.

Added

  • Automatic Speaker Change Detection via Ingest or from the Transcription Window
  • Ability to add transcript meta segments, such as speaker, notes, etc. - use SHIFT+ENTER while editing a transcript to add a meta segment
  • Ability to choose whether to keep or not whisper debug info in Transcriptions
  • Ability to delete multiple transcript lines at once
  • Find and Replace for Transcriptions via Find window

Changes

  • Any line or group selections in the transcript are now cleared when editing transcript
  • Assistant also receives speakers when user requests transcription or story responses [t] or [st]
  • Improved transcript text editing
  • Removed Re-transcribe and Group Questions buttons from the Transcription Window (still available in main menu and on right-click)

Fixes

  • Fixed incorrect timecode calculation on EDL and FCP7XML export of Stories
  • Fixed a bug that prevented AVID DS exports dealing with 00:00:00:00 timecodes
  • Fixed incomplete Fusion Comp exports
  • Transcript segment selections are now cleared before editing the transcript
  • Fixed ingest time interval validation bug introduced in previous version
  • Fixed a bug that prevented the saving of the transcription_max_per_line_unit in Preferences
  • Other bug fixes and optimizations

[0.22.0] - 2024-01-06

Major improvements to the Assistant, including the ability to provide new Transcriptions or Stories as responses.

This means that you can now use the Assistant to translate, summarize, group, or even create selects via stories based on your transcriptions. More info here.

Added

  • Ability to request transcription responses from the Assistant using the [t] keyword
  • Ability to request story responses from the Assistant using the [st] keyword
  • Ability to create new transcriptions and stories based on Assistant responses
  • Ability to create new transcript groups based on Assistant responses
  • Ability to change the models used for the Assistant via the Preferences window or Assistant Preferences window
  • Ability to change advanced settings for the Assistant via the Preferences window or Assistant Preferences window
  • Ability to edit the Assistant chat history by enabling or disabling messages via right click context menu
  • Ability to reuse Assistant prompts via right click context menu
  • Ability to copy the Assistant prompts and conversations to clipboard via right click context menu
  • Support for storytoolkit.ai Assistant Models

Changed

  • Improved calculation of tokens and costs for the Assistant
  • Added support for new OpenAI Whisper patch
  • Added support for OpenAI version 1.6.0
  • Preferences: API Token becomes API Key to prevent confusion with the notion of tokens

Fixes

  • Fixed incorrect reporting of inaccessible media on Story Export
  • Other bug fixes and optimizations
  • Assistant queries no longer lock the UI

[0.20.0] - 2023-12-19

The Story Editor: a new feature that allows you to select text from transcripts and search results within a screenplay-like interface, while retaining the original timecode data and source for reference.

The Story Editor also supports exporting your selections as timelines in EDL or XML format, which can be used for editing in software like Resolve, Premiere Pro, and Avid Media Composer, or as Fountain screenplays for further editing in screenplay applications or for printing.

More details here.

Added

  • Story Editor - write screenplays and add text from your transcripts and search results
  • Export Story Editor content as EDL, XML or Fountain for editing in Resolve, Premiere, Final Cut Pro, Avid, etc.

Changed

  • Removed legacy app.py - the tool can now be started exclusively using storytoolkitai package
  • Automated Git Update (for git installations only)

Fixes

  • Fixed a bug that was messing results in the Advanced Search window when one of the results was at frame 0

[0.19.6] - 2023-11-29

Changed

  • Added support for OpenAI Whisper Large_v3 model

Fixes

  • Fixed a bug that was preventing the tool from indexing single-shot videos

[0.19.5] - 2023-08-15

Added

  • Ability to copy text from Assistant, Search and other text windows via right-click, top menu or Ctrl+C
  • Right-click on Transcript segment now shows the start and end timecodes of the segment

Fixes

  • Fixed a bug that was not allowing the entry of the API Token for new installations

[0.19.4] - 2023-07-08

Early update for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Changed

  • Faster transcriptions due to new OpenAI Whisper patch
  • Better signaling of invalid files in the Ingest window

Fixes

  • Fixed a bug that caused incomplete video search results, when the found frame was at frame 0
  • Fixed a bug that prevented the tool from rendering Timelines in Resolve on Windows (standalone version)
  • Disallow deleting of text beyond prompt in advanced search window
  • Better handling of text and video results in the Advanced Search window
  • Other optimizations and bug fixes

[0.19.3] - 2023-07-04

Early update for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Added

  • Transcript context menu - right click on segments to access features

Changed

  • On Windows: menus are now available on each window
  • When Re-transcribing, user must manually enable Video Indexing otherwise it will be skipped
  • Better FFmpeg check on startup - needed for processing video files

Fixes

  • Fixed a bug that was preventing the tool from rendering Timelines in Resolve
  • Fixed a few bugs related to the menu bar
  • Fixed a bug that was preventing grouping of questions on Windows

[0.19.2] - 2023-06-27

An early version of the Advanced Video Search feature! Search for specific content in your video files just by typing plain text. Early update for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Added

  • Ingesting of video content which includes AI based scene detection and video content indexing
  • Advanced Video Search - find anything you can describe with words in your footage

Changed

  • Transcriptions are now done via the Ingest window

Fixes

  • Fixed a few bugs in the Queue system that were either preventing the queue from moving forward or causing crashes
  • Fixed a few bugs related with the transcription process which were introduced with the new UI
  • Fixed scaling issues on Windows

Known Issues

  • The new UI is causing some issues on the menu on Windows - to be fixed asap

[0.19.1] - 2023-06-12

Added

  • Handling of large Search indexes via Queue

Changed

  • Improved handling of Transcriptions, with less read/write operations on the disk
  • Finished Transcriptions no longer open automatically, but by click on queue item

Fixes

  • Solved a bug that prevented including phrases that ended with spaces in the Advanced Search index
  • Improved handling of Timecodes

[0.19.0] - 2023-06-05

A new UI and a new Queue system that allows us to process both audio and video ingesting jobs. Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Changed

  • New User Interface
  • A better Queue system which allows queuing and managing of all CPU/GPU intensive tasks
  • Easier handling of multiple files when using the Transcription Settings window
  • Recursive folder ingesting in Transcription Settings window
  • Improved handling of Transcript Groups
  • Significant code refactoring and cleanup

Fixes

  • When using the Transcribe Timeline feature, Resolve no longer blocks the tool's UI

[0.18.5] - 2023-05-22

Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Added

  • Improved Advanced Search Interface (more updates coming soon)
  • Advanced Search automatic file cache now avoids the initial speed penalty when searching transcriptions that were already encoded for search
  • Text Analyzer now performs an initial clustering of text for better search results (experimental)

Changed

  • Group Questions feature now creates additional Question groups if others already exist

Fixes

  • Fixed a bug that returned an incomplete transcript group list in the UI after Group Questions was used
  • Fixed a bug that ignored the --skip-python-check argument when starting the tool

[0.18.4] - 2023-05-16

Added

  • Advanced Search on Transcript Group Notes
  • Go To Timecode function in Transcription Window via menu button or Equal Key

Fixes

  • Fixed a bug that prevented exporting to Fusion Text if the first segment was at frame 0

[0.18.3] - 2023-05-02

Info

Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Added

  • Group Questions - use AI to automatically detect and group questions from transcripts
  • Ability to export transcript to Blackmagic Fusion Text+ via menu button
  • Time Intervals in the Transcription Settings Window can now be set using timecodes too
  • Show transcription in File Explorer menu button

Fixes

  • Fixed a bug that was preventing CUDA machines from using the CPU for transcriptions

[0.18.2] - 2023-04-28

Info

Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI Please back up your StoryToolkitAI configuration folder before updating to this version if updating from 0.17

Added

  • Ability to use timecodes for copying transcript segments even without Resolve API connection
  • Export to Avid DS format via menu button
  • Transcriptions of audio rendered from Resolve are now auto-linked to their respective timelines

Fixes

  • The Find menu button now works correctly for all windows
  • Disable Resolve API menu button now disables all Resolve-related menu buttons
  • API Token is now validated correctly

[0.18.1] - 2023-04-19

Info

This update improves the usability of the Transcription Window and adds more menu buttons for easier access to features.

Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI Please back up your StoryToolkitAI configuration folder before updating to this version if updating from 0.17

Added

  • Ability to export transcripts as SRT and Text files via menu buttons
  • Most transcription related features are now available via menu buttons
  • Connect and Disable Resolve API menu buttons
  • Open Last Used Folder button in File menu
  • Custom Punctuation Marks via Preferences Window to be used for splitting segments on punctuation
  • Warning when trying to add markers to non-linked timeline in Resolve

Changed

  • Resolve API connection is disabled by default on new tool installations - see Connecting to the Resolve API
  • Additional buttons on the Transcription Window for easier access to features

Bugfixes

  • Fixed a bug that prevented the tool from using some Resolve API features introduced in version 0.18.0
  • Fixed a bug that didn't allow aligning and splitting segments according to Resolve playhead position

[0.18.0] - 2023-04-11

Info

Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI Please back up your StoryToolkitAI configuration folder before updating to this version.

Added

  • Ability to cancel transcriptions from the queue
  • Auto Add button in Transcription Groups allows adding of segments to group when they're selected
  • Select All button to Find feature in Transcription Window allows selecting all segments that match the search results
  • Ability to Open Transcriptions and Transcribe via menu click
  • Added Search and Assistant menus

Changed

  • Major restructuring of code and files, please back up StoryToolkit configuration folder before updating!
  • New start command - see here
  • CMD/CTRL+A now also deselects segments in Transcription Window

Bugfixes

  • Resolve-related buttons now refresh correctly in the Transcription window on resolve connection status change
  • Re-transcribing a re-transcribed transcription in the same session now works correctly
  • Closing of transcription windows is now handled correctly
  • Fixed a bug that prevented adding selected segments to groups with non-lowercase names

[0.17.19] - 2023-04-07

Info

Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Added

  • Split On Punctuation setting in the Transcription Settings window - more info in Transcription Settings
  • Prevent Gaps Shorter Than setting in the Transcription Settings window - more info in Transcription Settings
  • CMD+SHIFT+G in transcription window adds the selected segments to the currently selected group without clearing the group
  • Assistant [calc] function to get the minimum number of tokens you're sending with each message to OpenAI

Bugfixes

  • Fixed API token check default value
  • Fixed project settings file check
  • Fixed m shortcut bug in transcription window

[0.17.18] - 2023-03-30

Info

This update allows you to use ChatGPT for whatever section of your transcripts you want and leverage state-of-the-art AI in your process. More ChatGPT-related optimizations will be added in the future! Early update available for Patreon Frequent Users and Producers only - more info on patreon.com/StoryToolkitAI

Added

  • Direct interface to ChatGPT via Assistant window
  • Key O sends selected transcript segments to Assistant window as context for conversation
  • SHIFT+O includes times when sending transcript segments to Assistant window
  • OpenAI Key entry in the Preferences window - needed to use ChatGPT

Bugfixes

  • Fixed a bug that prevented the tool from starting on Windows machines
  • Fixed a formatting bug for text windows

[0.17.17] - 2023-03-28

Added

Bugfixes

  • Fixed python_check and project path checks

[0.17.16] - 2023-03-25

Added

  • Automatically disable Resolve API connection if running on a Windows machine and multiple Python versions are installed
  • Tool accepts --skip-python-check argument to skip the above Python version check
  • App Icons for Windows and macOS

Changed

  • Improved the way the tool checks for updates internally

Bugfixes

  • Small interface improvements
  • Advanced Search - search_corpus_min_length can now be set via config.json to prevent the tool from not searching single-character languages properly.

[0.17.15] - 2023-03-20

Added

  • "Increased Time Precision" gives better transcription timestamps due to the new openai-whisper module
  • Tool now warns user on exit if there still are transcriptions in progress
  • User can choose to not be notified about the current update
  • User Token check - see StoryToolkit on Patreon for more info

Bugfixes

  • Fixed a bug that caused the transcription process to fail for files with an associated render.json file

[0.17.14] - 2023-03-04

Added

  • Transcription progress now shown in the Transcription Log window
  • Status change when tool is pre-detecting speech in the Transcription Log window
  • mots_whisper.py (experimental - more info below)
  • Auto-install of required packages if requirements are not met

Changed

  • Update notifications based on the user's machine and architecture for the standalone version

Bugfixes

  • Fixed a bug that caused the tool to ignore pre_detect_speech when reloading transcription queues at startup
  • Fixed a bug that ignored transcribing audio without pre_detect_speech enabled

Additional Info

  • Starting with this update we're using a custom version of the openai-whisper module which will allow us more precise transcriptions and process visibility.
  • Word-level timings and other transcription-related updates coming soon!

[0.17.13] - 2023-02-22

Added

  • Option to Pre-Detect Speech in the Transcription Settings window aiming to reduce the transcription time and AI hallucinations

Additional Info

  • Please use pip install -r requirements.txt to add the new required packages

[0.17.12] - 2023-02-21

Added

  • The Advanced Search now supports changing the model using the command [model:<model_name>] (prompt [help] in Advanced Search for more details)
  • The Advanced Search window now has a prompt history that can be navigated using the up and down arrow keys (only available for the current session)
  • Preferences -> Skip Transcription Settings allows the user to make the Tool skip the Transcription Settings window on new transcriptions, by using the saved Transcription settings
  • Tool Update notifications now include the CHANGE LOG for the new updates

Fixed

  • Transcription Log Window lets the user know when the log is empty instead of rendering and empty window
  • Updated requirements.txt to use future==0.18.3 due to a security vulnerability in the previous future version

Bugfixes

  • Fixed crashes due to wrong encoding when reading certain files for Advanced Search
  • Fixed re-opening of Preferences window causing a crash

Additional Info

  • Please use pip install -r requirements.txt to add the new required packages

[0.17.11] - 2023-02-20

Added

  • CMD/CTRL+M in the Transcription Window now select all segments under markers filtered by color or name from the current Resolve timeline
  • "Starts With" can be used to filter the markers to be rendered when using the "Render Markers to Stills" or "Render Markers to Clips" buttons
  • When adding timeline markers from Transcription Window both the marker name and the color can be entered
  • ESC key now closes the Transcription Settings window
  • SHIFT+A if text is selected in the Transcription Window will now select all segments under that text

Fixed

  • Tool knows which window is currently focused - to be used for future features

[0.17.10] - 2023-02-19

Added

  • Ability to semantically search markers from project.json files using the "Advanced Search" button in the main window

Changed

  • Advanced Search is now performed in a console-like window to improve functionality and to preserve search history
  • Advanced Search window now has a Find function (Ctrl/CMD+F) to search for text in the window

[0.17.9] - 2023-02-18

Added

  • Advanced search results of text files are now opened and highlighted in a new window
  • Resolve timeline markers are now saved in the project.json file to allow future project-wide marker searches
  • Groups button in transcription window opens up the groups window for that transcription (same as SHIFT+G)

Changed

  • Find function in transcripts is now moved to its own window (use Ctrl/CMD+F to open it)

[0.17.8] - 2023-02-17

Added

  • Ability to search semantically in any .txt file on your machine using the "Advanced Search" button in the main window
  • Advanced search caching for significantly faster searches
  • Using --noresolve argument on main script disables Resolve API polling for that run

[0.17.7] - 2023-02-08

Changed

  • Switched to new openai-whisper module which supports the large-v2 model

Bugfixes

  • Fixed a few Resolve API communication issues on timeline and project change