AutoSync Guide

The basic structure of the AutoSync3 system and its standard modules.

This guide assumes you already have AutoSync setup correctly. Please follow the Installation & Setup guide first if you haven't already.

Overview

In version 1.5 of LipSync Pro, AutoSync was expanded into a flexible, modular system, that can be used to automate any part of the clip creation process. The most common use, however, is still in detecting phonemes in an audio clip to generate phoneme markers, and there are two methods included to do that:

PocketSphinx

This is the same system used in the previous version, AutoSync2. It requires only an AudioClip containing dialogue, but provides results that range wildly in quality, and is only compatible with Microsoft Windows. This module is included with the LipSync Pro download by default.

Montreal Forced Aligner (MFA)

This is a newly-added system which makes use of a text transcript in addition to your AudioClip to provide much higher-quality results than PocketSphinx, and is compatible with both Windows and macOS. The transcript is required, however. Due to its large file size, this module is downloaded separately. If you have not already, please follow the Installation & Setup guide to get this module set up.

Basic Procedure

The easiest way of using AutoSync is simply using the included presets from the AutoSync Menu. With the MFA module installed, a preset called "Default" will be added to your project. To use it, open the Clip Editor from the Window > Rogo Digital > LipSync Pro menu, and select an AudioClip to use.

If you have a transcript pre-made for your dialogue (e.g. from an existing script), you can place it in a .txt file in the same folder and with the same name as your audio clip, and the Clip Editor will load it it in automatically. If not, you will need to type one out. In the Clip Editor's top menu, go to Edit > Clip Settings, type your transcript in the text box, and click the "Save" button.

Now, go to the AutoSync menu in the top menu, and click on AutoSync > Presets > Default. A progress bar will appear, and after a few seconds phoneme markers should appear along the timeline.

Modules

pageAutoSync Modules

Modules are the core components of AutoSync3. Each module is self-contained system that does some work on an input LipSyncData clip, either adding, removing or modifying data, then gives a clip back with the new changes in. The AutoSync system can queue up a number of modules in sequence, each one working on the results of the previous one.

Presets

pageAutoSync Presets

Presets are lists of modules and settings for each, that can be run to complete a certain task. For example, a preset is included that replicates the "High Quality" option from older versions of LipSync Pro, by running the PocketSphinx module with specific settings, then the "Marker Cleanup" module on the phonemes generated by PocketSphinx.

AutoSync Window

pageAutoSync Window

The AutoSync Window serves two purposes: providing advanced control over the AutoSync process (along with allowing the creation of presets), and enabling the batch processing feature, where a list of AudioClips can be provided and have AutoSync run on them one after another, automatically creating and saving a LipSyncData (or .xml) file for each.

Last updated