Introduction
Gen-3 Alpha is the first of upcoming models that offer improvements in fidelity, consistency, motion, and speed over previous generations of models.
Act-One allows you to bring a character image to life by uploading a driving performance to precisely influence expressions, mouth movements, and more.
In this article, driving performance refers to the video that will influence an image. Character image refers to the image that will be animated by the driving performance.
This article outlines how to use Act-One on Gen-3 Alpha, input best practices, the available settings, and more.
Spec Information
Spec | Gen-3 Alpha |
Cost | 10 credits per second, 50 credit minimum |
Maximum output duration | 30 seconds |
Explore Mode on Unlimited Plans | Yes |
Platform availability | Web |
Required base prompt inputs | Video Image |
Output resolutions | 1280x768 |
Frame Rate (FPS) | 24fps |
Best Practices for Act-One Input
Before diving in, review these best practices to ensure that your input selections will set your generation up for success. Most output issues can be addressed by using inputs that follow these recommendations.
Driving Performance
- Well-lit with defined facial features
- Single face framed from around the shoulders and up
- Forward-facing in the direction of the camera
- Face is in frame for the entire video
- Ensure the face doesn't move in and out of the frame
- Clear mouth movement and expressions
- Certain expressions, such as sticking out a tongue, are not supported
- Minimal body movement
- No face occlusions in frame
- No cuts that interrupt the shot
- Follows our Trust & Safety standards
Character Images
- Well-lit with defined facial features
- A single face framed from around the shoulders and up
- Forward-facing in the direction of the camera
- Follows our Trust & Safety standards
Step 1 – Uploading the Driving Performance
Begin by navigating to Generative Video in your Dashboard.
From here, make sure the Gen-3 Alpha model is selected from the top left corner dropdown. You’ll find the Act-One icon in the left hand toolbar:
In the top half of the Act-One window, drag and drop a new video or select an existing video from your Assets to add your driving performance.
Make sure that the driving performance follows our recommended best practices for the best results.
Your driving performance should always be forward-facing, even if the character image you plan to upload is in a different angle.
Preliminary face-detection will run on your driving performance before you’re allowed to generate.
Below are examples of driving performances and their outputs:
Driving performance | Output |
Once your driving performance is uploaded, you’re ready to choose your character image.
Step 2 – Selecting the Character Image
Select the character reference image in the bottom half of the Act-One window.
Choose from an existing preset image, or switch to the Custom tab to upload your own.
Act-One can support a wide variety of input images, but images that closely follow our best practices will provide more consistent results when compared to more experimental images.
Below is a chart that outlines our recommendations in more detail. Variations annotated with a ✅ should work well in most cases, ⚠️ may sometimes work or provide unexpected results, and ❌ will likely not provide ideal results in most cases.
This chart isn’t meant to deter experimentation, but rather act as a resource for those who need each generation to be satisfactory. Don’t be afraid to travel outside of these recommendations if you’re looking to push the limits of Act-One.
Category | Variation | Example | Support |
Character type | Human | ✅ | |
Non-human | ❌ | ||
Character angle | Forward-facing/Front view | ✅ | |
Profile view | ❌ | ||
Character distance | Shoulders and up | ✅ | |
Torso and up | ✅ | ||
Full body | ⚠️ | ||
Character silhouette | Intermediate | ✅ | |
Complex | ⚠️ |
Step 3 – Generating the Act-One Video
You can hover over the duration modal to see the calculated credit cost before generating.
Click the Generate button after confirming that you’re content with the selected inputs and credit costs.
Your video will begin processing in your current session, where each video will be available for review once complete.
Understanding Act-One Pricing
Act-One charges 10 credits per second with a minimum of 5 seconds. This means that driving performance videos under 5s will result in a charge of 50 credits.
After the 5 second minimum, each additional second is charged 10 credits, with partial seconds accounted for and rounded up to the nearest decimal. In example, a 5.6s driving performance would be charged 56 credits.
Reiterating and Troubleshooting
Most issues or errors will be specific to your driving performance or character reference image inputs and can be resolved by ensuring that the inputs follow the recommended best practices.
Below is a list of Act-One errors and how to troubleshoot them:
Error | Troubleshooting |
Unable to detect a human face in your video. | Ensure driving performance is properly lit and the face is unobscured and centered in frame. |
Unable to detect a human face in your image. | Ensure character image follows best practices. |
An error occurred while detecting a human face in your video. Please try again later. | Ensure driving performance contains minimal body and background movement. |
We detected too much movement from your video. | Ensure driving performance contains minimal body and background movement. |
We detected unusable audio from your video. | Ensure the audio of your driving performance complies with our Trust & Safety standards. |
This content was flagged by our moderation policy. | Ensure the character image complies with our Trust & Safety standards. |
There may be cases where you don’t encounter an error before generation but receive an issue in your output. These edge cases can generally be resolved by following the best practices or re-running a generation:
Issue | Troubleshooting |
Face improperly detected | Use a character image that follows best practices. |
Intermittent artifacts | Re-run the generation. |