If you’re going to record a video of your screen + yourself talking (a “talking head” + screen capture), using Descript will let you handle a lot yourself—so you get something that’s clean, usable, and can be polished later. Here’s a step-by-step you can follow. Use this as your checklist; we (the editor) will take over after certain steps.
Why Descript is a natural evolution from Camtasia → Loom → Descript
Before the how-to, small philosophy:
Camtasia: Powerful screen capture + on-screen editing, but can feel heavy. More training needed.
Loom: Quick & easy, very low friction—great for sharing rough ideas, instant feedback—less powerful for trimming, polishing, adding effects.
Descript: Bridges the gap. You get the simplicity of capturing + basic trimming/mistakes removal, plus more advanced audio editing, text-driven editing, filler-word removal, overdubs, etc. It lets you do enough to stand up, so the editor doesn’t have to build everything from scratch. Good sweet spot.
Thus: you get more control, faster turnarounds; we (the editor) spend less time on the basics, more time on the polish, branding, visuals.
How to Record & Prepare (Your Part in Descript)
Here’s what you do so that handing off to me (or another editor) works nicely:
1. Plan your content & script.
Write a rough script of what you'll say and show. Mark when you’ll switch to screen actions and when you’ll use on-camera narration. If you don’t want to script every word, even simple bullet points will help you stay focused. This keeps your pacing tight and avoids rambling or long pauses.
2. Set up your environment.
Find a quiet room with no background noise. Use a decent mic. Make sure your background is clean and your lighting is good for your talking head shot. On your computer, turn off notifications, close unused tabs, and if possible, use a demo account or dummy data in the software you’re recording.
3. Configure Descript properly.
Inside Descript, choose the right resolution (1080p at 30fps is a safe standard). Select “Screen + Camera” capture mode, test your mic levels, and make sure your talking-head overlay isn’t covering important parts of the UI. Also zoom in on the software a bit so viewers can easily see what’s happening.
4. Record in passes if needed.
You can record everything live—screen, talking head, and narration all at once—or you can break it into steps. Some people prefer to record the narration first and then play it back while capturing the screen to match the timing. Both approaches work; it depends on your comfort level.
5. Do basic editing in Descript.
After recording, take a few minutes to clean things up. Use Descript’s filler word removal to get rid of “ums” and stutters. Trim long pauses. If you know you want certain graphics or text callouts later, add markers or notes inside the project. You can also tweak exposure on the camera feed or reduce background hiss in the audio.
6. Organize and export your assets.
Decide whether you want to share your full Descript project with me (recommended) or export separate files. If exporting, clearly label screen-only and camera-only versions. Gather your brand assets—logos, fonts, color codes—so I can integrate them smoothly.
7. Provide context and instructions.
Send me your script or bullet points, the Descript project link, and any style guides. Let me know what platform this is for (YouTube, social media, training, etc.) and what the deadline is. If you have sample videos you like, include them—it saves a lot of guesswork.
What I’ll Do (Editor’s Part)
Here’s what I take over after your prep, so we can both agree on responsibilities.
Polishing the UI animations / effects / transitions / highlighting clicks, cursor paths.
Add titles / lower thirds / callouts / graphics / annotations.
Fine-tune audio: equalization, leveling, background noise removal.
Color grading / matching camera feed to feel consistent.
Ensuring render settings are optimal for the intended platform (bitrate, resolution).
Final quality check: sync, etc.
Feedback round: I’ll send you a draft, you review, ask for tweaks, etc.
Workflow Preferences: Because One Size Doesn’t Fit All
Not everyone works the same way when creating content. Some people thrive on spontaneity, others need structure. Knowing your natural style helps you choose the best recording process.
On-the-fly communicators
These are people who like to hit “record” and just go. They capture the screen and their narration together, with minimal scripting. This approach feels authentic and energetic, and Descript’s filler-word removal and trimming tools can clean up most rough edges. The tradeoff is that timing between your narration and the screen actions can sometimes drift, which might require extra editing or even a few retakes.
Scripted or rehearsed creators
This style starts with a full script. You may even record your narration first, then capture the screen while listening to your voice to line everything up. It produces very clean, efficient videos because your pacing is planned from the start. The downside is that it can feel a little stiff if you over-rehearse, and it requires more upfront preparation.
Mixed or hybrid approach
Many people land in the middle. They jot down bullet points, rehearse a bit, and record in segments. This way you get a balance between natural delivery and structured timing. Descript works well here too—you can drop markers, trim on the fly, and keep the flexibility of improvisation while staying anchored by your plan.
Checklist You Can Send Me / Use Before We Start
Use this so both of us are aligned. You can check things off.
Script or bullet points ready, with cues for screen actions.
Mic & camera tested; lighting good for camera shot.
Screen is clean; subscriptions / notifications off; software in demo mode or with dummy data if needed.
Capture is set up in Descript: Screen + Camera mode; correct resolution / frame rate.
One test recording done, both screen & camera, with your typical pace. Review it: is audio clear? Is camera well-framed? Any UI detail cut off?
Basic editing done by you: filler removals, rough trims, markers where you want edits.
Brand assets / style guides collected and sent.
Priority list: what parts are most important (e.g. screen clarity, talking head polish, graphics) + deadline.
Why Doing This Prep Changes Everything
Speeds up editing & lowers cost: Less “fixing mistakes” equals less time.
Better quality: fewer surprises (audio issues, bad lighting, hidden UI) means less frustration.
More consistent output: Over time you’ll build up a style; doing things similarly each video helps build your brand.
More freedom: With Descript you have control to tweak small things yourself between edits (so you feel ownership), while letting the pro handle the heavy lifts.