You are here

This ATEM (mini) Tips” series of short video tutorials for your Blackmagic ATEM hardware includes tips for all users, from beginner to advanced. Scroll down to see more in the series!

This is a collection of all PhotoJoseph LIVE shows, from the PhotoJoseph YouTube channel. Additional episodes are listed below!

How to Bring LIVE Callers on Your LIVE Show!

Photo Moment - September 29, 2021

If you've ever wanted to bring a live caller, from Zoom for example, into your LIVE show, using your ATEM Mini , then this ATEM (mini) Tip will explain exactly how to do it!


(see video for navigation)

  1. What hardware you'll need
  2. Configuring the hardware
  3. Configuring Zoom
  4. Creating SuperSource layouts for Zoom caller
  5. Configuring audio monitoring (super important!)
  6. Monitoring audio with na ATEM Mini Pro (not Extreme)
  7. Sending clean video (just you; not Program!) to Zoom
  8. No SuperSource? A better layout on ATEM Mini Pro (not Extreme)

Gear listed in this video

Need an ATEM Mini?

What this tip is about…

Using your “real” camera for a zoom call with an ATEM Mini is easy — just plug the camera into the ATEM, and the ATEM into your computer! But what if you want to bring that live caller… on a live show? That's what this tip is all about. Outlining every step, including how to wire it, how to setup the computer, how to set up zoom, where to position the windows, how to build a super source or PiP (Picture in Picture) to make it look good… and most importantly, how to monitor the audio!!

📄 Full Video Transcript


Past Shows

Leave a comment on YouTube

Comments from YouTube

Couldnt you just connect another webcam to the laptop instead of using the Flint 4DP solution? Feels like a much chesper and simpler solution.

Thank you for the great content!!!
If a webcam is good enough, yes. But this method allows your guest to see the real camera, looking in their eyes, etc.
Good to know that the Zoom's internal software deals with the zoom speaker's echo/feedback. I thought for sure you were going to have to split the audio out of HDMI feed to prevent that. That little gem really makes the setup much easier. Thanks so much.
it surprised me, to be honest. But I've tested in many scenarios and it works!
Great video ! Very helpful 😊
Any way to do this with Skype, which probably doesn't have as good a feedback rejection as Zoom?
Of course; same workflow. But as you pointed out, you may need headphones.
@PhotoJoseph Thanks, but will the Skype guest hear themselves delayed, even with headphones? I'm not sure if Skype is as good as Zoom.
@Sunrise Water Media in some ways Skype is better. Better image quality. Don’t send the guest program audio if they get a delay. Use the routing software and only send them your mic.
@PhotoJoseph Ok, that makes sense. Are you referring to audio routing software of the Atem? Is that an option?
10:17 Betty looks scary 😜
But what if there are also other audiences on Zoom as well. Here is the issue, when 1 of the guest speaker talks, Zoom will spotlight him on the other audiences screen.

And if I were to spotlight myself, i would have to lose the pins created on Aaron and the Manequin. Is there a way to work around this?
What your call-in guests sees is up to them, but it has no effect on your setup. If the pin or don’t pin, see a self view or not, is up to them and irrelevant to you. And what the audience sees is only what you send them, not zoom.

This is basically what I needed to broadcast.. Zoom Seminar and make layouts out of the Zoom's participants video...

BUT... I'm still confused what I should purchase. ATEM Mini Extreme or ATEM Mini Extreme ISO?? 🤔
The ISO means you have a recording of every input for re-editing later. Very convenient
Fantastic training video, thank you! Are you familiar with Streamyard? Would you happen to know if we swap out Zoom Conferencing for Streamyard if this setup and configuration would work much the same?
Oh and... I forgot to give a well earned nod to Betty and Aaron. 😊
It should work the same, yeah. The basics apply to any video conferencing app. The hugest challenge is if you do multiple callers and their positions on screen aren’t consistent.
Life is so much richer with you imparting such great knowledge. Amazing info.
Aww shucks ☺️
Your demos are always so awesome and clear, especially whenever you share your Mac desktop. How do you get that so sharp? I do a lot of Zoom calls and when I share my desktop via the ATEM, the text and icons are little blurry. I have to resort to sharing my screen from within Zoom. Any suggestions?
That’s because through zoom, sharing your screen through the ATEM means that it’s no different than a webcam, which zoom defaults to a very low resolution. You can actually now ask zoom (via email tech support) to grant you 720p but that’s still lower than the 1080p you’re expecting. In these videos I capture my screen using Screenflow, but that doesn’t help for a live situation.
Took me a sec to realize that wasn't a real person "Betty".
Are you for hire?
Buckle your seatbelt as I'm going to be hitting your channel hard over the next few weeks.

I've been "hired" to do a livestream for a local non-profit (raising money to help find a cure for childhood cancers). I did it last year too, but that was with a bunch of webcams and while "good"... Not what it could have been, so... I've bought a couple Canon cameras and looking to buy the Atem Extreme ISO.
Awesome and welcome! FYI if you join the channel as a member you will have access to a private Discord server which could be a good place to ask questions, and of course you’ll be helping support the channel! Details at — also if you really need help, you can book 1:1 sessions with me
Joseph, such a great video. I am learning my ATEM extreme ISO and use zoom with work and OBS. Hope you are reading comments - QUESTION - Do you have specific SuperSource overlay tutorials? I need to use super source with 2-3 people on zoom and me on the BMPCC 6K and ATEM. This would be if I did a panel conversation for an internal meeting/conference. Don't need to stream it out. This would keep OBS/eCamm out of things and really simplify my workflow for this use case.
Thanks for watching. I don’t have a specific supersource video and this one probably covers it the most. What specifically do you need help doing? FYI if you join the channel as a member you will have access to a private Discord server which could be a good place to ask questions, and of course you’ll be helping support the channel! Details at
Great content - as always 🙂Thank you very much
Once again thank you very much, your videos are really helpful
Thanks this was very helpful, I will need to fiddle around with it to try it. We used to have a lot of guests on our live stream over the lockdown when we were using OBS but I bought an Atem mini pro to upgrade the stream but couldn;t work out how to get them on. We did 500 consecutive daily live streams before dropping down to just 3 a week.
@PhotoJoseph OK thanks but how and why would I do this? I had nothing but problems with OBS, mainly the encoding freezing up all the time (despite many settings changes form various tutorials), it was so unreliable, that's why I switched to the hardware encoder in the Atem Mini Pro.
@The Jive Aces got it — well if you’re missing features like the call-in, you could use OBS just for that, feeding the program out of a computer into the ATEM. Let the ATEM handle encoding and graphics overlays. Just use OBS for the features you don’t have in the ATEM.
@PhotoJoseph I just tried this all out and the caller does actually hear themself.
@The Jive Aces this is getting way too detailed to cover here, mate. FYI if you join the channel as a member you will have access to a private Discord server which could be a good place to ask questions, and of course you’ll be helping support the channel! Details at
@PhotoJoseph OK understood, thanks for your time. I will check out the Discord.
WOW - I like that, however I'm a beginner and need some help...!
This is definitely not a beginner setup but all the info you need is here!
@PhotoJoseph Are you for hired?
@William Stewart
Excellent video and you’re the first To address this topic in any accessible detail IMHO. I’ve been looking for help on this since I picked up an Atem Mini.

My personal use case is to bring up to 3 guests in via Zoom and get the benefit of customised layouts rather than the default Zoom view. I also want to bring in pre recorded video and some desktop sharing. Not much of a wish list right 😉

I’m now in the early stages of working with Extreme ISO and Roadcaster Pro.

From what I understand using so called Zoom drones might be the way to go i.e. dedicated laptops with a pinned video for each caller coming in on one of the Atems inputs? The other challenge being audio…

Also for bringing the video back in would something like an Elgato Cam Link 4K work?

Your video has certainly got me further along on my understanding of what could be possible.

Thank you and look forward to more content on this if possible please.
You’re on the right track. Dedicated computers for each caller is the best way to handle it. They don’t gave to be special — just enough to run Zoom. Then manage each computer through screen sharing software like Jump. A cheap windows NUC, or apparently zoom runs on Linux, so maybe that’s an option too?
Great post. Very helpful. I notice you reply to questions more often than most. So, my question: how many Zoom participants can be accommodated by your ATEM setup?
I try! So, if you want to break them out exactly as I did here, then since you have four super sources, you could do four callers this way. But keep in mind that with four callers, each will be only ¼ of your screen, so you really won’t want to scale them up to go full screen or even much bigger than ¼ screen or they will look rubbish. Plus one real risk is that if a caller drops out for any reason, the windows will rearrange (which I think I mentioned in this video), and suddenly your super source layouts will break. Even if they reconnect, there’s no guarantee that they will be put back in the same positions. If you do you want to use zoom as a more professional tool for bringing in guest callers, there is a better way but it requires more hardware. In brief, you’d set up a separate computer for each caller and display each of them full screen (the screen being the ATEM input). Audio mixing still sucks because you can’t separate audio for each caller, but at least the video will be solid and full screen resolution.
Great video. Question: I have multiple hosts in the same room and we are bringing one guest through Zoom. I use a sound transmitter from the headphone jack on the atem to individual receivers that each of my host have. When there is not guest, it works great because they can hear each other speak however, when I bring the zoom guest in, I can only turn on the zoom guest sound on and they lose the option of hearing each other speak. Not sure if this clear but any solution?
You’ll likely need to use a hardware mixer. Route everyone’s audio into the hardware and route from there. Audio gets quite complicated in situations like that.
@PhotoJoseph I use a RodeCaster Pro but it is quite challenging because I have to bring the sound of the Zoom caller in the room for the audience to listen and I also have to mix it into the streaming for the live audience. Not sure if I can manage.
I can not confirm your statement about audio delay directly on camera or input of the ATEM. As i notices with the ATEM Minis and also TVS Models when audio comes directly over camera attached microphones you normally will get no delay because it comes over the same way gets same processing. When using HDMI on the ATEM Minis and feed Audio into the inputs, there is a bigger delay because the signal processing via HDMI needs some time and is mostly some frames delayed. This is why BMD brought an internal delay to audio setting of Input 1&2, so you can correct the delay until 8 frames to match the video.
I'm not sure I understand what you're saying… you are confirming what I've said in this video. If the audio comes WITH the video, then it's in sync, but delayed from realtime. If the audio comes in SEPARATELY from the video, then the audio will be realtime, but the video is delayed, and therefore they are not in sync.
@PhotoJoseph OK i got it - sorry i miss understood what you explaining with "delayed from realtime". Of course the whole signal over camera is delayed from realtime, but it's in sync. When Audio comes over separate inputs it is out of sync to the video - but you are right - it's not delayed to what's outside the ATEM going on. That's a bit confusing.
Do you have a good way to building macros
I have MANY video on macros. Search youtube for “PhotoJoseph ATEM macros”
We can monitor audio from ATEM MINI (pro) using a headphone output of multi view monitor via hdmi or using hdmi audio extractor
Correct. But you don’t get the “solo” option without the Extreme
Awesome video, Joseph! Thanks so much! – Didn‘t quite get, though, what you said around 16:45 regarding the splitting of the Zoom feed that shows two guests into two HDMI inputs on the ATEM. Could you elaborate a bit here? 😳
It's a single HDMI input with both guests side by side (the two guests pinned in zoom), then I'm cropping out guest 1 and guest 2 in two different supersource "sources" (both looking at the same input, but cropping different pieces of it), then arranging those in whatever layout you want.
@PhotoJoseph Ohhh! I see, nice! I‘m new to the ATEM Mini Extreme & Super Source (coming from an ATEM Mini Pro) and was under the impression that each of the max. 4 boxes has to have a different input source! 🤦‍♂️ 😂 Thanks so much for taking the time & answering!! Much appreciated! Really like your channel! 👍👍
Thanks Stefan, and glad I could help!
@PhotoJoseph I had the same question as @Stefan Wolfrum ... thanks for elaborating. Just to confirm, so in order to take Aaron or Betty full screen, they need to be enlarged significantly beyond 100%?
@tvpierce1 if you set it up as I did, then yes. You’re scaling up an image that’s basically ¼ of the screen. If you want better quality then you need a dedicated display for each guest.
Fantastic tutorial. Learned a ton.
awesome, thanks!
I have a ATEM MINI EXTREME ISO but for the capture card could I just use my ATEM MINI ISO or should I just use my Elgato HD60+. I do like that Flint D4P though.
Oh totally, you could loop that output through another ATEM!
@PhotoJoseph ok cool. I'll give try this weekend for sure. Thanks Bro
Great video!! Thanks 😊
Thank you!
Such a great demo. THANKS for sharing it. Your style is so much more enjoyable than other videos trying to show off the ATEM features.
Thank you Ant! I really appreciate that
Very detailed great content as always 😊 thank you very much for sharing, really appreciated.
my pleasure, Xin!
Thank you @PhotoJoseph! You've definitely got some gears turning in my head. I'm going to try some slicing up of Zoom participants and manipulating with SuperSource. QUESTION: Is the Flint D4P your "Capture Card" of choice for M1 Series Macs?
Not necessarily… it’s just one a company sent me to check out. I only broke it out for this video. Nice thing is that it’s 4K and it has dual inputs. You could use it as a 4K input for a live stream using OBS, if you wanted.
Would an Elgato cam link work?
@Neville Henderson it definitely should
Brilliant, well researched and expertly presented as always - thank you so much!
Thank you Gerry!
I use meeaudio’s M6 IEMs with a Stereo to mono cable in Clear. It loops over the ear, and I choose the ear that doesn’t face the camera as much. It’s very discreet, kinda like what TV broadcasters use, but cheaper.
A lot cheaper! Nice one.
PhotoJoseph - as always entertaining and informative - I just want to confirm (since I just set up a mix-minus configuration) - this setup eliminates any need for a mix-minus audio feed to the caller, correct ? Are there still instances in which the mix-minus will be useful or even required? Thank you for all the work you've done, it has certainly helped me.
Thanks Billy! I’ve done this setup with Zoom and Skype and never needed a real mix-minus, which is pretty amazing!
@PhotoJoseph Very kind of you to get back to me - are you aware of a way for musical artists to perform together w/o latency?
@Billy Mac Songs I’ve heard that services exist but never tried one. SRT is the lowest latency system I know of but I don’t know if it’s good enough for music.
Great Video. Which lens would you recommend for the g7?
Thanks Brandon! I was using the kit lens here; it’s a 12-23 or something like they. Tiny. But if you’re spending money, I really like the 12-35/2.8. That’s what I use in my main office camera setup.
Excellent probably isn't a fair description. Brilliant maybe is better.
Why thank you, Mr. Black 🙇🏼
This is a great explanation and another good idea for using an accidental “feature” of Zoom. You’ve given me a great idea for recoding a podcast this way. I have only a basic Atem Mini so I also use OBS and an HDMI splitter to get the second screen displayed and also into the Atem, so I can route it all back into the Mac; on the Mac I then have OBS to gather sources in scenes and use the OBS virtual camera to feed out to MS Teams or Zoom - or to record. I then have the flexibility to switch sources on the Atem (cameras, iPad, HDMI)… or in OBS where I can add video, other webcams, or make other changes with filters. I have also picked up a few old Nikon D7100s that output clean hdmi and can be set to use ext power and run all day - they are 4:3 not 16:9 though so need aspect shifting in OBS; but then, that’s already how I mix my video. I have no idea how it all hangs together, but it seems to work. And all through videos like yours and Aaron’s to give ideas of how you can use different tools together; thanks!
I love it… it's so cool to see what people have hacked together over the years! Awesome, and thanks for watching, Richard!
Update : it worked! Thanks for the tip. And I have found out how to get 16:9 from the Nikon too.
Sony a6100 is cheaper and has mic input.
New Sony e10 is also cheaper.
Also, I believe Rx100 (starting mark iV) have it, and is about 200-300 dollars.

Btw, besides mic input and clean hdmi, you also need no-sleep battery power. That mean using dummy battery most of the time.
Not familiar with those cameras; I'll check them out, thanks! As to no=sleep, is that even a thing… every camera I've used has the option to disable sleep
The a6100 isn't cheaper. G7 is going for $598 w/ kit lens. A6100 is $710 with no lens. The ZV-E10 is $798 w/ the lens.
Hi Joseph
I just ran a hybrid event with remote presenters coming in via zoom, and the event going back out to YouTube, and the audio kept dropping out on YouTube. We did have some bandwidth issues, but it didn’t seem to effect the video, only the audio. My zoom audio was going into my stream from the atem extreme iso, via a seperate mixing desk, and I was monitoring audio on the atem which was fine coming from zoom but not good once it reached YouTube. Any tips on improving this, much appreciated.
Great content BTW
Hm that's tough to troubleshoot… did you have the "use original audio" option enabled? Just to remove any processing by Zoom? And how was it getting to YouTube; via Zoom's own publish to YouTube option>
@PhotoJoseph hi, I dont think I did have the ‘original audio’ option set, and noticed that in-your tutorial, I hadn’t realised the benefit of disabling Zoom audio processing. The audio was getting to YouTube through a Rodecaster connected to the Mac mini which was running the zoom call and I was using an output from the rode to the extreme iso, then streaming directly to YouTube from the iso.
Got it. Lots of pieces. The “original audio” is what I’d try next!
Thanks for that great explanation!
Do you have some experience with blackhole for the audio-routing?
I don't! What does it do?
I used the Blackhole routeing on Mac until I bought a Scarlet 2i2. Blackhole allows you to route back audio and works well with amalgamated sources in the midi settings; except MS Teams didn’t like that much. The 2i2 gives a headphone out and audio for monitors and takes audio input from an external mic. That mic can be routed, or selected as the feed to Zoom, Teams, etc. Adding OBS and sending different scenes through its virtual video camera to Zoom is also a neat trick. OBS then also allows me to record.
Awesome as always. One suggestion! Loopback is fantastic, but you can also make a Aggregate output in a Mac with the AudioMIDI app. It's not as user friendly as loopback, but it will work a-okay if you need to save some pennies.
I totally forgot about that, Kyle. But it'll do the same thing? Create a virtual interface that I can pick up in another app?
After watching this video, I realise how novice I am in doing my live shows esp on audio during a mix of live real audience and live virtual audience.

Just can't get the audio right.
Audio is HARD, for sure! What are your common issues? I definitely need to do some videos on audio for the ATEM.
@PhotoJoseph Thankyou. I think my issue is on hybrid events. The way to go about is to have a mixer with an aux out and choose which audio to send.
PhotoJoseph I watch most all of the You Tube channels that have livestream content. I have to say you are the best at demonstrating how to videos. Most channels tell me how a piece of hardware works. You show how it works in detail. That sets you apart from the others. Glad to see you back at it again! Thanks
oh thank you so much Roger, I really really appreciate that! By all means, please help the channel grow… share the channel, and your enthusiasm! 🤗 Viewers like you make this all worthwhile.
As usual, you bring this into simplicity Thank you very much Joseph, I really enjoyed it
Thank you Ilan!
Using Zoom's noise canceling to do mix-minus audio - genius! Lots of good tips here Joseph, thanks.
I keep expecting it to fail… but it really really works! Of course what the ATEM needs is a mix-minus, but that's another story ;-) Thanks for watching, Gil!
Great detailed explanation! It's almost exactly like how we run Zoom calls during livestreams, so very nice to have confirmation on this method.
Oh sweet! What are you doing differently (since you said "almost exactly")? Thanks for watching, Jorg!
it’s too complicated, why not just using the laptop? Zoom is already self contained.
because the quality of Zoom to YouTube is complete rubbish, AND you have zero control over the layout, AND you don't have everything else you can do on the ATEM.
Have you tried bone conduction earphones? You can listen to your device while still being able to hear what's going on around you/
I never have, actually. They work well? Isn't that what Jabra was back in the day… I remember those being totally trash, but that was a good decade ago! Do you have a brand you like?
@PhotoJoseph I've only got bluetooth connected bone conducting earphones and have found Aftershokz to work well. A limitation is that they are the opposite of noise cancelling so great in low ambient sound. Near traffic when I'm running they are ok for music but not speech.
@PhotoJoseph Of course, your standard for acceptable earphones may be much higher than mine.
ahh right. Being bluetooth limits their usability for a broadcast setup, too. Thanks though!
Want more ATEM? How about 4+ hours of pure ATEM training?
Watch this: (also this is my LIVE channel; be sure to sub there, too!)
You may login with either your assigned username or your e-mail address.
Passwords are case-sensitive - Forgot your password?