When your process_frame() is called, you can (but don't have to) call getNumInputs() to see how many inputs are upstream of you, getInputInfo() to query the track/item of a given input, and renderInputVideoFrame() to render a given input's frame. Note that by "input" I mean "video media file" or "video processor".
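To make the call pattern concrete, here is a rough sketch of what a process_frame() that walks its inputs might look like. Only the three function names come from the description above. Everything else is an assumption for illustration: `MockHost`, `InputInfo`, `Frame`, the parameter lists, and the return types are hypothetical stand-ins, not the real API's signatures.

```cpp
#include <cstddef>
#include <cstdio>
#include <utility>
#include <vector>

// Hypothetical types: the text names the three host calls but not their signatures.
struct InputInfo {
    int track;  // track the input lives on
    int item;   // item on that track
};

struct Frame {
    int width;
    int height;
};

// Mock host standing in for whatever object really exposes these calls.
class MockHost {
public:
    explicit MockHost(std::vector<InputInfo> inputs) : inputs_(std::move(inputs)) {}

    // How many inputs (media files or processors) are upstream.
    int getNumInputs() const { return static_cast<int>(inputs_.size()); }

    // Fills *out with the track/item of input idx; false if idx is out of range.
    bool getInputInfo(int idx, InputInfo* out) const {
        if (idx < 0 || idx >= getNumInputs() || out == nullptr) return false;
        *out = inputs_[static_cast<std::size_t>(idx)];
        return true;
    }

    // Pretends to render input idx; a real host would produce actual pixels.
    bool renderInputVideoFrame(int idx, Frame* out) const {
        if (idx < 0 || idx >= getNumInputs() || out == nullptr) return false;
        *out = Frame{640, 360};  // placeholder frame size
        return true;
    }

private:
    std::vector<InputInfo> inputs_;
};

// Queries and renders every upstream input; returns how many were rendered.
// All three host calls are optional, so a processor could also skip this loop.
int process_frame(const MockHost& host) {
    int rendered = 0;
    const int n = host.getNumInputs();
    for (int i = 0; i < n; ++i) {
        InputInfo info{};
        if (!host.getInputInfo(i, &info)) continue;
        Frame frame{};
        if (host.renderInputVideoFrame(i, &frame)) {
            std::printf("input %d: track %d, item %d, %dx%d\n",
                        i, info.track, info.item, frame.width, frame.height);
            ++rendered;
        }
    }
    return rendered;
}
```

With a mock holding two inputs, process_frame() returns 2; with no inputs it returns 0 and never calls the other two functions. A processor that generates its own frames could skip the loop entirely, since none of these calls are required.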