The embodiment of the present application discloses a video processing method, device, equipment, and storage medium. The video processing method involves technologies such as artificial intelligence, cloud computing, computer vision, and machine learning. The video processing method includes: acquiring the target video to be processed ; Extract a frame sequence from the target video, the frame sequence includes N tested video frames, and N is an integer greater than 1; call the watermark detection model to perform watermark detection on the frame sequence, and obtain the watermark indication of each tested video frame; from N M watermark indications are selected from the watermark indications for time-domain joint discrimination processing to obtain watermark data of the target video, where M is an integer greater than 1 and M≤N. By adopting the embodiment of the present application, the time-domain multi-frame joint detection can be performed on the frame sequence of the target video, effectively reducing calculation redundancy, improving the efficiency of video watermark detection, and improving the accuracy of video watermark detection results.