Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

51 results about "Computational vision" patented technology

Method for the reduction of image content redundancy in large image libraries

A method of increasing information content for content-based image retrieval (CBIR) systems includes the steps of providing a CBIR database, the database having an index for a plurality of stored digital images using a plurality of feature vectors, the feature vectors corresponding to distinct descriptive characteristics of the images. A visual similarity parameter value is calculated based on a degree of visual similarity between features vectors of an incoming image being considered for entry into the database and feature vectors associated with a most similar of the stored images. Based on said visual similarity parameter value it is determined whether to store or how long to store the feature vectors associated with the incoming image in the database.
Owner:UT BATTELLE LLC

Mobile robot and simultaneous localization and map building method thereof

A simultaneous localization and map building method of a mobile robot including an omni-directional camera. The method includes acquiring an omni-directional image from the omni-directional camera, dividing the obtained omni-directional image into upper and lower images according to a preset reference to generate a first image, which is the lower image, and a second image, which is the upper image, extracting feature points from the first image and calculating visual odometry information calculating visual odometry information to track locations of the extracted feature points based on a location of the omni-directional camera, and performing localization and map building of the mobile robot using the calculated visual odometry information and the second image as an input of an extended Kalman filter.
Owner:SAMSUNG ELECTRONICS CO LTD

Computerized eye testing and exercises

A method, apparatus, and software for exercising human eyes with a monitor onto which is projected a plurality of shapes such that portions of the shapes have a contrast changing at a speed less than or equal to approximately 2.0 cycles / sec. The shapes comprise paired shapes of opposite colors (black / white, red / green, or blue / yellow, or combinations thereof), and the speed is preferably less than or equal to approximately 0.8 cycles / sec. Also a method, apparatus, and software projecting a plurality of symbols each comprising a plurality of bars one of which has a length different than that of others in the symbol. A visual efficiency is calculated based upon a number of identical symbols correctly located by a user and a time to locate the identical symbols.
Owner:COUTURE PAUL M

3D monocular visual tracking therapy system for the rehabilitation of human upper limbs

It is described a 3D monocular tracking system, being robust, having low cost, easy to install and use, useful for the upper limbs rehabilitation in a patient in need thereof, as well as a home self-directed therapy method for patients having upper limbs' movement disability. The system comprising a) a handle or gripper; b) a computational vision system comprising a video camera; c) software comprising a set of games; d) a processor; and, e) a display apparatus.
Owner:NAT INST OF ASTROPHYSICS OPTICS & ELECTRONICS

GPS-fused robot vision inertial navigation integrated positioning method

The invention discloses a GPS-fused robot vision inertial navigation integrated positioning method. The method comprises the following steps: extracting and matching feature points of left and right images and front and back images of a binocular camera, and calculating three-dimensional coordinates of the feature points and relative poses of image frames; selecting a key frame in an image stream,creating a sliding window, and adding the key frame into the sliding window; calculating a visual reprojection error, an IMU pre-integration residual error and a zero offset residual error and combining the errors into a joint pose estimation residual error; carrying out nonlinear optimization on the joint pose estimation residual error by using an L-M method to obtain an optimized visual inertial navigation (VIO) robot pose; if the GPS data exist at the current moment, performing adaptive robust Kalman filtering on the GPS position data and the VIO pose estimation data to obtain a final robot pose; and if no GPS data exist, replacing the final pose data with the VIO pose data. According to the method, the positioning precision of the robot is improved, the calculation consumption is reduced, and the demands of large-range and long-time inspection are satisfied.
Owner:NANJING UNIV OF SCI & TECH

Using image similarity to deduplicate video suggestions based on thumbnails

A system and computer program product are provided for improving the utility of video recommendations in a content system via de-duplication of highly similar thumbnail images. For each video added to an online content system, a thumbnail image is generated and stored. For each such thumbnail image a compressed representation is computed. During playback of a video, a set of related videos is generated. For each video in the set, the corresponding thumbnail image and its compressed representation are retrieved. A measure of visual distance is computed for each pair in the set of representations, and measures indicating excess similarity are identified. Similarity is reduced via selective removal of some of the representations. An identification of the thumbnail images and videos corresponding to the remaining representations is produced.
Owner:GOOGLE LLC

Road vehicle tracking method

The invention discloses a road vehicle tracking method, belonging to the technical field of computational vision processing. As that image between frame of the binocular camera is more than that of asingle purpose, the invention proposes a multi-angle optical flow characteristic with strong description ability to replace the original multi-surface optical flow characteristic, effectively solves the technical problems of wrong heel, drift and the like when vehicles with similar appearance and close distance are occluded from each other.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Method for quickly checking video

The invention relates to the field of multimedia information (pictures and videos), machine vision and computational vision, in particular to a method for quickly checking a video. The method for quickly checking the video is characterized in that a system automatically distributes parameters to finish lens boundary extraction or similarity filtering, then, extracted boundary frames are combined into a short video, and finally, a checking person or an automatic system checks the short video. The method comprises the following steps that: S1: inputting a checking type, and uniformly inputting the checking type of a video to be checked into a system preprocessing system; S2: extracting a boundary lens, and carrying out similar filtering; S3: carrying out video synthesis, utilizing an open source tool ffmpeg (for example: ffmpeg-fimage2-i%d.jpg-vcodec lix264-rNtest.mp4) command to generate a video of which the frame rate is N frame / s; and S4: checking and outputting the video, and outputting the video of which the frame rate is N frame / s so as to bring convenience for manual checking or system checking.
Owner:央视国际网络无锡有限公司

VSLAM method, controller and mobile device

The invention relates to a VSLAM method, a controller and a mobile device, and belongs to the field of mobile devices. According to the VSLAM method, when a visual relative pose is calculated, calculation is carried out according to two-dimensional coordinates of mutually matched feature points between a successfully matched image and a key frame. According to the invention, the success rate of calculating the visual relative pose can be improved, so that the accuracy and the operation speed are improved during positioning and mapping.
Owner:SUGAN TECH BEIJING

A 3D road vehicle tracking method

The invention discloses a 3D road vehicle tracking method. The invention belongs to the technical field of computational vision processing. A 3D spatial feature and 2D image feature of an object are combined with an MDP model. The evaluation function is reconstructed, and a multi-target tracking method based on 2D and 3D joint features is proposed. The original 2D image domain tracking is extendedto 3D spatial domain tracking, which effectively solves the technical problems such as mistracking and drift when vehicles with similar appearance and close distance occlude each other.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Image cartoon processing method and device, electronic equipment and storage medium

The invention discloses a cartoon processing method and device for an image, electronic equipment and a storage medium, belongs to the field of computers, and particularly relates to computing vision,image processing, face recognition and deep learning technologies in artificial intelligence. According to the implementation scheme, skin color recognition is carried out on a to-be-processed face image, and the target skin color of the face is determined; if a cartoon model set does not contain a cartoon model corresponding to the target skin color, any cartoon model is used for processing theface image, and a reference cartoon image corresponding to the face image is obtained; a pixel adjustment parameter is determined according to the target skin color and the reference skin color corresponding to any cartoon model, and the pixel value of each pixel point in the reference cartoon image is adjusted based on the pixel adjustment parameter to obtain a target cartoon image. Therefore, when the cartoon model set does not contain the cartoon model corresponding to the target skin color, the cartoon processing of various skin color objects can be realized by utilizing the existing cartoon model, and the application range is expanded.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Method for solving polymorphic statement video positioning task by using space-time graph reasoning network

The invention discloses a method for solving a polymorphic statement video positioning task through a space-time graph reasoning network, and belongs to the field of natural language visual positioning. According to the method, firstly, a video is parsed into a space-time region graph, and the space-time region graph not only has implicit and explicit space sub-graphs of each frame, but also has across-frame time dynamic sub-graph; next, a text clue is added into the space-time region graph, and multi-step cross-modal graph reasoning is established; the multi-step process may support multi-order relational modeling; and thereafter, a temporal boundary of the pipeline is determined using a temporal locator, then the object is located in each frame using a spatial locator having a dynamic selection method, and a smooth pipeline is generated. According to the method, the video does not need to be trimmed when the natural language is positioned, so that the video positioning cost is reduced; and question sentences and declaration sentences can be effectively processed, technical support is provided for higher-level natural language processing and computational vision combined research(such as video questions and answers), and the application prospect is wide.
Owner:ZHEJIANG UNIV

Real-time data processing pipeline and pacing control systems and methods

A data processing system includes a transaction bus, a console application in communication with the transaction bus, and a view predictor subsystem in communication with the transaction bus. The transaction bus receives, from a user application executing on a client device, a call for visual information to be provided to the user application. The view predictor subsystem determines a likelihood that the visual information will be viewable within a viewport of the user application, and a plurality of respective values for a plurality of sources of the visual information are computed based on the likelihood and a respective priority for each source. The console application provides to the transaction bus the set of potential sources of the visual information, and the transaction bus selects, based on the computed values, one of the potential sources of the visual information to be the result, which is provided to the user application.
Owner:MICROSOFT TECH LICENSING LLC

Three-dimensional reconstruction precision testing method and device and electronic equipment

The invention discloses a three-dimensional reconstruction precision testing method and device and electronic equipment, and relates to the technical field of high-precision maps. The method comprisesthe steps that three-dimensional coordinates of a first object and a second object in an image sequence in visual point cloud are acquired, the visual point cloud is obtained through three-dimensional reconstruction, and the first object and the second object are objects existing in a high-precision map; calculating a similarity transformation matrix between the visual point cloud and the high-precision map according to the three-dimensional coordinates of the first object in the visual point cloud and the three-dimensional coordinates of the first object in the high-precision map; performingcoordinate transformation on the three-dimensional coordinates of the second object in the visual point cloud according to the similarity transformation matrix to obtain first three-dimensional coordinates of the second object; and calculating the precision of three-dimensional reconstruction according to the error between the first three-dimensional coordinate and the three-dimensional coordinate of the second object in the high-precision map. According to the method, existing high-precision map data are fully utilized, only a small amount of calibration needs to be carried out on the imagesequence, and the three-dimensional reconstruction precision test efficiency can be improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Robustness control method and system for multi-source fusion positioning

The invention provides a robustness control method and system for multi-source fusion localization, and the method comprises the steps: carrying out the visual feature point extraction of the same frame of image data of the same scene, obtaining a first pose through visual inertial navigation fusion, and calculating a visual feature point error and a first re-projection error according to the first pose; carrying out inertial navigation and laser radar feature point extraction on the point cloud data of the same frame in the same scene, and carrying out inertial navigation and laser radar fusion to obtain a second pose; calculating a point cloud feature point error and a second re-projection error according to the second pose; calculating a third pose of the laser radar and the camera, and calculating a camera position feature point error and a third reprojection error according to the third pose; and comparing each re-projection error, carrying out weight ranking on each feature point error, and carrying out quality grade division. Based on the method, the invention further provides a robustness control system. According to the method, the resolving speed of the system can be increased, the environmental adaptability and precision of the system are improved, and the robustness of the system is improved.
Owner:北京理工大学前沿技术研究院

An objective evaluation method of stereo video quality without reference based on sparse representation

The invention discloses a method for objectively evaluating the quality of stereoscopic video without reference based on sparse representation: the stereoscopic video is downsampled to obtain stereoscopic video pair. The left and right viewpoints of stereoscopic video pair are respectively used to obtain the monocular energy magnitude map, and the weighting factors of the left and right views areobtained. Stereoscopic video weighted the two viewpoints to get the visual perception map. Visual perception saliency map is obtained by calculating ROI on visual perception map; dictionary learning.The sparse representation of visual perception saliency graph is used to obtain the coefficient matrix and its entropy. The coefficient matrix is obtained by means of mean, variance and two-norm operation. Choosing the coefficient matrix of video pair and MOS in the video database for training, the training model is obtained. The training model is used to predict the quality of any stereo video, and the final objective prediction value is obtained. The invention makes more comprehensive and accurate objective evaluation of stereoscopic video quality according to the visual perception image andthe sparse representation as a tool.
Owner:TIANJIN UNIV

Visual inertia real-time initialization alignment method and system

ActiveCN112284381AReduce dimensionalitySolving unknown quantities requires less calculationNavigation by speed/acceleration measurementsAlgorithmClassical mechanics
The invention discloses a visual inertia real-time initialization alignment method and system. The visual inertia real-time initialization alignment method comprises the steps of: acquiring a currentvisual frame, and acquiring a plurality of continuous visual frames of the current visual frame and the measurement data of a corresponding inertia measurement unit; calculating visual frame poses, and performing pre-integration on the measurement data of the inertial measurement unit between two adjacent visual frames to obtain integral data; acquiring an ontology frame pose corresponding to thevisual frame, establishing a linear equation set by utilizing the integral data and the ontology frame pose, and iteratively updating a coefficient matrix and a constant term of the linear equation set; solving the linear equation set to obtain a current ontology frame speed and initial visual frame gravitational acceleration; and if an absolute error between a module value of an initial visual frame gravitational acceleration value and a local gravitational acceleration value is smaller than a preset value, and the value of n is larger than the preset value, judging that visual inertia initial alignment succeeds. The dimension of linear equation set is small, the amount of calculation for solving unknown quantities is smaller, the efficiency is high, and the real-time performance is improved.
Owner:BEIJING HUAJIE IMI TECH CO LTD

Image processing method and device, electronic equipment and storage medium

The invention discloses an image processing method and device, electronic equipment and a storage medium, belongs to the field of computers, and particularly relates to computing vision, image processing, face recognition and deep learning technologies in artificial intelligence. The implementation scheme comprises the steps of performing skin color recognition on a to-be-processed face image, anddetermining a target skin color of a face in the face image; if the style migration model set does not contain the style migration model corresponding to the target skin color, processing the face image by using any style migration model to obtain a reference transformation image corresponding to the face image, and adjusting a hue value, a saturation value and a brightness value of each pixel point in a target area in the reference image according to the target skin color, and obtaining a target transformation image matched with the target skin color. Therefore, when the target skin color ofthe human face in the to-be-processed human face image does not have the corresponding style migration model, style transformation processing can be carried out on various skin color users by utilizing the existing style migration model, and the application range is expanded.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Multi-mode fused unsupervised pedestrian re-identification rearrangement method

The invention discloses a multi-modal fused unsupervised pedestrian re-identification rearrangement method. The method comprises the following steps of collecting multi-modal information of pedestrians in a walking process; extracting pedestrian features by using a convolutional neural network model, and calculating visual similarity; constructing image space-time distribution by utilizing the image space-time information; constructing WiFi space-time distribution by utilizing the WiFi information; and the visual similarity, fusing the image space-time distribution and the WiFi space-time distribution, and arranging a pedestrian re-identification sorting result. Multi-modal information is integrated for secondary rearrangement, the method is an effective measure for reducing the search space, and the defect that a traditional pedestrian re-identification method based on visual features is sensitive to the monitoring environment is effectively overcome.
Owner:SOUTH CHINA UNIV OF TECH

Rail-mounted robot anti-jamming detection method and device based on artificial intelligence

The invention relates to the technical field of artificial intelligence, in particular to a rail-mounted robot anti-jamming detection method and device based on artificial intelligence. The method comprises the following steps of acquiring an ROI area image of a photovoltaic cell panel; obtaining a segmentation result graph of a ROI area by using a semantic segmentation network model; obtaining abinary image of an edge frame of the photovoltaic cell panel from the segmentation result image, and judging a first longitudinal edge frame, a first transverse edge frame and a second transverse edgeframe in the binary image; calculating the maximum gap width between the first transverse edge frame and the second transverse edge frame, and averaging the maximum gap width and the measured gap width to obtain the gap width; calculating a visual fall height, and averaging the visual fall height and the measured fall height to obtain a fall height; and when the gap width or the fall height is larger than a set threshold value, returning the rail-installed cleaning robot. According to the method and device, the problems that the robot is stuck due to too large gaps or too high fall, and the distance measurement of an infrared distance measurement sensor is inaccurate are solved, and the accuracy and reliability of a judgment result are improved.
Owner:河南颂达信息技术有限公司

Pan-tilt visual control system and control method

The invention discloses a pan-tilt visual control system and control method, and the control method comprises the steps: obtaining environment information, and obtaining a visual image; identifying ato-be-detected target from the visual image; calculating three-dimensional coordinates of the to-be-detected target in a camera coordinate system; calculating a visual control angle according to the three-dimensional coordinates of the to-be-detected target in the camera coordinate system; calculating three-dimensional coordinates of the to-be-detected target in a ground coordinate system; calculating a prediction compensation angle according to the three-dimensional coordinates of the to-be-detected target in the ground coordinate system; and according to the visual control angle and the prediction compensation angle, a holder motor is controlled based on a fuzzy control algorithm. According to the technical scheme, when the pan-tilt motor is controlled, the yaw angle prediction compensation value and the pitch angle prediction compensation value of the to-be-detected target are obtained by referring to the three-dimensional coordinates of the to-be-detected target in the ground coordinate system and the gyroscope detection data, so that the response speed of an operation system and the control accuracy are improved.
Owner:FOSHAN UNIVERSITY

Automatic visual sorting equipment applied to crayfish

The invention provides automatic visual sorting equipment applied to crayfish. The automatic visual sorting equipment comprises a feeding device, an automatic weighing device, an automatic recognition device, a product sorting device and a master control device. Crayfish is placed in the feeding device and conveyed to the automatic weighing device through the feeding device, the weight of each single product is automatically recorded and fed back to the master control device, the weight is conveyed to the automatic recognition device through the automatic weighing device, and photographing recognition of the size, color and the like of the product is conducted in the automatic recognition device. And the master control system carries out internal algorithm calculation according to parameters such as weight parameters in the automatic weighing device, the size and color of the automatic identification device and the like, so that the products are automatically sorted in the sorting device according to requirements. The whole device is stable in transmission, all products are automatically sorted according to requirements on the premise that the product quality is not damaged, and the production efficiency is remarkably improved. Under the condition that the product quality is not damaged, the products are automatically sorted according to different parameters such as weight, size and color by utilizing technologies such as computational vision, and the production efficiency of the whole production line is improved.
Owner:JIANGSU ACADEMY OF AGRICULTURAL SCIENCES +2

Image processing method and device and medium

The invention discloses an image processing method and device and a medium, and relates to the technical field of image processing, and the method comprises the steps: obtaining a plurality of original cabin images at different visual angles; splicing the plurality of original vehicle cabin images to obtain a vehicle cabin panoramic image; determining a target object in the plurality of original cabin images based on computer visual perception; determining a target virtual image corresponding to the target object, and obtaining a target layer containing the target virtual image; and superposing the vehicle cabin panoramic image and the target image layer to obtain a target panoramic image containing the target virtual image. According to the method, the target object in the vehicle cabin image is perceived by using the computational vision technology, and the detected target object is represented by using the target virtual image, so that the perceiving effect of the synthesized target panoramic image on the vehicle cabin can be effectively improved.
Owner:中汽创智科技有限公司

Vision acquisition and calibration method, device and system based on multi-degree-of-freedom robotic arm

The invention belongs to the technical field of intelligent robots, and discloses a vision acquisition and calibration method based on a multi-degree-of-freedom mechanical arm. The method comprises the following steps that a designated point on the mechanical arm is kept fixed, and the mechanical arm is respectively rotated for one circle around at least two coordinate axes of a coordinate systemtaking the designated point as an original point; coordinates of each corner point on a calibration plate in the rotating process are obtained; a circle center coordinate of the motion track of the calibration plate and a space offset vector from each corner point to the designated point under a vision acquisition coordinate system on a designated frame are calculated; the posture of the mechanical arm is kept unchanged from the designated point to the tail end of the mechanical arm, and the mechanical arm is respectively translated towards different planes; a plurality of frames of images aresampled for the calibration plate in the translation process, and meanwhile the coordinates of the tail end of the mechanical arm are obtained through the mechanical arm system; and the transformation relation between the vision acquisition coordinate system and the mechanical arm coordinate system is calculated. By adopting the method, the coordinate system transformation relation between visionacquisition equipment and the mechanical arm system can be calculated without accurately fixing the calibration plate without measurement equipment.
Owner:YIJIAHE TECH CO LTD

Autonomous robot anti-jamming prediction method and device based on artificial intelligence

The invention relates to the technical field of artificial intelligence, in particular to an autonomous robot anti-jamming prediction method and device based on artificial intelligence. The method comprises the following steps: acquiring an image of the surface of a photovoltaic cell panel; carrying out edge detection on the image by adopting a semantic segmentation network model to obtain edge frame information; adopting a grid key point detection network model to detect grid key points of the photovoltaic cell panel; determining a first grid and a second grid according to the distances fromthe grid key points to the edge lines and the perpendicular bisectors; calculating a visual height difference according to the areas of the first grid and the second grid; and obtaining the height difference through weighted fusion with the infrared height difference obtained through the infrared distance measuring sensor, and when the height difference is larger than a threshold value, stopping the autonomous robot from advancing and giving an alarm. According to the method, the problem that the autonomous robot is stuck due to overlarge height difference and the problem that the distance measurement of the infrared distance measurement sensor is inaccurate due to the anti-reflection characteristic of the photovoltaic cell panel are solved, and the accuracy and reliability of a judgment result are improved.
Owner:郑州迈拓信息技术有限公司

Method and equipment for computational vision detection and computer readable storage medium

The invention discloses a method for computational vision detection. The method comprises the following steps: controlling a pulse light source in communication connection with equipment for computational vision detection to be turned on at a beginning of a preset detection period; controlling an area-array camera in communication connection with the equipment for computational vision detection toshoot at a moment T1 of the preset detection period to obtain an image of a first face of a target product, wherein the target product is a transparent or translucent product; and controlling the area-array camera to shoot at a moment T2 of the preset detection period to obtain an image of a second face of a target product, wherein the first face of the target product is located on an upper layerof the second face. In addition, the invention also discloses the equipment for computational vision detection and a computer readable storage medium. Thus, the method for computational vision detection provided by the invention can detect multiple different faces of the product without turning over the product and using multiple sets of equipment, can effectively improve detection efficiency, and can also save detection costs at the same time.
Owner:SHENZHEN GONGJIN ELECTRONICS CO LTD

Oral mucosal disease detection model construction and prediction method and device, terminal and medium

According to the oral mucosal disease detection model construction and prediction method and device, the terminal and the medium provided by the invention, an oral mucosal disease auxiliary diagnosis system is constructed by utilizing a computational vision technology to detect, segment and identify an oral white lesion map, so that a doctor can be helped to quickly position the part and the form of the oral mucosal white lesion, and the doctor can be assisted to make a diagnosis. The method has the following advantages: (1) the computer vision technology can strengthen expression of medical images and can observe subtle differences, including characteristics of parts, colors, forms, areas and the like, of oral white lesions, and the vision algorithm can perform multidirectional analysis on the images and capture changes which are not easily perceived by human eyes; (2) the diagnosis levels of doctors on the oral mucosal disease are different, and the standardized visual model can make more accurate and more standard diagnosis; and (3) intelligent equipment with a good visual algorithm can be applied to thousands of households and common families in the future to help people to monitor the oral health condition by themselves, so that the medical expenditure is reduced.
Owner:SHANGHAI NINTH PEOPLES HOSPITAL SHANGHAI JIAO TONG UNIV SCHOOL OF MEDICINE +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products