Data Annotation

For training AI models, as well as for assessing the quality of already trained models, we need data with ground truth targets. These annotations can be manually created in a photo editor or a dedicated annotation tool.

Neura CVAT is a specialized computer vision annotation tool developed by Neura, built upon the foundation of the open-source CVAT platform. It provides a powerful and user-friendly interface for labeling images and videos for computer vision tasks.

The system offers comprehensive annotation capabilities including bounding boxes, polygons, and keypoints, along with multi-user labeling functionality for efficient team workflows. These features are essential for tasks like object detection, segmentation, and tracking.

Neura CVAT enhances the base CVAT platform with custom features specifically designed for Neura’s AI development pipeline, while maintaining the robust and flexible nature of the original system.

Note

The guide is simplified to focus on the most common usage scenarios. For more advanced features and options, please refer to the official CVAT documentation.

Sign in / Sign up

Access Neura CVAT through your web browser at http://192.168.2.14:8050.

In the start page, you will encounter a login interface, where you enter your email and password.

New users can register by clicking Create an Account:

../_images/cvat-login.png

Create a Project

Labeling is managed through Project functionality. Within each Project, a user can set up task-specific settings, track changes and create versions.

Click + and proceed to Create a new project.

../_images/cvat-create-a-new-project.png

Provide a name for your Project and add labels. Click Submit & Open to complete. Later, you can edit the name, as well as add, modify or remove labels.

Create a Task

../_images/cvat-create-a-new-task.png

Click on +. Assign a name to the task, associate it with a project, and proceed to Select files. Add images either by dragging them into the designated square or clicking on them. Click Submit & Open to complete.

After that, a list of Jobs should appear. You can start the annotation process by clicking on Job#1.

../_images/cvat-jobs.png

Annotation Menu

AI Tools

../_images/cvat-ai-tools.png

AI Tools include Interactors, Detectors, and Trackers used for automatic annotation.

  • Interactors allow labeling objects in an image or video, or refining existing annotations. Select a pretrained model, click on an object, and get an automatically generated object mask for this object.

  • Detectors are designed to automate the annotation process of large image or video datasets. Choose a pretrained object detection model, and get an automatically generated bounding boxes for objects in an image or video.

  • Trackers are specifically tailored for annotating moving objects in videos, e.g., for object tracking in surveillance or action recognition.Trackers match the objects across frames in a video sequence to maintain temporal consistency of annotations.

OpenCV Tools

../_images/cvat-opencv-tools.png

OpenCV Tools include Drawing, Image, and Tracking.

  • Drawing allows adding shapes, lines, and other graphical elements to an image or a video frame. With Drawing, a user can mark regions of interest or highlight specific areas or objects within an image.

  • Image allows adding metadata, text, or labels to an image to descriptive information or labels for images within a dataset. With Image, a user can add descriptions or comments on images, add labels or categories.

  • Tracking allows following and annotating object motions across video frames. Tracking can enable the analysis of object trajectories and evolution of dynamics cenes, as well as annotation of moving objects in videos, such as vehicles or pedestrians.

Apart from automatic tools, there are manual annotation options providing full control over the annotation process.

Click is the basic labeling operation, performed by pressing and releasing the left mouse button.

Draw a Rectangle

../_images/cvat-draw-rectangle.png

Target application. Rectangular annotations (also known as bounding boxes) are used for object detection and object tracking, e.g., as the initial regions for tracking.

Procedure. Rectangles are defined by clicking the corner points. Axis-aligned rectangles can be annotated with two diagonal corners: Top-Left and Bottom-Right. Oriented rectangles can be annotated with four corners.

Draw a Polygon

../_images/cvat-draw-polygon.png

Target application. Polygonal annotations are used for object segmentation, object detection, and object tracking. With polygons, objects can be outlined precisely. Besides, polygons can serve as initial regions for tracking algorithms.

Procedure. Polygons can be labeled by clicking on vertices sequentially. The last vertex is connected to the first, forming a closed polygon.

Draw a Polyline

../_images/cvat-draw-polyline.png

Target application. Polyline is a series of connected line segments. With polyline, a user can outline objects or define paths or trajectories within a scene for object segmentation, detection, and tracking.

Procedure. Polylines can be labeled by clicking on vertices sequentially. The last vertex is connected to the first, forming a closed shape.

Draw Points

../_images/cvat-draw-points.png

Target application. Points mark locations of interest within an image, that can be used for keypoint detection and as a weak supervision for other computer vision tasks.

Procedure. Points can be labeled by clicking on the desired location in an image.

Draw an Ellipse

../_images/cvat-draw-ellipse.png

Target application. Ellipses can be used for labeling objects with elliptical shapes for object segmentation, detection and tracking.

Procedure.
  1. Specify the coordinates of a center

  2. Define the major and minor axes lengths

  3. Specify the rotation angle.

Draw a Mask

../_images/cvat-draw-mask.png

Target application. Pixel-level binary masks can be used to define regions of interest for object segmentation, detection and tracking.

Procedure. Masks are drawn on top of an image using a painting tool.

Setup a Tag

../_images/cvat-menu-setup-tag.png

Target application. A tag allows categorizing annotated objects so that they can be easily found, sorted, filtered or analyzed.

Procedure. An object can be assigned with a tag via Setup tag. The tag can be selected from the existing tags, or a new tag can be created.

Basic Image Manipulation

../_images/cvat-menu-image-manipulation.png

The basic image manipulation functions include:

  • Cursor allows navigating and interacting with the CVAT interface: pointing, clicking, and selecting various elements on the screen.

  • Move the Image allows moving the image within the CVAT workspace to focus on specific areas of the image.

  • Rotate the Image rotates the image clockwise or anticlockwise.

  • Fit the Image allows adjusting the view to fit the entire image within the CVAT workspace to get an overview of the entire content.

  • Select a Region of Interest allows specifying an area or region in the image that you want to focus on or annotate.

Shortcuts

Shortcut

Action

F1

Open/hide the list of available shortcuts

N

Repeat the latest procedure of drawing with the same parameters

M

Activate or deactivate mode to merging shapes

Alt+M

Activate or deactivate mode to splitting shapes

G

Activate or deactivate mode to grouping shapes

Shift+G

Reset group for selected shapes (in group mode)

Esc

Cancel any active canvas mode

Ctrl+C

Copy shape to CVAT internal clipboard

Ctrl+V

Paste a shape from internal CVAT clipboard

Hold Ctrl while pasting

When pasting shape from the buffer for multiple pasting.

Ctrl+B

Make a copy of the object on the following frames

Ctrl+(0..9)

Changes a label for an activated object or for the next drawn object if no objects are activated