Java Options and Label Studio Guide: Difference between pages
No edit summary |
|||
| Line 1: | Line 1: | ||
= Preparing a Dataset with Label Studio = | |||
This guide explains how to create an annotated dataset for YOLO training using Label Studio. | |||
Install Label Studio using pip: | |||
<pre> | |||
pip install label-studio | |||
</pre> | |||
Verify the installation: | |||
<pre> | |||
python -m label_studio.server --help | |||
</pre> | |||
Or refer to the official installation guide: | |||
https://labelstud.io/guide/install | |||
=== Enable Local File Storage === | |||
For large projects it is not recommended to upload images directly through the Label Studio interface. Instead, configure a local directory that contains the images to annotate. | |||
To enable local file access, configure the following environment variables before starting Label Studio: | |||
< | <pre> | ||
LABEL_STUDIO_LOCAL_FILES_SERVING_ENABLED=true | |||
LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT=/path/to/images | |||
</pre> | |||
=== Starting Label Studio === | |||
Label Studio can be started using one of the following methods. | |||
==== Default Startup ==== | |||
If local file storage is not required, Label Studio can be started with the default configuration: | |||
<pre> | |||
label-studio start | |||
</pre> | |||
or | |||
<pre> | |||
python -m label_studio.server start | |||
</pre> | |||
By default, the application is available at: | |||
<pre> | |||
http://localhost:8080 | |||
</pre> | |||
==== Startup with Local File Storage ==== | |||
When working with large datasets, it is recommended to configure Local Storage so that images are accessed directly from the filesystem. | |||
Windows example: | |||
<pre> | |||
set LABEL_STUDIO_PORT=8081 | |||
set LABEL_STUDIO_LOCAL_FILES_SERVING_ENABLED=true | |||
set LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT=C:\Users\username\Documents\label-studio | |||
python -m label_studio.server start | |||
</pre> | |||
After startup, Label Studio will be available at: | |||
<pre> | |||
http://localhost:8081 | |||
</pre> | |||
The directory specified by '''LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT''' can then be configured as Local Storage within a Label Studio project. | |||
Port '''8081''' is used to avoid conflicts with the default LogicalDOC installation, which typically runs on port '''8080'''. | |||
=== Create a Project === | |||
# Login to Label Studio | |||
# Click '''Create Project''' | |||
# Enter a project name | |||
# Configure the labeling interface | |||
# Save the project | |||
=== Import Images === | |||
# Open the project | |||
# Click '''Import''' | |||
# Select '''Local Storage''' | |||
=== Configure Local Storage === | |||
# Open the project | |||
# Navigate to '''Settings > Cloud Storage''' | |||
# Click '''Add Source Storage''' | |||
# Select '''Local Files''' | |||
# Configure the path specified by LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT | |||
# Click '''Sync Storage''' | |||
[[File:local-storage-button.png|thumb|800px|center|Local Storage button]] | |||
[[File:Storage-Settings-Label-Studio.png|thumb|800px|center|Local Storage Selection]] | |||
[[File:LabelStudio-local-storage.png.png|thumb|800px|center|Local Storage configuration showing a synchronized directory of document images]] | |||
After synchronization, Label Studio automatically creates one task for each imported document image. | |||
When importing images, choose '''Files''' as the import method. | |||
=== Annotate Documents === | |||
# Open a task | |||
# Select a label | |||
# Draw a bounding box around the target area | |||
# Save the annotation | |||
Example labels: | |||
* Invoice Number | |||
* Date | |||
* Seller Name | |||
* Buyer Name | |||
* Total Amount | |||
[[File:label-studio-annotated-image-example.png|thumb|600px|center|Example annotation]] | |||
=== Export the Dataset === | |||
# Open the project | |||
# Click '''Export''' | |||
# Select the desired format | |||
Supported formats include: | |||
* YOLO | |||
* COCO | |||
* Pascal VOC | |||
* CSV | |||
For YOLO training, export the dataset in YOLO format. | |||
=== Dataset Formats === | |||
==== COCO ==== | |||
COCO is a JSON-based dataset format commonly used for object detection datasets. | |||
More information: | |||
https://docs.aws.amazon.com/rekognition/latest/customlabels-dg/md-coco-overview.html | |||
==== YOLO ==== | |||
YOLO datasets contain images and annotation files organized according to a predefined directory structure. | |||
More information: | |||
https://docs.cvat.ai/docs/dataset_management/formats/format-yolo/ | |||
==== YOLOv8 OBB ==== | |||
YOLOv8 OBB (Oriented Bounding Boxes) extends the standard YOLO format by supporting rotated bounding boxes using eight normalized coordinates. | |||
Revision as of 13:25, 23 June 2026
Preparing a Dataset with Label Studio
This guide explains how to create an annotated dataset for YOLO training using Label Studio.
Install Label Studio using pip:
pip install label-studio
Verify the installation:
python -m label_studio.server --help
Or refer to the official installation guide: https://labelstud.io/guide/install
Enable Local File Storage
For large projects it is not recommended to upload images directly through the Label Studio interface. Instead, configure a local directory that contains the images to annotate.
To enable local file access, configure the following environment variables before starting Label Studio:
LABEL_STUDIO_LOCAL_FILES_SERVING_ENABLED=true LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT=/path/to/images
Starting Label Studio
Label Studio can be started using one of the following methods.
Default Startup
If local file storage is not required, Label Studio can be started with the default configuration:
label-studio start
or
python -m label_studio.server start
By default, the application is available at:
http://localhost:8080
Startup with Local File Storage
When working with large datasets, it is recommended to configure Local Storage so that images are accessed directly from the filesystem.
Windows example:
set LABEL_STUDIO_PORT=8081 set LABEL_STUDIO_LOCAL_FILES_SERVING_ENABLED=true set LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT=C:\Users\username\Documents\label-studio python -m label_studio.server start
After startup, Label Studio will be available at:
http://localhost:8081
The directory specified by LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT can then be configured as Local Storage within a Label Studio project.
Port 8081 is used to avoid conflicts with the default LogicalDOC installation, which typically runs on port 8080.
Create a Project
- Login to Label Studio
- Click Create Project
- Enter a project name
- Configure the labeling interface
- Save the project
Import Images
- Open the project
- Click Import
- Select Local Storage
Configure Local Storage
- Open the project
- Navigate to Settings > Cloud Storage
- Click Add Source Storage
- Select Local Files
- Configure the path specified by LABEL_STUDIO_LOCAL_FILES_DOCUMENT_ROOT
- Click Sync Storage



After synchronization, Label Studio automatically creates one task for each imported document image.
When importing images, choose Files as the import method.
Annotate Documents
- Open a task
- Select a label
- Draw a bounding box around the target area
- Save the annotation
Example labels:
- Invoice Number
- Date
- Seller Name
- Buyer Name
- Total Amount

Export the Dataset
- Open the project
- Click Export
- Select the desired format
Supported formats include:
- YOLO
- COCO
- Pascal VOC
- CSV
For YOLO training, export the dataset in YOLO format.
Dataset Formats
COCO
COCO is a JSON-based dataset format commonly used for object detection datasets.
More information: https://docs.aws.amazon.com/rekognition/latest/customlabels-dg/md-coco-overview.html
YOLO
YOLO datasets contain images and annotation files organized according to a predefined directory structure.
More information: https://docs.cvat.ai/docs/dataset_management/formats/format-yolo/
YOLOv8 OBB
YOLOv8 OBB (Oriented Bounding Boxes) extends the standard YOLO format by supporting rotated bounding boxes using eight normalized coordinates.