This repository contains the source code and datasets for our ICRA 2025 and ICCVW 2025 papers:
X. Zhu, J. Henningsson, D. Li, P. Mårtensson, L. Hanson, M. Björkman, and A. Maki,
"Domain Randomization for Object Detection in Manufacturing Applications Using Synthetic Data: A Comprehensive Study,"
Proceedings of the 2025 IEEE International Conference on Robotics and Automation (ICRA), Atlanta, GA, USA, 2025, pp. 16715–16721.
📄 Paper (IEEE Xplore)
X. Zhu, J. Henningsson, D. Li, P. Mårtensson, L. Hanson, M. Björkman, and A. Maki,
"Towards Automated Assembly Quality Inspection with Synthetic Data and Domain Randomization,"
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, October 2025, pp. 1395–1403.
📄 Paper (CVF Open Access)
For assembled objects, follow the Object Preparation instructions to prepare the CAD models.
This repository is also part of a broader research line on synthetic data for manufacturing vision systems.
This work focuses on generating a static synthetic dataset and training an object detection model without using any real data. In our subsequent work, we extend this approach with Synthetic Active Learning (SAL), where synthetic data is iteratively generated to target the model's weaknesses and continuously improve detection performance. If you are interested, please refer to our
📄 Journal of Manufacturing Systems paper: Designing Synthetic Active Learning for Model Refinement in Manufacturing Parts Detection
and its 💻 GitHub Repo: SAL.
This work focuses on object detection tasks; our previous work focused on the classification task. There, we introduced a synthetic classification dataset and systematically evaluated sim-to-real transfer for manufacturing components. For details, please refer to our previous paper:
📄 CVPRW paper: Towards Sim-to-Real Industrial Parts Classification With Synthetic Dataset
and its 💻 GitHub Repo: SIP-17
This code generates synthetic data from 3D models using domain randomization. We use two datasets to generate synthetic images and train an object detection model, which performs well on real-world data.
- Robotic Dataset: Published by Horváth et al., this dataset includes both 3D models and real images.
  - 📂 3D Models: located in `data/Objects/Robotic/`, containing 10 `.obj` files.
  - 🖼️ Real Images: download from Dropbox – Public Robotic Dataset. We use the `yolo_cropped_all` subset for real-image evaluation.
- SIP15-OD Dataset: Developed by us, this dataset contains 15 manufacturing object 3D models across three use cases, along with 395 real images featuring 996 annotated objects taken in various manufacturing environments. Due to company policy, the original CAD models cannot be publicly released. However, the real-world annotated images are available via Roboflow-SIP15OD.
- SIP2A-OD Dataset: Developed by our team, this dataset focuses on two assembly use cases and contains 249 annotated real-world images collected from diverse industrial environments. Due to company policy, the original CAD models cannot be publicly released. However, the real-world annotated images are available via Roboflow-SIP2AOD.
Below are samples of the synthetic data and their real-world counterparts from the robotic dataset, as well as the three use cases from the SIP15-OD dataset.
This repository presents our domain randomization pipeline for synthetic data generation in manufacturing object detection.
The pipeline generates synthetic images using Blender, where domain randomization is applied across five components of the generation process:
- 3D scene initialization
- Object sampling
- Illumination sampling
- Camera view capture
- Post-processing
From parameter configuration to rendering and automatic ground-truth annotation, the system produces fully labeled synthetic datasets. These synthetic images are used to train an object detection model, which is then evaluated on real-world test data to assess sim-to-real performance.
Domain randomization pipeline: synthetic data generation in Blender followed by model training on synthetic data and evaluation on real images.
Through systematic experiments, we identify material properties, rendering methods, post-processing strategies, and distractors as critical factors for real-world generalization.
Training exclusively on synthetic data, the proposed pipeline achieves 96.4% mAP@50 on the public robotic dataset and 94.1%, 99.5%, and 95.3% mAP@50 across the three SIP15-OD use cases. These results demonstrate that carefully designed domain randomization can approximate real industrial data distributions without requiring real training images.
- Set up the conda environment: `conda env create -f environment.yml`
- Activate the environment: `conda activate SynMfg_Code`
- Go to Blender 3.4 and download the appropriate version of Blender for your system, e.g. `blender-3.4.1-windows-x64.msi` for Windows or `blender-3.4.1-linux-x64.tar.xz` for Linux.
- Install Blender.
- Set the environment variable `BLENDER_PATH` to the Blender executable, e.g. `C:\Program Files\Blender Foundation\Blender 3.4\blender.exe` for Windows or `/user/blender-3.4.1-linux-x64/blender` for Linux.
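If you want to sanity-check the environment variable before launching a long generation run, a small helper like the following can resolve the executable (a hypothetical utility, not part of this repository; `resolve_blender` is our name for it):

```python
import os
import shutil


def resolve_blender():
    """Return a usable Blender executable path.

    Prefers BLENDER_PATH when it points to an existing file, otherwise
    falls back to any `blender` binary found on the system PATH.
    Returns None if neither is available.
    """
    path = os.environ.get("BLENDER_PATH")
    if path and os.path.isfile(path):
        return path
    return shutil.which("blender")
```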
Place downloaded textures into their corresponding folders inside the `data` folder structure:

```
SynMfg_Code/
└── data/
    ├── Background_Images/
    ├── Objects/
    ├── PBR_Textures/
    └── Texture_Images/
```
- Go to Google Drive.
- Download all image files from the `train` and `testval` folders.
- Put all images into `data/Background_Images`.
- Go to the Flickr 8k Dataset.
- Download all image files.
- Put all images into `data/Texture_Images`.
- Run `blenderproc download cc_textures data/PBR_Textures`. This downloads textures from cc0textures.com.
- To use specific material textures such as metal, create a new folder named `data/Metal_Textures` and place only the metal textures from the `cc_textures` data there.
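The metal-only folder can be populated by hand, or with a small script. The helper below is a sketch (not part of the repository; it assumes cc0textures folder names contain the word "Metal", which is how that collection names its metal materials):

```python
import shutil
from pathlib import Path


def copy_metal_textures(src="data/PBR_Textures", dst="data/Metal_Textures"):
    """Copy texture folders whose name suggests a metal material.

    Matches folders by name only, so adjust the keyword if your
    texture collection uses a different naming scheme.
    """
    src, dst = Path(src), Path(dst)
    dst.mkdir(parents=True, exist_ok=True)
    copied = []
    for folder in src.iterdir():
        if folder.is_dir() and "metal" in folder.name.lower():
            shutil.copytree(folder, dst / folder.name, dirs_exist_ok=True)
            copied.append(folder.name)
    return copied
```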
The preparation of the 3D models used in the pipeline is described in the Objects section.
Our data generation pipeline considers five components of DR: object characteristics, background, illumination, camera settings, and post-processing. The range of random settings for these components is defined in the `config-sample.json` file. To customize these settings, copy the sample configuration file and make the necessary changes. Once the desired parameters are set, the pipeline generates synthetic data accordingly. The parameters specified in the configuration file for the five DR components and rendering are detailed in the table below:
| Parameter | Description | Default value |
|---|---|---|
| Background | ||
| background_texture_type | Type of texture: 1: no texture; 2: Random images from the BG-20L dataset. | 2 |
| total_distracting_objects | Maximum number of distractors in the scene. | 10 |
| Object | ||
| max_objects | Maximum number of objects; -1 includes all objects and empty background images. | -1 |
| multiple_of_same_object | Allow multiple instances of the same object in one scene. | TRUE |
| object_weights | Weights for object categories; [] for equal distribution. | [] |
| nr_objects_weights | Weights for the number of objects; [] for equal distribution. | [] |
| object_rotation_x_min | Min x-axis rotation angle for objects. | 0 |
| object_rotation_x_max | Max x-axis rotation angle for objects. | 360 |
| object_rotation_y_min | Min y-axis rotation angle for objects. | 0 |
| object_rotation_y_max | Max y-axis rotation angle for objects. | 360 |
| object_distance_scale_min | Min ratio of distance between objects; 0.53 prevents overlap. | 0.53 |
| object_distance_scale_max | Max ratio of distance between objects. | 1 |
| objects_texture_type | Type of textures: 1: RGB; 2: image; 3: PBR materials; 0: random. | 3 |
| Camera | ||
| camera_zoom_min | Minimum zoom level of the camera. | 0.1 |
| camera_zoom_max | Maximum zoom level of the camera. | 0.7 |
| camera_theta_min | Minimum azimuthal angle of the camera. | 0 |
| camera_theta_max | Maximum azimuthal angle of the camera. | 360 |
| camera_phi_min | Minimum polar angle of the camera. | 0 |
| camera_phi_max | Maximum polar angle of the camera. Max: 90 degrees. | 60 |
| camera_focus_point_x_shift_min | Min shift in the x-direction for the camera focus point. | 0 |
| camera_focus_point_x_shift_max | Max shift in the x-direction for the camera focus point. | 0.5 |
| camera_focus_point_y_shift_min | Min shift in the y-direction for the camera focus point. | 0 |
| camera_focus_point_y_shift_max | Max shift in the y-direction for the camera focus point. | 0.5 |
| camera_focus_point_z_shift_min | Min shift in the z-direction for the camera focus point. | 0 |
| camera_focus_point_z_shift_max | Max shift in the z-direction for the camera focus point. | 0.5 |
| Illumination | ||
| light_count_auto | Auto set light count based on scene size. | 1 |
| light_count_min | Min number of lights (when light_count_auto = 0). | 0 |
| light_count_max | Max number of lights (when light_count_auto = 0). | 0 |
| light_energy_min | Min energy level of the lights. | 5 |
| light_energy_max | Max energy level of the lights. | 150 |
| light_color_red_min | Min red color value of the lights. | 0 |
| light_color_red_max | Max red color value of the lights. | 255 |
| light_color_green_min | Min green color value of the lights. | 0 |
| light_color_green_max | Max green color value of the lights. | 255 |
| light_color_blue_min | Min blue color value of the lights. | 0 |
| light_color_blue_max | Max blue color value of the lights. | 255 |
| Post-processing | ||
| image_sp_noise_probability | Probability of applying salt-and-pepper noise (0-1). | 0.1 |
| image_sp_noise_amount_min | Min amount of salt-and-pepper noise. | 0.01 |
| image_sp_noise_amount_max | Max amount of salt-and-pepper noise. | 0.05 |
| image_gaussian_blur_probability | Probability of applying Gaussian blur (0-1). | 0.1 |
| image_gaussian_blur_sigma_min | Min sigma value for Gaussian blur. | 1 |
| image_gaussian_blur_sigma_max | Max sigma value for Gaussian blur. | 3 |
| Rendering | ||
| generate_nr_samples | Total number of synthetic images to generate. | 4000 |
| nr_blender_instances | Number of blender instances to run. | 10 |
| render_image_width | Width of the rendered image. | 720 |
| render_image_height | Height of the rendered image. | 720 |
| render_engine_samples_max | Maximum number of render engine samples. | 32 |
| render_engine | Render engine to use for generating images (-1: random engine, 0: Cycles, 1: EEVEE). | 0 |
| eevee_postprocessing | Enable post-processing for the EEVEE engine. | FALSE |
| render_image_format | Format of the rendered image (PNG or JPEG). | PNG |
| background_samples | Include background images without objects. | TRUE |
| segmentations | Whether to generate segmentation mask annotations. | TRUE |
| clean_paths | If true, start rendering anew; if false, continue from previous. | TRUE |
| object_label | Labels of the 3D objects. | {"L-bracket.obj": "0", "U-bracket.obj": "1", etc.} |
Unless otherwise specified, the ranges are as follows: angles (0 to 360 degrees), ratios (0 to 1), colors (0 to 255), and other numbers (0 to unlimited).
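As a concrete illustration, a customized configuration might contain an excerpt like the following. This is not the complete `config-sample.json`; the values are drawn from the defaults in the table above:

```json
{
  "max_objects": -1,
  "multiple_of_same_object": true,
  "objects_texture_type": 3,
  "background_texture_type": 2,
  "total_distracting_objects": 10,
  "camera_zoom_min": 0.1,
  "camera_zoom_max": 0.7,
  "light_energy_min": 5,
  "light_energy_max": 150,
  "image_sp_noise_probability": 0.1,
  "image_gaussian_blur_probability": 0.1,
  "generate_nr_samples": 4000,
  "render_engine": 0,
  "render_image_format": "PNG",
  "object_label": {"L-bracket.obj": "0", "U-bracket.obj": "1"}
}
```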
Run `python generation_main.py --config config-sample.json` to start the generation.
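To make the post-processing parameters concrete, here is a minimal NumPy sketch of how randomized salt-and-pepper noise and Gaussian blur can be applied with the configured probabilities and ranges. This is an illustration only, not the repository's actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)


def salt_pepper(img, amount):
    """Flip a fraction `amount` of pixels to black (pepper) or white (salt)."""
    out = img.copy()
    mask = rng.random(img.shape[:2]) < amount
    salt = rng.random(img.shape[:2]) < 0.5
    out[mask & salt] = 255
    out[mask & ~salt] = 0
    return out


def gaussian_blur(img, sigma):
    """Separable Gaussian blur built from two 1-D convolutions."""
    radius = int(3 * sigma)
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-x**2 / (2 * sigma**2))
    kernel /= kernel.sum()
    blurred = img.astype(float)
    for axis in (0, 1):  # blur rows, then columns
        blurred = np.apply_along_axis(
            lambda m: np.convolve(m, kernel, mode="same"), axis, blurred)
    return blurred.round().astype(img.dtype)


def postprocess(img, cfg):
    """Apply the randomized post-processing described by the config table."""
    if rng.random() < cfg["image_sp_noise_probability"]:
        amount = rng.uniform(cfg["image_sp_noise_amount_min"],
                             cfg["image_sp_noise_amount_max"])
        img = salt_pepper(img, amount)
    if rng.random() < cfg["image_gaussian_blur_probability"]:
        sigma = rng.uniform(cfg["image_gaussian_blur_sigma_min"],
                            cfg["image_gaussian_blur_sigma_max"])
        img = gaussian_blur(img, sigma)
    return img
```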
After generating synthetic data, train the YOLOv8 model by following the instructions on the official Ultralytics GitHub repository (YOLOv8 GitHub) and evaluate its performance on real images.
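Training YOLOv8 requires a dataset YAML describing the image locations and class names. A hypothetical example is shown below; the paths and class names are placeholders that must be adapted to your generated dataset and the `object_label` mapping in your config:

```yaml
# Hypothetical dataset config for Ultralytics YOLOv8.
path: datasets/synthetic   # dataset root (placeholder)
train: images/train        # synthetic training images
val: images/val            # real validation images
names:
  0: L-bracket
  1: U-bracket
```

Training can then be launched with the Ultralytics CLI, e.g. `yolo detect train data=synthetic.yaml model=yolov8n.pt epochs=100`.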
This project is licensed under the GNU General Public License v3.0 — see the LICENSE file for details.
The robotic dataset is from Horváth et al., including their `.obj` files and real images accessed from their GitLab repository. We thank them for their great work!
We also thank previous works in domain randomization for industrial applications, including Tobin et al., Eversberg and Lambrecht, and Horváth et al..
We acknowledge the contributions of the YOLOv8 model from Ultralytics, which we used for training our model.
If you find our work helpful for your research, please consider citing:
@inproceedings{Zhu2025ICRA,
author = {Zhu, Xiaomeng and Henningsson, Jacob and Li, Duruo and Mårtensson, Pär and Hanson, Lars and Björkman, Mårten and Maki, Atsuto},
title = {Domain Randomization for Object Detection in Manufacturing Applications Using Synthetic Data: A Comprehensive Study},
booktitle = {2025 IEEE International Conference on Robotics and Automation (ICRA)},
year = {2025},
pages = {16715--16721},
doi = {10.1109/ICRA55743.2025.11128647}
}

@inproceedings{Zhu2025ICCVW,
author = {Zhu, Xiaomeng and Henningsson, Jacob and Li, Duruo and M{\aa}rtensson, P{\"a}r and Hanson, Lars and Bj{\"o}rkman, M{\aa}rten and Maki, Atsuto},
title = {Towards Automated Assembly Quality Inspection with Synthetic Data and Domain Randomization},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
year = {2025},
pages = {1395--1403}
}