Skip to content

Python library for automatically creating synthetic training data using Blender.

License

Notifications You must be signed in to change notification settings

autognc/starfish

Repository files navigation

Starfish Documentation Status

Starfish is a Python library for automatically creating synthetic, labeled image data using Blender.

This library it extends Blender's powerful Python scripting ability, providing utilities that make it easy to generate long sequences of synthetic images with intuitive parameters. It was designed for people who, like me, know Python much better than they know Blender.

These sequences can be smoothly interpolated from waypoint to waypoint, much like a traditional keyframe-based animation. They can also be exhaustive or random, containing images with various object poses, backgrounds, and lighting conditions. The intended use for these sequences is the generation of training and evaluation data for machine learning tasks where annotated real data may be difficult to obtain.

Though Starfish has only been tested with Blender 2.82, it should work with any version 2.8+. Please open an issue if it doesn't.

  1. Identify the location of your Blender scripts directory. This can be done by opening Blender, clicking on the 'Scripting' tab, and entering bpy.utils.script_path_user() in the Python console at the bottom. Generally, on Linux, the default location is ~/.config/blender/[VERSION]/scripts. From now on, this path will be referred to as [SCRIPTS_DIR].
  2. Create the addon modules directory, if it does not exist already: mkdir -p [SCRIPTS_DIR]/addons/modules
  3. Install the library to Blender: pip install git+https://github.com/autognc/starfish --no-deps --target [SCRIPTS_DIR]/addons/modules. Starfish does not require any additional packages besides what is already bundled with Blender, which is why --no-deps can be used.

Starfish can also be pip-installed normally without Blender for testing purposes or for independent usage of certain modules.

To use Starfish, you're probably going to have to interact with Blender using the Python API. This library also makes heavy use of mathutils, which is an independent math library that comes bundled with Blender.

The easiest way to experiment with the library is by opening Blender, navigating to the Scripting tab, and hitting the plus button to create a new script. You can then import starfish, write some code, and hit Alt+P to see what it does.

Once you're ready to execute a more long-running script, you can write it outside Blender and then execute it using blender file.blend --background --python script.py (or blender file.blend -b -P script.py for short).

Frames

At the core of Starfish is the Frame class, which represents a single image of a single object. A frame is defined by 6 parameters:

frame = starfish.Frame(
    position=(0, 0, 0),
    distance=10,
    pose=mathutils.Euler([0, math.pi, 0]),
    lighting=mathutils.Euler([0, 0, 0]),
    offset=(0.3, 0.7),
    background=mathutils.Euler([math.pi / 2, 0, 0])
)

See the starfish.Frame documentation for more details about what each parameter means. Once you have a frame object, you use it to 'set up' your scene:

frame.setup(
    bpy.data.scenes['MyScene'],
    bpy.data.objects['MyObject'],
    bpy.data.objects['Main_Camera'],
    bpy.data.objects['The_Sun']
)

This moves all the objects so that the image that the camera sees matches up with the parameters in the frame object. At this point, you can render the frame using bpy.ops.render.render. You can also dump metadata about a frame into JSON format using the Frame.dumps method.

Sequences

Of course, Starfish wouldn't be very useful without the ability to create multiple frames at once. The Sequence class is essentially just a list of frames, but with several classmethod constructors for generating these sequences of frames in different ways. For example, Sequence.interpolated generates 'animated' sequences that smoothly interpolate between keyframes, and Sequence.exhaustive generates long sequences that contain every possible combination of the parameters given.

Once you've created a sequences, you can iterate through its frames like so:

seq = starfish.Sequence...

for frame in seq:
    frame.setup(...)
    bpy.ops.render.render()

The Sequence.bake method also provides an easy way to 'preview' sequences that you're working on in Blender. See Sequence for more detail.

Utils

The utils module provides a few more functions that may be useful for core image generation, such as random_rotations or uniform_sphere.

Annotation

Starfish also contains an annotation module that provides utility functions related to annotating generated data.

  • One common type of annotation generated for computer vision tasks is some sort of segmentation mask (e.g. using the ID Mask Node) where having perfectly uniform colors is important. Unfortunately, I've often encountered an issue in Blender where the output colors differ slightly: for example, instead of the background being solid rgb(0, 0, 0) black, it will actually be a random mix of rgb(0, 0, 1), rgb(1, 1, 0), etc. The normalize_mask_colors function can be used to clean up such images.
  • Once a mask has been cleaned up, get_bounding_boxes_from_mask and get_centroids_from_mask can be used to get the bounding boxes and centroids of segmented areas, respectively.
  • Another common type of annotation is keypoints: e.g. where particular 3D points on the object appear in the 2D image. generate_keypoints can be used to automatically generate evenly distributed 3D keypoints from an object's mesh; project_keypoints_onto_image can then take these keypoints and map them to 2D image locations after rendering a particular frame.

See example.py.

About

Python library for automatically creating synthetic training data using Blender.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages