For those who do not know what is the Azure Kinect…  basically it is a developer kit with advanced AI sensors that provide sophisticated computer vision and speech models.

Topics covered in this post:

  1. Hardware
  2. Views RGB
  3. SDKs

1 – Hardware

66078292_708626469609469_1955189207554727725_n

This is the hardware presented by Microsoft. As you can see in the picture this Kinect which only weights 440g has:

  1. RGB Camera:
    • OV12A10 12MP CMOS sensor rolling shutter sensor.
    • USB video class-compatible and can be used without the Sensor SDK (Check at the bottom of this post to understand what is the SDK).
    • Color space : BT .601 full range [0..255]
  2. Depth Camera:
    • 1-Megapixel Time-of-Flight (ToF) imaging chip enabling higher modulation frequencies and depth precision.
    • Two NIR Laser diodes enabling near and wide FoV depth modes.
    • Depth provided outside of indicated range depending on object reflectivity.
    • Implements the Amplitude Modulated Continous Wave (AMCW) ToF principle. Casts modulated ilumination in the near IR (NIR) spectrum onto the scene. It then records an indirect measurment of the time ir takes the light to travel from the camera to the scene and back.
  3. IR emitters.
  4. Motion Sensor (IMU):
    • LSM6DSMUS includes accelerometer and a gyroscope sampled at 1.6 kHz, reporting to the host at 208 Hz. Origin [0,0,0] both coordinate systems are right-handed. IMU coordinate system
  5. Microphone array:
    • 7 microphone circular array identifies as a standard USB audio class 2.0 device.
    • Sensitivity: –22 dBFS (94 dB SPL, 1 kHz)
    • Signal to noise ratio > 65 dB
    • Acoustic overload point: 116 dB

image

2 – Field of View RGB / Depth

66078292_708626469609469_1955189207554727725_n

3D coordinate conventions

The best way understand the field-of-view, and the angle that the sensors “see” are thought this diagram.

  • This diagram shows the RGB Camera 1. in 4:3 mode from a distance of 2000mm.
  • Regarding the Depth Camera views (is tilted 6 degrees downwards of the color camera), both 2. and 3. Its important to understand that this camera transmits modulated IR images to the host PC. Then the depth engine software converts the raw signal into depth maps. As described in the image, the supported modes are:
    • NFOV (Narrow field-of-view): This modes are ideal for scenes with smaller extents in X and Y but larger in Z. One of the illuminators in this mode is aligned with the depth camera case, no tilted.
    • WFOV (Wide field-of-view): This modes are ideal for scenes with larger extents in X and Y but smaller in Z. The illuminator used in this view is tilted an additional 1.3 degrees downward relative to the depth camera.
      • Depth camera supports 2×2 binning modes (at the cost of lowering image resolution), to extend the Z-range in comparison to the corresponding unbinned modes we described before.

Note: When depth is in NFOV mode, the RGB camera has better pixel overlap in 4:3 than in 16:9 resolutions.

image

3 – SDK

Azure Kinect SDKs diagram

The K4A DK consists on the following SDKs:

Joint hierarchy

    • Features
      • Body Segmentation
      • Anatomically correct skeleton for each partial or full body in FOV.
      • Unique identity for each body.
      • Can track bodies over time.
    • Tools
      • Viewer tool to track bodies in 3

2 responses to “Azure Kinect–Hardware specifications in detail”

  1. […] The code is simple and easy to read. We have now our first application and we can already point our device. Now we need to get to our cameras… before programming this we need to understand the different views we have available. You can read about them in this post. […]

    Like

  2. […] You can check more information on the hardware and software of Azure Kinect in this post “AZURE KINECT Azure Kinect–Hardware specifications in detail”. […]

    Like

Leave a Reply

Your email address will not be published. Required fields are marked *

I’m Ivana

I’m a Technology Advocate who is living proof that Technology changes lives. I started my career with Microsoft from my small city (Salta), in Argentina. Now I train people and teams globally in the powerful international language of Tech. I inspire people from all walks of life to become world citizens and “geeks” like me who dream big and achieve amazing things. As a proud woman in Tech, content creator and public speaker I love travelling, connect and create magic moments of transformation; and I learn from everyone I meet. When I am not on the road, I am home with my husband and two dogs. My adventurous spirit in my work life is echoed in my love for Disney movies like Moana and Lilo & Stitch. Who knows “how far I’ll go” on my journey, but I know the power of Technology can get me there!

Let’s connect