BLE (Bluetooth Low Energy) Room occupancy detection

This project uses BLE technology to estimate the occupancy of a room. It capitalizes on the widespread use of BLE-capable devices, such as smartphones, smartwatches, and tablets, which individuals carry with them as part of their daily lives. The system is designed to be accurate, scalable, and privacy-friendly, offering a reliable solution for room occupancy detection.

This section provides an introduction to the project, while the deployment process is detailed here.

Introduction

This project is based on the work of Justin Steven Herbrich, whose project can be accessed here. Herbrich introduced an approach that capitalizes on the ubiquity of BLE-capable devices. The core concept is to utilize BLE technology to detect nearby devices to estimate the occupancy of a room. The primary objective of this project is to enhance and refine Herbrich's approach, making it more accurate and scalable.

Herbrich's approach uses BLE technology to estimate the occupancy of a room. This approach is particularly relevant due to the widespread use of BLE-capable devices, which range from smartphones and tablets to laptops and other devices. By using BLE, Herbrich's project aims to provide an accurate and flexible solution to estimate the occupancy of a room.

However, while Herbrich's work showcases promise, it also leaves room for improvement, particularly in the realms of accuracy and scalability. This project aims to enhance the precision of occupancy estimates and expand the system's scalability. It does so by facilitating the simultaneous monitoring of multiple rooms and by extending the range of detectable devices within a single room.

Motivation

The motivation behind this project is rooted in the critical need for accurate and scalable room occupancy detection systems. The significance of such systems has been magnified by recent global events that necessitate stricter adherence to social distancing and capacity limitations within indoor spaces.

Traditional methods for estimating room occupancy, such as physical headcounts or camera-based systems, have limitations. They often require manual intervention, which can be time-consuming, labor-intensive, and susceptible to human error. Additionally, camera-based systems raise privacy concerns and may not be viable in scenarios where individuals require anonymity.

Bluetooth Low Energy (BLE) technology presents an elegant and privacy-friendly solution to this problem. It capitalizes on the widespread use of BLE-capable devices, such as smartphones, smartwatches, and tablets, which individuals carry with them as part of their daily lives.

Improving Herbrich's Approach

Current Approach

As outlined in Herbrich's original work, and verified by an examination of the current source code, the system identifies a person in a room through the following criteria:

The peripheral is within the scanner's range.
The peripheral allows a connection.
The received signal strength indicator (RSSI) surpasses a predefined threshold, typically set at -100 dBm.
The peripheral must possess a public address, which is a unique MAC address used for device identification.

When all of these conditions are met, the system classifies the peripheral as a consumer device and, consequently, associates it with a person in the room. While these criteria help narrow down the devices that can be attributed to individuals, they also introduce certain limitations. For instance, if a person carries more than one consumer device, the system counts each device as a separate individual. Moreover, the restriction to public addresses is becoming less viable as an increasing number of devices employ random addresses to protect user privacy. [1]
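The criteria above can be sketched as a simple predicate. This is only an illustration, not Herbrich's actual code: the `Peripheral` record and its field names are hypothetical, and a peripheral is assumed to be "within range" simply because the scanner reported it.

```python
from dataclasses import dataclass

RSSI_THRESHOLD_DBM = -100  # threshold named in the text


@dataclass
class Peripheral:
    # Hypothetical record of one scan hit; the field names are illustrative.
    rssi: int
    connectable: bool
    has_public_address: bool


def counts_as_person(p: Peripheral, threshold: int = RSSI_THRESHOLD_DBM) -> bool:
    """Apply the listed criteria to a peripheral the scanner has already seen."""
    return p.connectable and p.rssi > threshold and p.has_public_address


def estimate_occupancy(peripherals) -> int:
    """Every qualifying device is counted as one person."""
    return sum(1 for p in peripherals if counts_as_person(p))
```

Two qualifying devices carried by the same person yield an estimate of 2, which is the double-counting limitation described above.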

Improving the Accuracy of Occupancy Detection

The enhancements to our system focus on two aspects of BLE device interaction: analyzing advertising data and, where possible, connecting to devices to retrieve more comprehensive data. The objective is to gather enough information about each device to classify it as a smartphone or otherwise. This classification is essential for improving the accuracy of room occupancy estimates, and it rests on the assumption that most individuals carry at least one BLE-capable smartphone. This refinement is achieved while safeguarding user privacy: only necessary, non-personal data is collected.

Complementing these advancements is the introduction of support for multiple BLE scanners. This strengthens the system's reliability and scalability. Multiple scanners operating in tandem form a robust and adaptable solution to meet the demands of varying room setups and sizes.

Analyzing Advertising Data

BLE advertising data consists of information broadcast by BLE devices in short, intermittent packets. In the initial stage, where a connection to the device is not required, we gather insights about the type of device by inspecting the advertising data. We focus primarily on the following elements:

Shortened Local Name (Data type: 0x08)
Complete Local Name (Data type: 0x09)
Appearance (Data type: 0x19)

The Local Name, while not commonly found in advertising data, can be a valuable resource for identifying the type of device. As outlined in the Core Specification Supplement, Part A, Section 1.2, the local name typically gives the device a human-readable name, for example "Speaker XY". The Appearance value serves as a useful indicator for recognizing smartphones. It is a 2-byte value that encodes the device's category. Notably, values within the range of 0x0040 to 0x007F indicate a Phone. [2]
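As a minimal sketch of this first stage, the following code splits a raw advertising payload into its AD structures (one length byte, one type byte, then the data) and checks the three data types listed above. The function names are illustrative and not taken from the scanner's source.

```python
import struct

AD_SHORT_NAME = 0x08
AD_COMPLETE_NAME = 0x09
AD_APPEARANCE = 0x19


def parse_adv_payload(payload: bytes) -> dict:
    """Split an advertising payload into {ad_type: data} entries."""
    fields = {}
    i = 0
    while i + 1 < len(payload):
        length = payload[i]  # length covers the type byte plus the data
        if length == 0:
            break  # zero length marks padding / end of data
        ad_type = payload[i + 1]
        fields[ad_type] = payload[i + 2 : i + 1 + length]
        i += 1 + length
    return fields


def local_name(fields: dict):
    """Prefer the complete local name, fall back to the shortened one."""
    raw = fields.get(AD_COMPLETE_NAME) or fields.get(AD_SHORT_NAME)
    return raw.decode("utf-8", "replace") if raw else None


def is_phone_appearance(fields: dict) -> bool:
    """True if the Appearance value lies in the Phone range 0x0040 to 0x007F."""
    raw = fields.get(AD_APPEARANCE)
    if raw is None or len(raw) != 2:
        return False
    (appearance,) = struct.unpack("<H", raw)  # little-endian 16-bit value
    return 0x0040 <= appearance <= 0x007F
```

For example, a payload carrying flags, an Appearance of 0x0040 and the complete local name "Speaker XY" could be inspected with `fields = parse_adv_payload(payload)` followed by `local_name(fields)` and `is_phone_appearance(fields)`.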

Connecting to the Device and Reading the Device Info Service

While advertising data analysis offers an initial classification, it does not always provide enough information. The second improvement therefore involves connecting to the BLE device and reading its Device Information Service (referred to here as the "Device Info Service"). This service (UUID 0x180A) might contain relevant characteristics such as:

Manufacturer Name String (UUID 0x2A29)
Model Number String (UUID 0x2A24)

For instance, an iPhone provides the following information (note that the model number is Apple's hardware identifier rather than the marketing name):

Manufacturer Name String: Apple Inc.
Model Number String: iPhone13,4

This two-pronged approach, starting with advertising data analysis and progressing to device connection and service interrogation, ensures more reliable and accurate classification. It also accommodates devices that do not directly advertise their names in the advertising data but include this information in the Device Info Service. [3]
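Once the two characteristics have been read, their values can be combined into a single descriptor string. The sketch below assumes the values arrive as raw bytes keyed by characteristic UUID and that they are joined with a space, mirroring the `descriptor` field in the sample scan result further down. How the scanner actually builds its descriptor is not shown in this README, so treat the function as hypothetical.

```python
from typing import Optional

MANUFACTURER_NAME_UUID = 0x2A29
MODEL_NUMBER_UUID = 0x2A24


def build_descriptor(characteristics: dict) -> Optional[str]:
    """Join manufacturer name and model number into one descriptor.

    `characteristics` maps a 16-bit characteristic UUID to the raw bytes read
    from the device (hypothetical input format).
    """
    parts = []
    for uuid in (MANUFACTURER_NAME_UUID, MODEL_NUMBER_UUID):
        raw = characteristics.get(uuid)
        if raw:
            # Values are UTF-8 strings; some devices pad them with NUL bytes.
            text = raw.decode("utf-8", "replace").strip("\x00 ")
            if text:
                parts.append(text)
    return " ".join(parts) if parts else None
```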

Multiple BLE Scanners

The project has also tackled scalability and precision challenges by introducing support for multiple BLE scanners. In Herbrich's original approach, room occupancy detection was limited to a single BLE scanner operating in a room at a given time, potentially resulting in data duplication and room-specific constraints.

In the new approach, multiple scanners can be deployed within the same or different rooms. These scanners are managed by the central system to ensure accurate data aggregation from various rooms. This is achieved by assigning each scan a unique scan ID and a room ID. The scan ID distinguishes between scans, while the room ID identifies the room in which the scanner is located. The Bluetooth address of the device is included in the scan results to prevent the same device from being counted twice. However, only the hash of the address is stored.
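The deduplication idea can be sketched as follows. The choice of SHA-256 and the optional salt are assumptions made for illustration; this README does not state which hash function the project uses.

```python
import hashlib


def hash_address(addr: str, salt: bytes = b"") -> str:
    """Return a hex digest of a Bluetooth address (SHA-256 is an assumption)."""
    normalized = addr.strip().lower().encode("ascii")
    return hashlib.sha256(salt + normalized).hexdigest()


def unique_devices_per_room(scan_results) -> dict:
    """Count distinct devices per room.

    `scan_results` is an iterable of (room_id, scan_id, addr) tuples, so the
    same device reported by two scanners in one room is counted only once.
    """
    rooms = {}
    for room_id, _scan_id, addr in scan_results:
        rooms.setdefault(room_id, set()).add(hash_address(addr))
    return {room: len(hashes) for room, hashes in rooms.items()}
```

Because only the digest is kept, two reports of the same address still collapse into one entry, while the stored value no longer reveals the address directly.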

Implementation

The project is structured into two main components: the central system and the BLE scanner. The central system orchestrates the BLE scanners, collects and analyzes the data, and stores the results in a database. The BLE scanner scans for BLE devices and publishes the results to the central system. As seen in the overview diagram, the central system runs Node-RED, which orchestrates the BLE scanners, collects and analyzes the data, and sends it to the database for persistence. The BLE scanner is implemented on an ESP32 running MicroPython and communicates with the central system via MQTT.

The System Architecture

The system architecture is detailed in this section, offering a step-by-step explanation of its operation. A visual representation of the system's workflow can be found in the flowchart diagram.

Each scan is initiated by sending an MQTT message. The message includes a room ID that identifies the room to scan; a room ID of all can be specified to scan every room. We suggest triggering the scans on a timer using the TIME_BETWEEN_SCANS_MS parameter in the config.json file, which is the default configuration.

{
  "room": "myRoom"
}

BLE Scanners commence scanning only if one of the following conditions is met:

The room ID within the message matches the scanner's designated room.
The room ID is set to all.

The trigger for initiating scans is published to the topic roomUtilization/doScan, while the BLE Scanner subscribes to the same topic.
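The trigger check described above can be sketched as a small function. The function name and the choice to silently ignore malformed payloads are assumptions; the topic and the `room` key come from the text.

```python
import json

DO_SCAN_TOPIC = "roomUtilization/doScan"


def should_scan(topic: str, payload: bytes, own_room: str) -> bool:
    """Decide whether a scanner assigned to `own_room` should start a scan."""
    if topic != DO_SCAN_TOPIC:
        return False
    try:
        message = json.loads(payload)
    except ValueError:
        return False  # malformed payloads are ignored (assumption)
    if not isinstance(message, dict):
        return False
    # Scan if the message targets this room or every room.
    return message.get("room") in (own_room, "all")
```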

Once scanning is initiated, a UUID is generated for the scan, and the BLE Scanner collects scan results and publishes them to a designated topic. The topic is structured as roomUtilization/scans/myRoom, where myRoom is the room's unique identifier. The central system, responsible for data aggregation, subscribes to the general topic roomUtilization/scans/+, so it receives data from all scanner-equipped rooms. As seen in the sample scan result below, each scan includes a timestamp, a room identifier, a UUID, and the actual data. The scan results are also partitioned to minimize the amount of data that the microcontroller has to hold in memory. This partitioning allows the system to handle a larger number of devices and scan results before running into memory constraints. Because MicroPython does not give direct control over memory allocation, partitioning most likely mitigates the problem rather than solving it. A fully reliable system would require reimplementing the scanner in C.

{
  "timestamp_utc": 1711724776000,
  "room": "myRoom",
  "part": 1,
  "totalParts": 3,
  "uuid": "84086033a9225f2233c338630de705cf",
  "scanresult": [
    {
      "connAttempts": 1,
      "rssi": -79,
      "addr": "xx:xx:xx:xx:xx:xx",
      "connectable": true,
      "connSuccessful": true,
      "descriptor": "Apple Inc. iPhone15,3",
      "manufacturerCode": 76
    },
    {
      "connAttempts": 0,
      "rssi": -81,
      "addr": "xx:xx:xx:xx:xx:xx",
      "connectable": true,
      "connSuccessful": false,
      "descriptor": "2;0:04:2F:C1:39:8F;eTRV",
      "manufacturerCode": null
    },
    {...}
  ]
}
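The partitioning shown in the sample can be sketched as a generator that builds one message at a time, so only a single chunk has to be held in memory while it is serialized. The message keys follow the sample above; the function name, the chunk size, and the 1-based `part` numbering are assumptions.

```python
def partition_scan(results, room, scan_uuid, timestamp_utc, part_size=2):
    """Yield one message dict per chunk of at most `part_size` scan results."""
    if part_size < 1:
        raise ValueError("part_size must be at least 1")
    # Ceiling division; an empty scan still produces one (empty) part.
    total_parts = max(1, -(-len(results) // part_size))
    for index in range(total_parts):
        chunk = results[index * part_size : (index + 1) * part_size]
        yield {
            "timestamp_utc": timestamp_utc,
            "room": room,
            "part": index + 1,
            "totalParts": total_parts,
            "uuid": scan_uuid,
            "scanresult": chunk,
        }
```

Each yielded dict can be serialized and published to roomUtilization/scans/myRoom before the next one is built.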

To accommodate multiple scanners across various rooms that each send partitioned data, the central system employs a waiting mechanism with a predetermined timeout of 1 hour. During this interval, it collects scan results from the different scanners, merges them by uuid, and drops scans older than 1 hour. When a scan is complete, that is, when all of its partitions have been received, the central system persists the data in the InfluxDB database. Because each scanner's results are stored as received, the same device can appear more than once in the database. This is not a problem, since duplicates are handled at a later stage, and it makes it possible to analyze the data of each scanner separately. The data is stored in InfluxDB 2 in the following format:
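The waiting mechanism can be illustrated with the sketch below. In the project this logic lives in the Node-RED flow, so the Python class is only a model of the behavior, not the actual implementation. It assumes that every scanner's scan carries its own uuid and that `part` is numbered from 1; the class and method names are hypothetical.

```python
import time

SCAN_TIMEOUT_S = 60 * 60  # 1 hour, as described above


class ScanAssembler:
    """Collect partitioned scan messages and merge them by uuid."""

    def __init__(self, timeout_s=SCAN_TIMEOUT_S, clock=time.monotonic):
        self._timeout_s = timeout_s
        self._clock = clock
        self._pending = {}  # uuid -> {"first_seen": float, "parts": {part: results}}

    def add(self, message):
        """Store one partition; return the merged results once all parts arrived."""
        now = self._clock()
        self._drop_expired(now)
        entry = self._pending.setdefault(
            message["uuid"], {"first_seen": now, "parts": {}}
        )
        entry["parts"][message["part"]] = message["scanresult"]
        if len(entry["parts"]) < message["totalParts"]:
            return None  # still waiting for more partitions
        del self._pending[message["uuid"]]
        merged = []
        for part in sorted(entry["parts"]):
            merged.extend(entry["parts"][part])
        return merged

    def _drop_expired(self, now):
        """Discard incomplete scans whose first partition is older than the timeout."""
        expired = [
            uuid
            for uuid, entry in self._pending.items()
            if now - entry["first_seen"] > self._timeout_s
        ]
        for uuid in expired:
            del self._pending[uuid]
```

The merged list returned by `add` corresponds to the point at which the central system would write the scan to InfluxDB.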

| Field Name | Description |
| --- | --- |
| `connAttempts` | Number of connection attempts. |
| `connSuccessful` | Boolean indicating whether the connection attempt was successful (`true`/`false`). |
| `connectable` | Boolean indicating whether the device is connectable (`true`/`false`). |
| `descriptor` | Description of the device. |
| `manufacturerCode` | Manufacturer code of the device. |
| `rssi` | Received Signal Strength Indicator (RSSI) of the device. |

| Tag Name | Description |
| --- | --- |
| `addr` | Address of the device. |
| `room` | Room where the scan was performed. |

On demand, the central system retrieves the data from the database, analyzes the scans, and displays the results on a Node-RED Dashboard.

The current system employs a rule-based solution for classifying scan results into two categories: smartphones and unknown devices. This classification is vital for estimating room occupancy accurately.

The heart of the classification process lies in comparing the scan results with a list of known smartphone models. If a match is found, the system counts the device as a smartphone. However, if no match is found in the known smartphones list, the device is considered an unknown device. Its descriptor is then added to the list of previously encountered unknown devices if not already present.
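A minimal sketch of this rule-based classification follows. The matching rule (case-insensitive substring match against the descriptor) is an assumption, since the README does not specify how a scan result is compared with the list; the function names are hypothetical as well.

```python
def matches_known_smartphone(descriptor: str, known_smartphones) -> bool:
    """Case-insensitive substring match against the known-models list (assumed rule)."""
    text = descriptor.lower()
    return any(model.lower() in text for model in known_smartphones)


def classify_scan(descriptors, known_smartphones, unknown_devices) -> int:
    """Return the smartphone count and record new unknown descriptors in place."""
    smartphone_count = 0
    for descriptor in descriptors:
        if matches_known_smartphone(descriptor, known_smartphones):
            smartphone_count += 1
        elif descriptor not in unknown_devices:
            unknown_devices.append(descriptor)
    return smartphone_count
```

With the two descriptors from the sample scan result and a known-models list containing "iPhone", the function would count one smartphone and add the eTRV descriptor to the unknown list.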

The chart above shows example data collected over a few hours in my studio: all the manufacturers found by the scanner (only for devices that advertise a manufacturer).

This chart shows the average RSSI of the devices found in the room over time.

Configuration

The scanner can be configured by modifying the config.json file. The following parameters can be adjusted:


| Option | Description | Default Value |
| --- | --- | --- |
| `MQTT` | Use MQTT. If set to False, ALLOW_CONFIG_UPDATE and SEND_MQTT will be false | true |
| `SEND_MQTT` | Transfer Scan Data via MQTT after Scan | true |
| `MQTT_SEND_TIMEOUT_MS` | Timeout in milliseconds for sending MQTT messages | 5000 |
| `ALLOW_CONFIG_UPDATE` | Allow updating configuration via MQTT | true |
| `LOGGING` | Print Scanning Process, Results, and other events | true |
| `LOG_LEVEL` | Log Level (0 = Debug, 1 = Info, 2 = Warning, 3 = Error) | 1 |
| `NTP_HOST` | NTP Server to use for time synchronization | "pool.ntp.org" |
| `SSID` | WiFi SSID | "FunnyWifiName" |
| `NETWORK_KEY` | WiFi Password | "DefaultRouterPasswordThatYouShouldChange" |
| `MQTT_BROKER_ADDRESS` | MQTT Broker Address | "localhost" |
| `MQTT_USER` | MQTT User (set to None if no user is needed) | "User" |
| `MQTT_PASSWORD` | MQTT Password (set to None if no password is needed) | "Password" |
| `MQTT_ROOM_NAME` | MQTT Room Name | "myRoom" |
| `MQTT_BASE_TOPIC` | Base MQTT Topic | "roomUtilization/" |
| `TIME_BETWEEN_SCANS_MS` | Time in milliseconds between each scan | 30000 |
| `SCAN_DURATION_MS` | Duration of the scan in milliseconds | 10000 |
| `SCAN_CONNECTION_TIMEOUT_MS` | Timeout in milliseconds to connect to a device | 5000 |
| `ACTIVE_SCAN` | Active Scan (True) or Passive Scan (False) | true |
| `FILTER_RSSI` | Only Include Devices with RSSI higher than this value (0 for no filter) | -100 |

Furthermore, these settings can be updated directly on the device via MQTT. By default, the device subscribes to the topic roomUtilization/updateConfig and listens for configuration updates. The following data can be used to update the configuration:

{
    "MQTT": true,
    "SEND_MQTT": true,
    "MQTT_SEND_TIMEOUT_MS": "5000",
    "MQTT_BROKER_ADDRESS": "TODO_CHANGE_ME",
    "MQTT_USER": "TODO_CHANGE_ME",
    "MQTT_PASSWORD": "TODO_CHANGE_ME",
    "MQTT_ROOM_NAME": "myRoom",
    "MQTT_BASE_TOPIC": "roomUtilization/",
    "SSID": "TODO_CHANGE_ME",
    "NETWORK_KEY": "TODO_CHANGE_ME",
    "ALLOW_CONFIG_UPDATE": true,
    "TIME_BETWEEN_SCANS_MS": -1,
    "SCAN_DURATION_MS": 10000,
    "SCAN_CONNECTION_TIMEOUT_MS": 5000,
    "ACTIVE_SCAN": true,
    "FILTER_RSSI": -100,
    "LOGGING": true,
    "LOG_LEVEL": 1,
    "NTP_HOST": "pool.ntp.org"
}

If a value is set to null or is not present, it will not be updated, so you can also send a partial config update containing just the options you want to change. Note that the device will reboot to apply the changes in the configuration. This feature can be disabled by setting ALLOW_CONFIG_UPDATE to false in the config.json file.
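The partial-update rule can be sketched as a merge function. This is an illustration of the described behavior rather than the scanner's actual code; in particular, ignoring keys that do not exist in the current configuration is an assumption.

```python
import json


def apply_config_update(current: dict, payload: str) -> dict:
    """Merge a (possibly partial) JSON update into the current configuration."""
    if not current.get("ALLOW_CONFIG_UPDATE", True):
        return dict(current)  # updates via MQTT are disabled
    update = json.loads(payload)
    merged = dict(current)
    for key, value in update.items():
        if value is None:
            continue  # null means "leave this option unchanged"
        if key not in current:
            continue  # unknown options are ignored (assumption)
        merged[key] = value
    return merged
```

For example, sending only `{"SCAN_DURATION_MS": 20000}` would change the scan duration and leave every other option untouched.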

Future Work

In-Depth Analysis of Smartphone Characteristics

One possibility is to conduct a more comprehensive analysis of services and characteristics related to smartphones. For instance, the "Phone Alert Status" characteristic could offer additional insights to further refine smartphone identification.

Comprehensive Examination of Advertising Data

Advertising data, integral to BLE communication, holds further potential for insights. Specifically, the "Class of Device" data type (0x0D), with bits 2-3 set to 1 (Minor Device Class: Smartphone) and bits 4-7 set to 0, could also be used to identify smartphones. The manufacturer-specific data contained in the advertising data could be used for the same purpose. An article that might offer more insights into this topic can be found here.
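A possible check for the Class of Device value is sketched below. It decodes the 3-byte little-endian field and tests the minor class bits described above. The additional test of the Major Device Class (bits 8-12 equal to Phone) is not mentioned in the text and is added here as an assumption, because the minor class is only meaningful within its major class.

```python
MAJOR_CLASS_PHONE = 0x02       # Major Device Class: Phone
MINOR_CLASS_SMARTPHONE = 0x03  # bits 2-3 set, bits 4-7 clear


def is_smartphone_class_of_device(cod_bytes: bytes) -> bool:
    """Check a 3-byte Class of Device value (little-endian) for Phone/Smartphone."""
    if len(cod_bytes) != 3:
        return False
    cod = int.from_bytes(cod_bytes, "little")
    major = (cod >> 8) & 0x1F  # bits 8-12
    minor = (cod >> 2) & 0x3F  # bits 2-7
    return major == MAJOR_CLASS_PHONE and minor == MINOR_CLASS_SMARTPHONE
```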

Distinguishing Moving and Stationary Devices

Differentiating between moving and stationary devices is another possibility. Fluctuations in the received signal strength indicator (RSSI) could be used for this purpose. For instance, stable RSSI values over time might suggest that a device is stationary (think printers or TVs), while varying RSSI readings could imply movement. This insight can be valuable for excluding stationary devices from occupancy estimates. Note that this approach may face challenges with devices that frequently change their random addresses, as these would not be recognized as the same device over time. However, in combination with advanced techniques like the "address-carryover algorithm," it could prove to be a viable solution.
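One simple way to explore this idea is to look at the spread of a device's RSSI readings over time. The sketch below uses the standard deviation as the measure of fluctuation; both the 3 dB limit and the minimum of 5 samples are placeholder values chosen for illustration, not measured thresholds.

```python
from statistics import pstdev


def is_probably_stationary(rssi_samples, max_stdev_db=3.0, min_samples=5) -> bool:
    """Guess whether a device is stationary from the spread of its RSSI readings."""
    if len(rssi_samples) < min_samples:
        return False  # too little data to decide
    return pstdev(rssi_samples) <= max_stdev_db
```

In practice the thresholds would have to be calibrated per room, since reflections and people walking by also change the RSSI of a device that does not move.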

Address Hashing for Enhanced Privacy ✔

Address privacy is a growing concern. While the current solution doesn't persistently store peripheral addresses for privacy reasons, it could be worthwhile to hash the addresses on the ESP32 itself. This approach could further enhance user privacy while keeping the system fully functional.

Streamlined Deployment with Docker ✔

To make the deployment process more accessible and user-friendly, automation using tools like Docker could be explored. This move toward automation aims to simplify setup and configuration, reducing the complexities often associated with system deployment.

Classifier-Based Smartphone Identification

In the quest for accuracy, the possibility of a shift from rule-based smartphone identification to machine learning-based classification could be contemplated. By developing a classifier, the collected data can be leveraged to create more intelligent and adaptive identification algorithms.

Integration with Home Assistant

Lastly, the exploration of integrating the system with popular home automation platforms like Home Assistant could be considered. This integration would not only enhance the system's usability but also open up new avenues for smart home applications and interactions.

Aggregate datapoints before displaying charts in Dashboard

Currently, the dashboard queries the database for every individual datapoint. This is inefficient and can be improved by aggregating the data in the InfluxDB query before it is displayed. For example, a step size could be calculated from the chosen period, and the data could be aggregated per step. This would reduce the amount of data that needs to be transferred and displayed in the dashboard. Another option would be to let the user choose the step size in the dashboard.
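The step-size idea can be sketched as follows. The target of 200 points per chart, the 30-second minimum step (matching the default TIME_BETWEEN_SCANS_MS), the bucket name passed in by the caller, and the choice of the `rssi` field are all assumptions made for illustration. The generated query uses Flux's `aggregateWindow` function.

```python
import math


def aggregation_step_s(period_s: int, max_points: int = 200, min_step_s: int = 30) -> int:
    """Pick a window size so that a chart shows at most `max_points` datapoints."""
    return max(min_step_s, math.ceil(period_s / max_points))


def build_flux_query(bucket: str, period_s: int, max_points: int = 200) -> str:
    """Build a Flux query that averages RSSI values per aggregation window."""
    step = aggregation_step_s(period_s, max_points)
    return (
        f'from(bucket: "{bucket}")\n'
        f"  |> range(start: -{period_s}s)\n"
        f'  |> filter(fn: (r) => r._field == "rssi")\n'
        f"  |> aggregateWindow(every: {step}s, fn: mean, createEmpty: false)"
    )
```

For a period of one day (86400 seconds) this yields windows of 432 seconds, so the dashboard receives 200 averaged points per series instead of every raw scan result.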

Papers

Estimating indoor occupancy through low-cost BLE devices
