Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

A Power-Efficient Audio Acquisition System for Smart City Applications

A Power-Efficient Audio Acquisition System for Smart City Applications Acoustic noise has adverse effects on human activities. Aside from hearing impairment and stress-related illnesses, it can also interfere with spoken communication, reduce human performance and affect the quality of life. As ur- banization is intensifying, the potential benefits of reducing noise pollution in smart-city environments are extensive. Noise levels can be collected and analyzed using a wireless sensor network which can monitor the noise level by using microphones. However, every wireless system struggles in terms of the battery requirements needed for continuous data collection and monitoring. In this paper, the design of a testbed for a smart microphone system is presented. To save power, a microcontroller and an Analog-to-Digital Converter (ADC) dynamically switch between high and low power modes in response to environmental noise. Specifically, the high powered components are triggered by a spike in the acoustic noise level. Three wireless technologies, WiFi (2.4 GHz), Bluetooth Low Energy (BLE) 4.0 and Zigbee were examined. According to the results, the power consumption of a node can be lowered by 97% when idle based on the testbed. Keywords: Low-power audio; audio processing; audio coherence; audio classification, noise pollution. 1. Introduction of different sounds such as human voice, traffic, con- struction, or air transport that is currently present in any A common problem in urban cities is acoustic noise school or hospital. Mapping the amount of sound clas- pollution [1]. Noise pollution is an abundance of noise sified as human voice within an area grants the ability in a concentrated area that can harm humans both phys- of the public to observe the amount of human presence ically and mentally [2]. To alleviate this problem, the at the time. This can also help citizens to determine if first step is to understand and analyze what acoustic an area within the city such as a park or restaurant is noise citizens are being exposed to daily. Smart cities comfortable to visit at a specific time in the day. enable the introduction of new technology to the public Analyzing the audio data in an urban environment can through different services that optimize resource usage, become complicated depending on the location within transportation, utilities and more. In a smart city, large the city [5]. There is a variety of challenges such as amounts of data are collected and utilized [3]. This is privacy and placement [6]. Privacy has always been a done to predict, analyze, and observe trends that happen big issue with regard to the collection of data in a pub- throughout the city based on various sensors. lic setting. An invasion of privacy could be claimed by Audio data is an example of sensory information be- anyone who feels uncomfortable about the idea of the ing collected. The amount of human comfort in differ- voices being recorded. ent locations within a city can be found by analyzing Another challenge is the reliability of the data, which noise levels collected by multiple microphones. By cre- depends on its accuracy in modeling the noise within ating a tool that can visualize the levels of acoustic noise an area. Data is considered reliable if the extraneous around the city, its people are given the ability to react variables that could skew the noise model of the area and adjust accordingly. In future iterations, it may also are minimal. A major concern is the power consump- be possible to show the type of sounds present if a neural tion of various microphones and analog-to-digital con- network is utilized [4]. An example is the classification verters (ADCs) when collecting audio data [7]. This is especially true when the microphone is set to sample throughout the day, meaning that it will have minimum Corresponding author Email address: petros@uoguelph.ca (Petros Spachos) downtime. Many systems try to solve this problem by Preprint submitted to Internet of Things January 23, 2020 arXiv:2001.08163v1 [cs.NI] 21 Jan 2020 sampling at different points in the day rather than being level during a day, it can easily miss many important continuous. Although this is a good method for saving events since it is only awake less than 20% of the time. energy, it can miss key information due to the scheduled The work in [9] presents a node that collects audio sampling. level information as well as other types of data. It is The work presented relates to a framework for reduc- designed to wake up based on a timer interrupt or when the accelerometer is triggered. The method of wireless ing power consumption in Micro-Electro-Mechanical transmission involves the Zigbee protocol to ensure low System (MEMS) microphone ADCs for an acoustic en- power. The model was verified by measuring the audio vironmental monitoring system. A testbed to prove the level of varying sinusoidal waves, ranging from 31.5 Hz power savings has been developed. The contributions of to 16 kHz. This was recorded with an MI 6201 MULTI- this paper are as follows: NORM sound level meter. The average power con- • A testbed for environmental sound detection us- sumption of the prototype is 45 mW. This is extremely ing a microphone was designed. The microphone low when compared to most devices, allowing operation was selected after experimentation on the energy for up to 7 days. This prototype is a good starting point requirements and the accuracy of four different mi- but requires a longer operating time to reduce the need crophones. for continuous maintenance. The work in [10] features a prototype using a mini • An amplifier and a threshold circuit were designed PC. Although the system is not fully wireless, it is a for use with audio collection systems. good solution for indoor environments. It was veri- fied using a calibrated sound level meter using an A- • Three wireless technologies were used to exam- weighted filter. In [11], a prototype was built for use ine the effectiveness of the proposed energy-saving with city buses. This had a power supply unit connected scheme. Specifically, WiFi (2.4 GHz), Bluetooth to the city bus and used 3G as well as GPS to wirelessly Low Energy (BLE) 4.0 and Zigbee were used. transmit data along with a location stamp. As described in the article, one challenged faced was attempting to The results are compared to a reference system to filter out the noise generated from the bus, this presents prove the viability and demonstrate the improvements. a big problem with the accuracy of the data if it is meant The rest of the paper is organized as follows; Section 2 to represent noise level from a human perspective. reviews the related works and Section 3 discusses the A pilot study was presented in [12], which involved proposed system architecture. Section 4 details the pro- an embedded system to collect acoustic noise. The cedure involved and the results found during analysis power consumption was not specified but the accuracy are presented in Section 5. The conclusion is in Sec- of collected data was discussed extensively. A standard tion 6. sound level meter was used to calibrate the system and verify the analog filters. A noise monitoring node based around a Raspberry Pi 2B and USB microphone was 2. Related Work developed in [13]. A sound level meter was used to ver- In recent years, many research works involve wire- ify all measurements and power over Ethernet supplied less sensor networks for acoustic noise collection [8, 9, each node with energy. Although not wireless, it was 10, 11, 12, 13, 14, 15, 16, 17]. Their main focus is the still considered a flexible solution for an acoustic sensor development of systems that have low enough power to network. reliably operate with little to no human intervention. In [14], a low-cost sensor network was developed to A promising implementation has been developed target the problem of noise pollution. It has an oper- in [8]. The work focuses on developing an ultra low ating power of 89 mW when collecting data. A sound power sensor node that is designed to wake up periodi- level meter was used to calibrate the system, ensuring cally to collect data, it was programmed to wake for ten accurate measurements. Despite the low power, the pro- minutes every hour. To accomplish this, an ultra-low- totype can run for less than 4 days, which is not very power microcontroller was used along with an XBee. realistic for a full implementation. A similar implemen- The total power when transmitting was 810 mW while tation was followed in [15]. The authors in [16] show the transmitting power was 132 mW. A handheld Sound preliminary results of a wireless noise collection sys- Pressure Level (SPL) meter was used to verify the ac- tem, mainly discussing the accuracy of this method and curacy of the audio level collection. Although this pro- giving recommendations for future designs. In [17], the totype may be sufficient to get the overall average noise authors focus on a prototype involving low power lis- 2 >_ Threshold (Digital) Raw Audio (Analog) Amp   Amplified Raw Processed Audio Audio (Analog) Magnitude (Digital) Figure 1: System overview. tening. They mainly focus on the networking protocols noise pollution by redesigning the air compressor. The involved and recommend the collection tree protocol. work presented in [28] highlights an improved way to collect audio data for voice activity detection. It uses Smartphones have been considered for noise collec- both audio and visual signals with a supervised learn- tion via crowdsourcing. The authors in [18] developed ing algorithm to detect what audio frames correspond an Android application for the use of acoustic data col- to human voice. The authors in [29] use a Gaussian lection. It gives the user two collection methods, man- mixture model-based classification approach to speech ual mode or automatic collection. Both require no ex- detection. It’s trained using spectral flow direction, a tra hardware for the user to start collecting the acoustic novel feature for this type of classification. In [30], a noise level around a city. The exact power being used multistage system is developed to analyze environmen- by the application is unknown. To verify the model, a tal sound and classify it. The authors claim to achieve Sauter SU-130 sound level meter was used and com- an accuracy higher than 90% for audio concept identifi- pared with the data collected via the app. A visual dash- cation. board is available to the end-user which displays the col- In comparison with these works, this paper introduces lected data on to a map. Another Android application a novel method that is portable and can be implemented was developed in [19]. This application allows the user into many in-place solutions. Specifically, this testbed to record and visualize the acoustic comfort in an area utilizes a wake-up circuit to conserve power when min- as well as push the results to a server for analysis from imal audio activity is present, while three different tech- the authors. The accuracy of their data was analyzed nologies are examined to find the most energy-efficient. and verified in a separate study [20]. The main focus of the papers reviewed so far have been focused on developing a low power solution for 3. System Architecture ambient sound collection. The other concern this paper The introduced system has four main components: attempts to address is finding an effective way to collect the microphone, the circuit, the microcontroller, and the and handle the sound data while being able to integrate wireless unit. An overview of the system can be seen in it into the design. The authors in [21] discuss a mobile Fig. 1. A microphone collects raw audio data and then system dedicated to the sensing of noise pollution in ur- the signal is amplified. This amplified signal is then fed ban environments. It focuses on determining the level to both a microcontroller and a custom threshold cir- of attenuation as part of the sound collection process in cuit. The microcontroller only collects data when the addition to gathering the actual audio levels. The work audio threshold has been broken, then finally sends the in [22] specializes in the tracking of audio data with an processed data to a server via a wireless technology. emphasis on vehicles. The data being sent is based on a certain threshold value which will omit any data below 3.1. Microphone it. Unfortunately, no schematic of the system was made and the power consumption was not presented. This Multiple microphones were considered to determine severely limits the usefulness of the claimed results and a good candidate which had a high ratio of accuracy to methodologies. power. They all have varying manufacturers, voltage In [27], a factory was examined and profiled based on ranges, and configurations. A summary showing the de- the acoustic levels. Their goal is to reduce the amount of tails of each microphone can be found in Table 1. While 3 Manufacturer Microphone Voltage Range (V) Configuration InvenSense 0.9 - 1.3 Analog ICS-40310 [23] Projections Unlimited, Inc. (PUI) 1.6 - 3.6 Analog PMM-3738-VM1010-EB-R [24] STMicroelectronics (ST) 1.52 - 3.6 Analog MP23ABS1 [25] STMicroelectronics (ST) 1.6 - 3.6 Digital MP34DT05-A [26] Table 1: MEMS microphone overview. V V out interrupt most are analog, one was picked to be digital and used dd as a control to compare with. The integrated ADC will lead to a higher accuracy than others, the disadvantage Mic Amp Envelope Threshold being a significant increase in power. The chosen microphone needed to be analog to prop- erly interface with the threshold circuit. To make the Figure 2: Top level view of analog circuit. system more sensitive to audio, an amplifier was used. 3.2. Circuit designed to have low noise and a high gain. The am- The circuit that was designed has three main compo- plifier is shown in Fig. 3 The negative feedback adds nents: the amplifier, the envelope filter, and the thresh- stability to make a much cleaner signal. The gain of the old circuit. amplifier can be determined from R and R as seen in f 1 the following: 3.2.1. Top level design Multiple circuits were designed to collect and ana- A = 1 + (1) lyze the data. An overview of the electrical components R can be seen in Fig. 2. This shows the three simplified where A is the gain of the amplifier. This is the trans- components. An external amplifier was needed to allow fer function for a typical non-inverting amplifier design. the ADC to accurately capture small changes in signal The common-mode input and output can be seen by the amplitude, essentially increasing the resolution. The en- following equation: velope filter averaged the positive amplitude using resis- tors and a capacitor. This is used to bypass the need for dd CM = (2) pre-processing before data transmission. The main fea- ture of this prototype, the wake-up circuit, was designed where the common-mode voltage is simply the sup- to output a high or low value based on a threshold. ply voltage divided by two. This is from the voltage divider defined by R . 3.2.2. Amplifier An amplifier was needed to make the signal recog- 3.2.3. Envelope nizable to an ADC with a standard resolution. This was also important for the wake-up circuit, making it much An envelope circuit was designed to average the mag- more sensitive to changes in noise. The amplifier was nitude of audio. The schematic of this circuit can 4 Vdd 1 Vdd Rf in Vout Vdd Vout M1 M2 R R 2 2 Vdd Figure 5: Threshold circuit. Parameter Value V 3.3 V dd Vin R 100 k R 1 k Figure 3: Amplifier circuit. R 1 M Vout in R 10 M R 500 k C R R 5 5 6 R 10 M R 100 k C 22 F Figure 4: Envelope circuit. C 100 nF C 4.5 F be seen in Fig. 4. This surpassed the need for pre- 3 processing. The purpose of C and R is to increase the 5 5 C 1.5 nF time constant, filtering the output. The second diode and C 9 F R are used to lower the common-mode voltage since 5 the XBee ADC uses a voltage reference of 1.2 V. Table 2: Circuit parameters. 3.2.4. Threshold circuit The threshold circuit was used to control an interrupt All of the circuit parameters used for this set of ex- on the microcontroller. This disabled sleep mode and periments can be found in Table 2. Using Eq. (1), the allowed the microcontroller to send audio data through gain can be calculated based on these parameters: the various wireless technologies for a brief period be- 100k fore going back into sleep mode. The schematic for the A = 1 + = 101 (3) wake-up circuit can be seen in Fig. 5. 1k Two transistors are used, M regulates the DC offset The gain becomes 101 , amplifying the signal by of the input, M is biased in such a way that it will satu- approximately 20 dB. This is necessary for the ADC to rate when the input is high. Once M is saturated, it ties properly recognize relatively small changes in the am- the output to ground. Alternatively, V will be close out bient acoustic noise. to V when M is in the cut-off region. This means dd 2 that the output will be grounded when the threshold is 3.3. Microcontroller broken and V when no significant noise is heard. At dd this time, the threshold can only be configured through A microcontroller was needed to correctly process hardware but may be configurable in future iterations. the analog data incoming from the amplifier. The 5 NodeMCU was picked for its low power, flexibility, and An advantage of the XBee is the integrated ADC, ability to harness many types of wireless technologies. which allows for wireless transmission without the need The NodeMCU uses an ESP8266 microprocessor with for an external microcontroller. a 10-bit ADC. When the microcontroller was not in sleep mode, it 4. Experimental Procedure connected to the server using either Wi-Fi (2.4 GHz), BLE 4.0, or Zigbee. The microcontroller then slept if 4.1. Microphone evaluation the noise threshold was not broken. The XBee S2C To examine which microphone was the most accu- is capable of converting an analog value to digital and rate, the coherence between the original waveform and transmitting the data using Zigbee. This bypasses the the recorded waveform was found. This shows how need for a microcontroller and will be tested alongside similar the recording is to the original signal in terms the microcontroller variant of the prototype. of the frequency response. Magnitude squared coher- ence was used on the pre-processed signals to create an 3.4. Wireless technology output which shows how much the frequency domains Three wireless technologies were tested to determine of the signals match. which was the lowest power, and show the sleep func- The coherence was conducted in Matlab by using tionality in a variety of cases. functions in the signal processing library. A procedure was used to filter the two signals and end up with a per- 3.4.1. Wi-Fi centage value that represents how closely the recording Wi-Fi is a common technology found in most homes. resembles the source. The simple list of steps includes: It allows users to connect to the internet wirelessly by using an access point. The power consumption and • Down-sample all recordings to 8 kHz. bandwidth are typically high, keeping up with the needs of the consumer [31]. For the application of an au- • Lineup source and recording. dio acquisition system, the bandwidth should not be as • Limit time to exactly 80 seconds. much of an issue when compared to the power being expended. • Find magnitude squared coherence. The NodeMCU has an integrated proprietary Wi-Fi (2.4 GHz) chip, giving it the capability to transmit data. • Take envelope of the signal. It was able to directly send data to a server. • Average the envelope. 3.4.2. Bluetooth Bluetooth is a popular technology used for many The purpose of down-sampling is to simplify the data wireless devices. It is designed to be a low-power so- and give every dataset a fair comparison without remov- lution that conserves energy to promote battery life in ing the frequencies that matter as well as increase the electronics. The power is typically less than Wi-Fi, but performance of the script. This uses decimation to make the throughput suffers [31]. the signal simpler, and thus easier to compare. The DSD TECH HM-10 was used to connect the mi- Although the magnitude squared coherence will con- crocontroller using BLE 4.0 [32]. It communicates with vert the signals to the frequency domain, it is still impor- a separate device to wirelessly transmit the data. tant to properly line up the source and recording. This eliminates any frequencies that happened before or after 3.4.3. Zigbee the recording interval. The offset can be found by com- Zigbee is a lesser-known protocol capable of low paring the similarities of the two signals and performing power transmission, meant to be used specifically for a correlation that is much more basic when compared to IoT (Internet of Things) [33]. This makes it a good the magnitude squared coherence. This offset was used match for the application of low power collection of au- to perfectly line up the signals by trimming the micro- dio. phone clip to exactly 80 seconds. The offset was based The XBee S2C module was being used [34]. This on a 10 second clip of the source and recording which provided a platform for the microcontroller to transmit was shown to be enough to properly line up the signals. using the Zigbee protocol. Similar to Bluetooth, the The magnitude squared coherence finds the cross power Zigbee experiments required a secondary unit with a spectral density in the numerator which can be seen be- matching XBee to provide wireless communication. low: 6 Algorithm 1 : Find the effective coherence value to a pair of Nex Tech analog speakers to provide a much higher quality sound when compared to the embedded y_source = resample(y_source, 8000, Fs_source) speakers on the Nexus 5 [35]. This was kept at a con- y_mic = resample(y_mic, 8000, Fs_mic) stant 30 cm away from the respective microphone which ensured consistent data. Each microphone was pointed y_source_clip = y_source(1:8000*10) directly at the speakers for a fair comparison. The vol- y_mic_clip = y_mic(1:8000*10) ume of the Nexus 5 was set to a constant of bars, or 53.3% while the speakers were kept at a maximum gain. mic_delay = finddelay(y_source_clip, y_source_clip) The Urban Sound dataset was used to test with, it y_mic = y_mic(mic_delay:mic_delay + 8000*80 - 1) contains audio categorized into four main sections [36]. This includes human, nature, music, and mechanical [corr, f] = mscohere(y_source, y_mic, [], [], [], 8000) based sounds. Twenty seconds of each category was used for testing, totaling 80 seconds for each data point. corr = envelope(corr, 100, ’peak’) The dataset features multiple sampling rates, channels, and resolution. To normalize this data, all clips were corr_value = mean(corr) up-sampled to 96 kHz and each clip was limited to one channel, also known as mono. This was done with the Sox Linux tool by utilizing linear interpolation to reduce jG ( f )j any inaccuracies during the coherence process. Down- xy C ( f ) = (4) xy sampling has the potential to remove data points so it G ( f )G ( f ) xx yy was excluded as an option. where G is the cross-spectral density defined by: xy 4.2. Power evaluation j!t G ( f ) = R (t)e dt (5) xy xy It is important to measure the power savings created by the wake-up functionality. To do this, the Monsoon where R is the cross-correlation of the two original xy power monitor was being used. It is capable of pro- signals, x(t) and y(t). The cross-correlation of the two viding very accurate measurements of energy consump- signals can be described as: tion [37]. Measuring the power consumption of both the sleep R (t) = [x(t) y(t)](t) (6) xy and active modes of each permutation of the prototype which involves the two signals being convoluted. will show The envelope of this attempts to create a coherence The Urban Sounds data set was being used for testing. that more accurately represents how closely the two sig- Audio clips were created and designed to simulate a re- nals match, this finds the outline of the signal. Finally, alistic outdoor environment. Specifically, 20 seconds of the mean is taken of the resulting envelope to determine audio from a category was played, followed by 100 sec- the accuracy value. The code showing all the modifica- onds of silence. This was repeated for each category, tions to the signal can be seen in Algorithm 1. totaling 8 minutes of audio. The clips were organized The experimental conditions were kept constant by loudness to test the power consumption at different through all the experiments. The list of conditions for noise levels. the experiment include: 4.3. Calibration • 80 second audio clip of various urban sounds. The voltage being recorded by the microcontroller • Nexus 5 at 53.3% volume, connected to speakers. ADC does not mean much until it can be related to a more accurate measuring device. Putting data into a • Audio clips out of speakers are 96 kHz. human-readable format is important both for interpre- tation and comparison to other data sets. • Distance of 30 cm from speaker and microphone. A handheld Sound Pressure Level (SPL) meter was A Nexus 5 was used to play the 80 seconds clip for used to calibrate the prototype. An equation was formed each experiment. This amount of time was chosen to to convert values on the ADC to one representing deci- get an average accuracy for each microphone, reducing bel noise levels. The Decibel X Android app was also the effect of any outliers. The Nexus 5 was connected calibrated and used to measure the average noise level 7 it also has the highest power consumption. Addition- Microphone Power (mW) Accuracy ally, exclusively using a digital microphone would pro- hibit the use of the threshold circuit since the output of InvenSense 0.0079 0.7359 the microphone is PDM rather than a pure analog sig- nal. The analog ST microphone shows high accuracy PUI 0.3545 0.6966 and low power, working without the need for a voltage regulator. These reasons are why the analog ST micro- ST (Analog) 0.3480 0.7187 phone was chosen as the best candidate for the remain- ST (Digital) 2.0813 0.7577 ing experiments. Table 3: Microphone accuracy and power. 5.2. Decibel calibration A handheld SPL meter was used to calibrate the data being transmitted from the microcontroller. An equa- tion was formed to transfer the voltages being reported to the server into a Z-weighted decibel value. The visual calibration can be found in Fig. 6. The R value is 0.995, indicating a very strong correlation. The equation is as follows: where x is the ADC value after processing on the mi- crocontroller. This power trend makes sense as decibels are logarithmic, meaning it should follow an exponen- tial trend. 350 450 550 650 750 5.3. Power consumption ADC Value The power consumption of the prototype was mea- sured with the Monsoon power monitor. Each mod- Figure 6: SPL calibration curve. ule was considered individually to analyze the biggest power draw in the system. The power consumption of each component can be seen in Table 4. As expected, when recording and testing with the prototype. The the microcontroller and Wi-Fi module have the largest application was seen to only have an error of approx- power draw by a very significant amount when trans- imately 2 dB. mitting. Both Bluetooth and Zigbee are comparable in terms of power, though the power of the XBee is slightly lower, especially when sleeping. The ESP8266 5. Results Wi-Fi module of the NodeMCU was considered to have 5.1. Microphone selection a power of zero when sleeping since it’s integrated into the microcontroller itself. After the magnitude-squared coherence was con- The power of each wireless technology was specifi- ducted, the accuracy values could be analyzed as well cally compared in Fig. 7a and Fig. 7b. These figures as the power dissipation. As shown in Table 3, it is clear show a comparison of each wireless technology in both that the InvenSense has the lowest power consumption. the sleep mode and transmitting mode. A clear trend This is partially because the supply voltage is 1 V as op- of Wi-Fi consuming significantly more power than the posed to all the other microphones which have a nom- other two technologies can be seen. Zigbee uses the inal voltage of 3.3 V. However, the disadvantage is re- least while BLE is in the middle. quiring a voltage regulator to step down from 3.3 V. This Table 5 puts each technology in perspective in terms would require an excessive amount of power. of the total power being saved by the sleep function- ality of the system. The most apparent savings occur 0:04258 dB = 290:5(x 350) + 314:7 (7) when using Wi-Fi since the active power of the Wi-Fi The highest accuracy is the digital ST microphone. module is so much greater than the sleeping power. The This makes sense as the integrated ADC is configured standalone system consisting of the XBee without a mi- to work specifically for the microphone. Consequently, crocontroller has the lowest overall power. The increase dB Power (mW) Power (mW) Component [Transmit] [Sleep] Microphone 0.35 0.35 Amplifier 0.20 0.20 Threshold 0.07 0.07 NodeMCU 91.84 16.76 Wi-Fi (2.4 GHz) 265.75 0.00 BLE 4.0 49.02 8.47 Zigbee 33.68 0.07 Table 4: Power results by component. 10 300 0 0 Wi-Fi Bluetooth Zigbee Wi-Fi Bluetooth Zigbee (a) Sleep (b) Transmit Figure 7: Wireless power consumption. Wireless Power (mW) Power (mW) Savings (%) Technology [Sleep] [Transmit] Wi-Fi (2.4 GHz) 16.76 357.59 95.3 BLE 4.0 25.23 140.86 82.1 Zigbee 16.83 160.43 89.5 Zigbee (Standalone) 1.00 34.30 97.1 Table 5: Total power results of prototypes. in sleeping power is most likely due to extra leakage power consumption. When looking at the work in [9], current from the use of the ADC on the XBee. the average power consumption claimed is 45 mW. This is based on a periodic interrupt to transition from sleep 5.4. Comparison with similar approach to active, a common technique used to save power in such systems. If a quiet environment is considered, no To prove the relevance of the power savings achieved, significant sound would be reported. In this situation, an in-place solution will be compared and analyzed. the average would still be 45 mW in the work being This will be compared to the Zigbee standalone imple- compared while the presented system will use 1 mW. mentation in this paper since it has the lowest overall Power (mW) Power (mW) This leads to a power savings of over 97%. The paper threshold circuit in each. In the future, large-scale wire- being compared assumes the prototype has a battery size less audio collection systems will be possible, this is one of 2900 mAh which means their prototype can last for step towards a power-efficient system capable of oper- 9 days based on the average power consumption. The ating for months with no human maintenance. amount of improvement shown in this prototype leads Testing a greater variety of MEMS microphones to the system lasting approximately 400 days when no could also potentially improve accuracy and power. De- significant audio is detected. veloping an analog A-weighted filter would make the If a moderately noisy environment is considered, this noise level better represent the human ear when collect- will change the average power of the presented system. ing data. Developing a dynamic threshold circuit to reg- The system presented system will never use more power ulate the duty cycle would help reduce power further. than the system being compared since a 100% duty cy- cle would lead to a power of only 34 mW. This means References that in the worst case, the prototype will last just over 11 days. Based on this result, the improved version pre- References sented in this paper will always outperform that of the [1] J. Khan, M. Ketzel, K. Kakosimos, M. Sørensen, S. S. comparison work. Jensen, Road traffic air and noise pollution exposure assess- Another big advantage of the presented system is that ment - a review of tools and techniques, Science of The Total Environment 634 (2018) 661 – 676. doi:https: it won’t miss significant events, unlike a periodic sam- //doi.org/10.1016/j.scitotenv.2018.03.374. pling system. This is because the threshold circuit will URL http://www.sciencedirect.com/science/ always activate the transmission of data once the noise article/pii/S0048969718311343 threshold has been reached. The envelope circuit will [2] D. Shepherd, K. Dirks, D. Welch, D. McBride, J. Landon, The covariance between air pollution annoyance and noise also mean the sound that triggers the threshold will also annoyance, and its relationship with health-related quality be accounted for when reading the sample. Improving of life, International Journal of Environmental Research and the active standard power consumption of the prototype Public Health 13 (8). doi:10.3390/ijerph13080792. by removing the microcontroller led to a significantly URL https://www.mdpi.com/1660-4601/13/8/ improved power consumption, meaning that in any state [3] C. Xu, X. Huang, J. Zhu, K. Zhang, Research on the con- it will be more efficient than the comparison system. struction of sanya smart tourism city based on internet and big The implementation of the threshold circuit further im- data, in: 2018 International Conference on Intelligent Trans- proved the power by allowing the prototype to sleep in portation, Big Data Smart City (ICITBS), 2018, pp. 125–128. doi:10.1109/ICITBS.2018.00040. most cases, thus improving leading to a much more ef- [4] H. Shu, Y. Song, H. Zhou, Time-frequency performance study ficient prototype overall. on urban sound classification with convolutional neural network, in: TENCON 2018 - 2018 IEEE Region 10 Conference, 2018, pp. 1713–1717. doi:10.1109/TENCON.2018.8650428. 6. Conclusion [5] F. Rong, Audio classification method based on machine learn- ing, in: 2016 International Conference on Intelligent Trans- In this work, a wireless ambient sound collection sys- portation, Big Data Smart City (ICITBS), 2016, pp. 81–84. tem was designed and tested. The novelty is a threshold doi:10.1109/ICITBS.2016.98. circuit that only wakes the digital components upon a [6] D. Christin, Privacy in mobile participatory sensing: Current trends and future challenges, Journal of Sys- threshold being broken. This lowers the average power tems and Software 116 (2016) 57 – 68. doi:https: consumption by 97% when using the lowest power per- //doi.org/10.1016/j.jss.2015.03.067. mutation of the prototype is used for data transmission, URL http://www.sciencedirect.com/science/ article/pii/S0164121215000692 making a full-scale wireless microphone array much [7] K. Shehzad, H. Kang, D. Verma, Y. J. Park, K. Lee, Low-power more feasible in smart cities. When compared to a sys- 10-bit sar adc using class-ab type amplifier for iot applications, tem of similar performance, the presented system low- in: 2017 International SoC Design Conference (ISOCC), 2017, ers power consumption. In comparison to a periodic pp. 224–225. doi:10.1109/ISOCC.2017.8368864. [8] C. Peckens, C. Porter, T. Rink, Wireless sensor networks for sleep cycle, this solution ensures that relevant audio data long-term monitoring of urban noise, Sensors 18 (9) (2018) is collected. This can also be tailored to regulate an en- vironment that is meant to stay below a certain level of [9] V. Risojevic, ´ R. Rozman, R. Pilipovic, ´ R. Cešnovar, P. Bulic, ´ dB. et al., Accurate indoor sound level measurement on a low-power and low-cost wireless sensor node, Sensors 18 (7) (2018) 2351. The work presented is a flexible solution that can [10] C. Mydlarz, J. Salamon, J. P. Bello, The implementation of easily be tailored to other audio collection systems. low-cost urban acoustic monitoring devices, Applied Acoustics Three wireless technologies were used to both measure 117 (2017) 207 – 218, acoustics in Smart Cities. doi:https: the power consumption and find the effectiveness of a //doi.org/10.1016/j.apacoust.2016.06.010. 10 URL http://www.sciencedirect.com/science/ VM1010-EB-R.pdf article/pii/S0003682X1630158X [25] ST Analog Microphone, (retrieved: 2019-04-08). [11] R. M. Alsina-Pagès, U. Hernandez-Jayo, F. Alias, I. Angulo, URL https://www.st.com/resource/en/ Design of a mobile low-cost sensor network using urban buses datasheet/mp23abs1.pdf for real-time ubiquitous noise monitoring, Sensors 17 (1). doi: [26] ST Digital Microphone, (retrieved: 2019-04-08). 10.3390/s17010057. URL https://www.st.com/resource/en/ URL https://www.mdpi.com/1424-8220/17/1/57 datasheet/mp34dt05-a.pdf [12] J. Gubbi, S. Marusic, A. S. Rao, Y. W. Law, M. Palaniswami, [27] J. Zhou, J. Zhao, P. Li, Noise pollution and control measures A pilot study of urban noise monitoring architecture using of rubber factory compressor room, in: 2009 Third Interna- wireless sensor networks, in: 2013 International Conference tional Symposium on Intelligent Information Technology Ap- on Advances in Computing, Communications and Informatics plication Workshops, 2009, pp. 241–242. doi:10.1109/ (ICACCI), 2013, pp. 1047–1052. doi:10.1109/ICACCI. IITAW.2009.35. 2013.6637321. [28] D. Dov, R. Talmon, I. Cohen, Audio-visual voice activity de- [13] J. E. Noriega-Linares, J. M. Navarro Ruiz, On the applica- tection using diffusion maps, IEEE/ACM Transactions on Au- tion of the raspberry pi as an advanced acoustic sensor net- dio, Speech, and Language Processing 23 (4) (2015) 732–745. work for noise monitoring, Electronics 5 (4). doi:10.3390/ doi:10.1109/TASLP.2015.2405481. electronics5040074. [29] Ü. Zubari, E. C. Ozan, B. O. Acar, T. Ciloglu, E. Esen, T. K. URL https://www.mdpi.com/2079-9292/5/4/74 Ates, ¸ D. O. Önür, Speech detection on broadcast audio, in: 2010 [14] I. Hakala, I. Kivela, J. Ihalainen, J. Luomala, C. Gao, Design 18th European Signal Processing Conference, 2010, pp. 85–89. of low-cost noise measurement sensor network: Sensor func- [30] I. Feki, A. B. Ammar, A. M. Alimi, Audio stream analysis for tion design, in: 2010 First International Conference on Sen- environmental sound classification, in: 2011 International Con- sor Device Technologies and Applications, 2010, pp. 172–179. ference on Multimedia Computing and Systems, 2011, pp. 1–6. doi:10.1109/SENSORDEVICES.2010.39. doi:10.1109/ICMCS.2011.5945607. [15] I. Kivelä, C. Gao, J. Luomala, J. Ihalainen, I. Hakala, Design of [31] E. Fallis, P. Spachos, Power consumption and throughput networked low-cost wireless noise measurement sensors (issn of wireless communication technologies for smartphones, in: 1726-5479), International Journal on Sensors & Transducers 9 2018 Global Information Infrastructure and Networking Sym- (2010) 171–190. posium (GIIS), 2018, pp. 1–4. doi:10.1109/GIIS.2018. [16] S. Santini, B. Ostermaier, A. Vitaletti, First experiences using 8635665. wireless sensor networks for noise pollution monitoring, in: [32] Dsd tech hm-10, (retrieved: 2019-06-19). Proceedings of the Workshop on Real-world Wireless Sensor URL http://www.dsdtech-global.com/2017/08/ Networks, REALWSN ’08, ACM, New York, NY, USA, 2008, hm-10.html pp. 61–65. doi:10.1145/1435473.1435490. [33] Zigbee website, (retrieved: 2019-06-19). URL http://doi.acm.org/10.1145/1435473. URL https://www.zigbee.org/ 1435490 [34] Sparkfun xbee s2c, (retrieved: 2019-06-19). [17] L. Filipponi, S. Santini, A. Vitaletti, Data collection in wireless URL https://www.digi.com/resources/ sensor networks for noise pollution monitoring, in: S. E. Niko- documentation/digidocs/pdfs/90002002.pdf letseas, B. S. Chlebus, D. B. Johnson, B. Krishnamachari (Eds.), [35] Nex Tech Speakers, (retrieved: 2019-04-15). Distributed Computing in Sensor Systems, Springer Berlin Hei- URL https://www.thesource.ca/en-ca/ delberg, Berlin, Heidelberg, 2008, pp. 492–497. computers--and--tablets/computer- [18] M. Zappatore, A. Longo, M. A. Bochicchio, Crowd-sensing our accessories/computer-speakers/nexxtech- smart cities: A platform for noise monitoring and acoustic urban 2-0-stereo-pc-speakers/p/108033507 planning, Journal of Communications Software and Systems. [36] Urban Sound Data Sets, (retrieved: 2019-04-08). [19] I. Aspuru, I. García, K. Herranz, A. Santander, Citi-sense: meth- URL https://urbansounddataset.weebly.com/ ods and tools for empowering citizens to observe acoustic com- [37] Monsoon Power Monitor, (retrieved: 2019-01-30). fort in outdoor public spaces, Noise Mapping 3 (1). URL http://www.msoon.com/LabEquipment/ [20] I. Aspuru, A. Santander, I. García, M. González, A. Bartonova, PowerMonitor/ Citi-sense project: Acoustic comfort in the empowerment initia- tive about environmental quality of public spaces, i: Seminario gaa smart cities e nuovi sensori per la misura del rumore ambi- entale, Associazione Italiana di Acustica Pisa 17. [21] I. Kirillov, V. Bulkin, The mobile system of urban area noise pollution monitoring, in: 2015 Second International Scientific- Practical Conference Problems of Infocommunications Science and Technology (PIC S T), 2015, pp. 200–203. doi:10. 1109/INFOCOMMST.2015.7357312. [22] P. Patil, Smart iot based system for vehicle noise and pollu- tion monitoring, in: 2017 International Conference on Trends in Electronics and Informatics (ICEI), 2017, pp. 322–326. doi: 10.1109/ICOEI.2017.8300941. [23] InvenSense Microphone, (retrieved: 2019-04-08). URL https://www.invensense.com/wp-content/ uploads/2015/02/ICS-40310-datasheet- v1.2.pdf [24] PUI Microphone, (retrieved: 2019-04-08). URL http://www.puiaudio.com/pdf/PMM-3738- http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Electrical Engineering and Systems Science arXiv (Cornell University)

A Power-Efficient Audio Acquisition System for Smart City Applications

Loading next page...
 
/lp/arxiv-cornell-university/a-power-efficient-audio-acquisition-system-for-smart-city-applications-0igtTf3Nha

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

ISSN
2542-6605
DOI
10.1016/j.iot.2019.100155
Publisher site
See Article on Publisher Site

Abstract

Acoustic noise has adverse effects on human activities. Aside from hearing impairment and stress-related illnesses, it can also interfere with spoken communication, reduce human performance and affect the quality of life. As ur- banization is intensifying, the potential benefits of reducing noise pollution in smart-city environments are extensive. Noise levels can be collected and analyzed using a wireless sensor network which can monitor the noise level by using microphones. However, every wireless system struggles in terms of the battery requirements needed for continuous data collection and monitoring. In this paper, the design of a testbed for a smart microphone system is presented. To save power, a microcontroller and an Analog-to-Digital Converter (ADC) dynamically switch between high and low power modes in response to environmental noise. Specifically, the high powered components are triggered by a spike in the acoustic noise level. Three wireless technologies, WiFi (2.4 GHz), Bluetooth Low Energy (BLE) 4.0 and Zigbee were examined. According to the results, the power consumption of a node can be lowered by 97% when idle based on the testbed. Keywords: Low-power audio; audio processing; audio coherence; audio classification, noise pollution. 1. Introduction of different sounds such as human voice, traffic, con- struction, or air transport that is currently present in any A common problem in urban cities is acoustic noise school or hospital. Mapping the amount of sound clas- pollution [1]. Noise pollution is an abundance of noise sified as human voice within an area grants the ability in a concentrated area that can harm humans both phys- of the public to observe the amount of human presence ically and mentally [2]. To alleviate this problem, the at the time. This can also help citizens to determine if first step is to understand and analyze what acoustic an area within the city such as a park or restaurant is noise citizens are being exposed to daily. Smart cities comfortable to visit at a specific time in the day. enable the introduction of new technology to the public Analyzing the audio data in an urban environment can through different services that optimize resource usage, become complicated depending on the location within transportation, utilities and more. In a smart city, large the city [5]. There is a variety of challenges such as amounts of data are collected and utilized [3]. This is privacy and placement [6]. Privacy has always been a done to predict, analyze, and observe trends that happen big issue with regard to the collection of data in a pub- throughout the city based on various sensors. lic setting. An invasion of privacy could be claimed by Audio data is an example of sensory information be- anyone who feels uncomfortable about the idea of the ing collected. The amount of human comfort in differ- voices being recorded. ent locations within a city can be found by analyzing Another challenge is the reliability of the data, which noise levels collected by multiple microphones. By cre- depends on its accuracy in modeling the noise within ating a tool that can visualize the levels of acoustic noise an area. Data is considered reliable if the extraneous around the city, its people are given the ability to react variables that could skew the noise model of the area and adjust accordingly. In future iterations, it may also are minimal. A major concern is the power consump- be possible to show the type of sounds present if a neural tion of various microphones and analog-to-digital con- network is utilized [4]. An example is the classification verters (ADCs) when collecting audio data [7]. This is especially true when the microphone is set to sample throughout the day, meaning that it will have minimum Corresponding author Email address: petros@uoguelph.ca (Petros Spachos) downtime. Many systems try to solve this problem by Preprint submitted to Internet of Things January 23, 2020 arXiv:2001.08163v1 [cs.NI] 21 Jan 2020 sampling at different points in the day rather than being level during a day, it can easily miss many important continuous. Although this is a good method for saving events since it is only awake less than 20% of the time. energy, it can miss key information due to the scheduled The work in [9] presents a node that collects audio sampling. level information as well as other types of data. It is The work presented relates to a framework for reduc- designed to wake up based on a timer interrupt or when the accelerometer is triggered. The method of wireless ing power consumption in Micro-Electro-Mechanical transmission involves the Zigbee protocol to ensure low System (MEMS) microphone ADCs for an acoustic en- power. The model was verified by measuring the audio vironmental monitoring system. A testbed to prove the level of varying sinusoidal waves, ranging from 31.5 Hz power savings has been developed. The contributions of to 16 kHz. This was recorded with an MI 6201 MULTI- this paper are as follows: NORM sound level meter. The average power con- • A testbed for environmental sound detection us- sumption of the prototype is 45 mW. This is extremely ing a microphone was designed. The microphone low when compared to most devices, allowing operation was selected after experimentation on the energy for up to 7 days. This prototype is a good starting point requirements and the accuracy of four different mi- but requires a longer operating time to reduce the need crophones. for continuous maintenance. The work in [10] features a prototype using a mini • An amplifier and a threshold circuit were designed PC. Although the system is not fully wireless, it is a for use with audio collection systems. good solution for indoor environments. It was veri- fied using a calibrated sound level meter using an A- • Three wireless technologies were used to exam- weighted filter. In [11], a prototype was built for use ine the effectiveness of the proposed energy-saving with city buses. This had a power supply unit connected scheme. Specifically, WiFi (2.4 GHz), Bluetooth to the city bus and used 3G as well as GPS to wirelessly Low Energy (BLE) 4.0 and Zigbee were used. transmit data along with a location stamp. As described in the article, one challenged faced was attempting to The results are compared to a reference system to filter out the noise generated from the bus, this presents prove the viability and demonstrate the improvements. a big problem with the accuracy of the data if it is meant The rest of the paper is organized as follows; Section 2 to represent noise level from a human perspective. reviews the related works and Section 3 discusses the A pilot study was presented in [12], which involved proposed system architecture. Section 4 details the pro- an embedded system to collect acoustic noise. The cedure involved and the results found during analysis power consumption was not specified but the accuracy are presented in Section 5. The conclusion is in Sec- of collected data was discussed extensively. A standard tion 6. sound level meter was used to calibrate the system and verify the analog filters. A noise monitoring node based around a Raspberry Pi 2B and USB microphone was 2. Related Work developed in [13]. A sound level meter was used to ver- In recent years, many research works involve wire- ify all measurements and power over Ethernet supplied less sensor networks for acoustic noise collection [8, 9, each node with energy. Although not wireless, it was 10, 11, 12, 13, 14, 15, 16, 17]. Their main focus is the still considered a flexible solution for an acoustic sensor development of systems that have low enough power to network. reliably operate with little to no human intervention. In [14], a low-cost sensor network was developed to A promising implementation has been developed target the problem of noise pollution. It has an oper- in [8]. The work focuses on developing an ultra low ating power of 89 mW when collecting data. A sound power sensor node that is designed to wake up periodi- level meter was used to calibrate the system, ensuring cally to collect data, it was programmed to wake for ten accurate measurements. Despite the low power, the pro- minutes every hour. To accomplish this, an ultra-low- totype can run for less than 4 days, which is not very power microcontroller was used along with an XBee. realistic for a full implementation. A similar implemen- The total power when transmitting was 810 mW while tation was followed in [15]. The authors in [16] show the transmitting power was 132 mW. A handheld Sound preliminary results of a wireless noise collection sys- Pressure Level (SPL) meter was used to verify the ac- tem, mainly discussing the accuracy of this method and curacy of the audio level collection. Although this pro- giving recommendations for future designs. In [17], the totype may be sufficient to get the overall average noise authors focus on a prototype involving low power lis- 2 >_ Threshold (Digital) Raw Audio (Analog) Amp   Amplified Raw Processed Audio Audio (Analog) Magnitude (Digital) Figure 1: System overview. tening. They mainly focus on the networking protocols noise pollution by redesigning the air compressor. The involved and recommend the collection tree protocol. work presented in [28] highlights an improved way to collect audio data for voice activity detection. It uses Smartphones have been considered for noise collec- both audio and visual signals with a supervised learn- tion via crowdsourcing. The authors in [18] developed ing algorithm to detect what audio frames correspond an Android application for the use of acoustic data col- to human voice. The authors in [29] use a Gaussian lection. It gives the user two collection methods, man- mixture model-based classification approach to speech ual mode or automatic collection. Both require no ex- detection. It’s trained using spectral flow direction, a tra hardware for the user to start collecting the acoustic novel feature for this type of classification. In [30], a noise level around a city. The exact power being used multistage system is developed to analyze environmen- by the application is unknown. To verify the model, a tal sound and classify it. The authors claim to achieve Sauter SU-130 sound level meter was used and com- an accuracy higher than 90% for audio concept identifi- pared with the data collected via the app. A visual dash- cation. board is available to the end-user which displays the col- In comparison with these works, this paper introduces lected data on to a map. Another Android application a novel method that is portable and can be implemented was developed in [19]. This application allows the user into many in-place solutions. Specifically, this testbed to record and visualize the acoustic comfort in an area utilizes a wake-up circuit to conserve power when min- as well as push the results to a server for analysis from imal audio activity is present, while three different tech- the authors. The accuracy of their data was analyzed nologies are examined to find the most energy-efficient. and verified in a separate study [20]. The main focus of the papers reviewed so far have been focused on developing a low power solution for 3. System Architecture ambient sound collection. The other concern this paper The introduced system has four main components: attempts to address is finding an effective way to collect the microphone, the circuit, the microcontroller, and the and handle the sound data while being able to integrate wireless unit. An overview of the system can be seen in it into the design. The authors in [21] discuss a mobile Fig. 1. A microphone collects raw audio data and then system dedicated to the sensing of noise pollution in ur- the signal is amplified. This amplified signal is then fed ban environments. It focuses on determining the level to both a microcontroller and a custom threshold cir- of attenuation as part of the sound collection process in cuit. The microcontroller only collects data when the addition to gathering the actual audio levels. The work audio threshold has been broken, then finally sends the in [22] specializes in the tracking of audio data with an processed data to a server via a wireless technology. emphasis on vehicles. The data being sent is based on a certain threshold value which will omit any data below 3.1. Microphone it. Unfortunately, no schematic of the system was made and the power consumption was not presented. This Multiple microphones were considered to determine severely limits the usefulness of the claimed results and a good candidate which had a high ratio of accuracy to methodologies. power. They all have varying manufacturers, voltage In [27], a factory was examined and profiled based on ranges, and configurations. A summary showing the de- the acoustic levels. Their goal is to reduce the amount of tails of each microphone can be found in Table 1. While 3 Manufacturer Microphone Voltage Range (V) Configuration InvenSense 0.9 - 1.3 Analog ICS-40310 [23] Projections Unlimited, Inc. (PUI) 1.6 - 3.6 Analog PMM-3738-VM1010-EB-R [24] STMicroelectronics (ST) 1.52 - 3.6 Analog MP23ABS1 [25] STMicroelectronics (ST) 1.6 - 3.6 Digital MP34DT05-A [26] Table 1: MEMS microphone overview. V V out interrupt most are analog, one was picked to be digital and used dd as a control to compare with. The integrated ADC will lead to a higher accuracy than others, the disadvantage Mic Amp Envelope Threshold being a significant increase in power. The chosen microphone needed to be analog to prop- erly interface with the threshold circuit. To make the Figure 2: Top level view of analog circuit. system more sensitive to audio, an amplifier was used. 3.2. Circuit designed to have low noise and a high gain. The am- The circuit that was designed has three main compo- plifier is shown in Fig. 3 The negative feedback adds nents: the amplifier, the envelope filter, and the thresh- stability to make a much cleaner signal. The gain of the old circuit. amplifier can be determined from R and R as seen in f 1 the following: 3.2.1. Top level design Multiple circuits were designed to collect and ana- A = 1 + (1) lyze the data. An overview of the electrical components R can be seen in Fig. 2. This shows the three simplified where A is the gain of the amplifier. This is the trans- components. An external amplifier was needed to allow fer function for a typical non-inverting amplifier design. the ADC to accurately capture small changes in signal The common-mode input and output can be seen by the amplitude, essentially increasing the resolution. The en- following equation: velope filter averaged the positive amplitude using resis- tors and a capacitor. This is used to bypass the need for dd CM = (2) pre-processing before data transmission. The main fea- ture of this prototype, the wake-up circuit, was designed where the common-mode voltage is simply the sup- to output a high or low value based on a threshold. ply voltage divided by two. This is from the voltage divider defined by R . 3.2.2. Amplifier An amplifier was needed to make the signal recog- 3.2.3. Envelope nizable to an ADC with a standard resolution. This was also important for the wake-up circuit, making it much An envelope circuit was designed to average the mag- more sensitive to changes in noise. The amplifier was nitude of audio. The schematic of this circuit can 4 Vdd 1 Vdd Rf in Vout Vdd Vout M1 M2 R R 2 2 Vdd Figure 5: Threshold circuit. Parameter Value V 3.3 V dd Vin R 100 k R 1 k Figure 3: Amplifier circuit. R 1 M Vout in R 10 M R 500 k C R R 5 5 6 R 10 M R 100 k C 22 F Figure 4: Envelope circuit. C 100 nF C 4.5 F be seen in Fig. 4. This surpassed the need for pre- 3 processing. The purpose of C and R is to increase the 5 5 C 1.5 nF time constant, filtering the output. The second diode and C 9 F R are used to lower the common-mode voltage since 5 the XBee ADC uses a voltage reference of 1.2 V. Table 2: Circuit parameters. 3.2.4. Threshold circuit The threshold circuit was used to control an interrupt All of the circuit parameters used for this set of ex- on the microcontroller. This disabled sleep mode and periments can be found in Table 2. Using Eq. (1), the allowed the microcontroller to send audio data through gain can be calculated based on these parameters: the various wireless technologies for a brief period be- 100k fore going back into sleep mode. The schematic for the A = 1 + = 101 (3) wake-up circuit can be seen in Fig. 5. 1k Two transistors are used, M regulates the DC offset The gain becomes 101 , amplifying the signal by of the input, M is biased in such a way that it will satu- approximately 20 dB. This is necessary for the ADC to rate when the input is high. Once M is saturated, it ties properly recognize relatively small changes in the am- the output to ground. Alternatively, V will be close out bient acoustic noise. to V when M is in the cut-off region. This means dd 2 that the output will be grounded when the threshold is 3.3. Microcontroller broken and V when no significant noise is heard. At dd this time, the threshold can only be configured through A microcontroller was needed to correctly process hardware but may be configurable in future iterations. the analog data incoming from the amplifier. The 5 NodeMCU was picked for its low power, flexibility, and An advantage of the XBee is the integrated ADC, ability to harness many types of wireless technologies. which allows for wireless transmission without the need The NodeMCU uses an ESP8266 microprocessor with for an external microcontroller. a 10-bit ADC. When the microcontroller was not in sleep mode, it 4. Experimental Procedure connected to the server using either Wi-Fi (2.4 GHz), BLE 4.0, or Zigbee. The microcontroller then slept if 4.1. Microphone evaluation the noise threshold was not broken. The XBee S2C To examine which microphone was the most accu- is capable of converting an analog value to digital and rate, the coherence between the original waveform and transmitting the data using Zigbee. This bypasses the the recorded waveform was found. This shows how need for a microcontroller and will be tested alongside similar the recording is to the original signal in terms the microcontroller variant of the prototype. of the frequency response. Magnitude squared coher- ence was used on the pre-processed signals to create an 3.4. Wireless technology output which shows how much the frequency domains Three wireless technologies were tested to determine of the signals match. which was the lowest power, and show the sleep func- The coherence was conducted in Matlab by using tionality in a variety of cases. functions in the signal processing library. A procedure was used to filter the two signals and end up with a per- 3.4.1. Wi-Fi centage value that represents how closely the recording Wi-Fi is a common technology found in most homes. resembles the source. The simple list of steps includes: It allows users to connect to the internet wirelessly by using an access point. The power consumption and • Down-sample all recordings to 8 kHz. bandwidth are typically high, keeping up with the needs of the consumer [31]. For the application of an au- • Lineup source and recording. dio acquisition system, the bandwidth should not be as • Limit time to exactly 80 seconds. much of an issue when compared to the power being expended. • Find magnitude squared coherence. The NodeMCU has an integrated proprietary Wi-Fi (2.4 GHz) chip, giving it the capability to transmit data. • Take envelope of the signal. It was able to directly send data to a server. • Average the envelope. 3.4.2. Bluetooth Bluetooth is a popular technology used for many The purpose of down-sampling is to simplify the data wireless devices. It is designed to be a low-power so- and give every dataset a fair comparison without remov- lution that conserves energy to promote battery life in ing the frequencies that matter as well as increase the electronics. The power is typically less than Wi-Fi, but performance of the script. This uses decimation to make the throughput suffers [31]. the signal simpler, and thus easier to compare. The DSD TECH HM-10 was used to connect the mi- Although the magnitude squared coherence will con- crocontroller using BLE 4.0 [32]. It communicates with vert the signals to the frequency domain, it is still impor- a separate device to wirelessly transmit the data. tant to properly line up the source and recording. This eliminates any frequencies that happened before or after 3.4.3. Zigbee the recording interval. The offset can be found by com- Zigbee is a lesser-known protocol capable of low paring the similarities of the two signals and performing power transmission, meant to be used specifically for a correlation that is much more basic when compared to IoT (Internet of Things) [33]. This makes it a good the magnitude squared coherence. This offset was used match for the application of low power collection of au- to perfectly line up the signals by trimming the micro- dio. phone clip to exactly 80 seconds. The offset was based The XBee S2C module was being used [34]. This on a 10 second clip of the source and recording which provided a platform for the microcontroller to transmit was shown to be enough to properly line up the signals. using the Zigbee protocol. Similar to Bluetooth, the The magnitude squared coherence finds the cross power Zigbee experiments required a secondary unit with a spectral density in the numerator which can be seen be- matching XBee to provide wireless communication. low: 6 Algorithm 1 : Find the effective coherence value to a pair of Nex Tech analog speakers to provide a much higher quality sound when compared to the embedded y_source = resample(y_source, 8000, Fs_source) speakers on the Nexus 5 [35]. This was kept at a con- y_mic = resample(y_mic, 8000, Fs_mic) stant 30 cm away from the respective microphone which ensured consistent data. Each microphone was pointed y_source_clip = y_source(1:8000*10) directly at the speakers for a fair comparison. The vol- y_mic_clip = y_mic(1:8000*10) ume of the Nexus 5 was set to a constant of bars, or 53.3% while the speakers were kept at a maximum gain. mic_delay = finddelay(y_source_clip, y_source_clip) The Urban Sound dataset was used to test with, it y_mic = y_mic(mic_delay:mic_delay + 8000*80 - 1) contains audio categorized into four main sections [36]. This includes human, nature, music, and mechanical [corr, f] = mscohere(y_source, y_mic, [], [], [], 8000) based sounds. Twenty seconds of each category was used for testing, totaling 80 seconds for each data point. corr = envelope(corr, 100, ’peak’) The dataset features multiple sampling rates, channels, and resolution. To normalize this data, all clips were corr_value = mean(corr) up-sampled to 96 kHz and each clip was limited to one channel, also known as mono. This was done with the Sox Linux tool by utilizing linear interpolation to reduce jG ( f )j any inaccuracies during the coherence process. Down- xy C ( f ) = (4) xy sampling has the potential to remove data points so it G ( f )G ( f ) xx yy was excluded as an option. where G is the cross-spectral density defined by: xy 4.2. Power evaluation j!t G ( f ) = R (t)e dt (5) xy xy It is important to measure the power savings created by the wake-up functionality. To do this, the Monsoon where R is the cross-correlation of the two original xy power monitor was being used. It is capable of pro- signals, x(t) and y(t). The cross-correlation of the two viding very accurate measurements of energy consump- signals can be described as: tion [37]. Measuring the power consumption of both the sleep R (t) = [x(t) y(t)](t) (6) xy and active modes of each permutation of the prototype which involves the two signals being convoluted. will show The envelope of this attempts to create a coherence The Urban Sounds data set was being used for testing. that more accurately represents how closely the two sig- Audio clips were created and designed to simulate a re- nals match, this finds the outline of the signal. Finally, alistic outdoor environment. Specifically, 20 seconds of the mean is taken of the resulting envelope to determine audio from a category was played, followed by 100 sec- the accuracy value. The code showing all the modifica- onds of silence. This was repeated for each category, tions to the signal can be seen in Algorithm 1. totaling 8 minutes of audio. The clips were organized The experimental conditions were kept constant by loudness to test the power consumption at different through all the experiments. The list of conditions for noise levels. the experiment include: 4.3. Calibration • 80 second audio clip of various urban sounds. The voltage being recorded by the microcontroller • Nexus 5 at 53.3% volume, connected to speakers. ADC does not mean much until it can be related to a more accurate measuring device. Putting data into a • Audio clips out of speakers are 96 kHz. human-readable format is important both for interpre- tation and comparison to other data sets. • Distance of 30 cm from speaker and microphone. A handheld Sound Pressure Level (SPL) meter was A Nexus 5 was used to play the 80 seconds clip for used to calibrate the prototype. An equation was formed each experiment. This amount of time was chosen to to convert values on the ADC to one representing deci- get an average accuracy for each microphone, reducing bel noise levels. The Decibel X Android app was also the effect of any outliers. The Nexus 5 was connected calibrated and used to measure the average noise level 7 it also has the highest power consumption. Addition- Microphone Power (mW) Accuracy ally, exclusively using a digital microphone would pro- hibit the use of the threshold circuit since the output of InvenSense 0.0079 0.7359 the microphone is PDM rather than a pure analog sig- nal. The analog ST microphone shows high accuracy PUI 0.3545 0.6966 and low power, working without the need for a voltage regulator. These reasons are why the analog ST micro- ST (Analog) 0.3480 0.7187 phone was chosen as the best candidate for the remain- ST (Digital) 2.0813 0.7577 ing experiments. Table 3: Microphone accuracy and power. 5.2. Decibel calibration A handheld SPL meter was used to calibrate the data being transmitted from the microcontroller. An equa- tion was formed to transfer the voltages being reported to the server into a Z-weighted decibel value. The visual calibration can be found in Fig. 6. The R value is 0.995, indicating a very strong correlation. The equation is as follows: where x is the ADC value after processing on the mi- crocontroller. This power trend makes sense as decibels are logarithmic, meaning it should follow an exponen- tial trend. 350 450 550 650 750 5.3. Power consumption ADC Value The power consumption of the prototype was mea- sured with the Monsoon power monitor. Each mod- Figure 6: SPL calibration curve. ule was considered individually to analyze the biggest power draw in the system. The power consumption of each component can be seen in Table 4. As expected, when recording and testing with the prototype. The the microcontroller and Wi-Fi module have the largest application was seen to only have an error of approx- power draw by a very significant amount when trans- imately 2 dB. mitting. Both Bluetooth and Zigbee are comparable in terms of power, though the power of the XBee is slightly lower, especially when sleeping. The ESP8266 5. Results Wi-Fi module of the NodeMCU was considered to have 5.1. Microphone selection a power of zero when sleeping since it’s integrated into the microcontroller itself. After the magnitude-squared coherence was con- The power of each wireless technology was specifi- ducted, the accuracy values could be analyzed as well cally compared in Fig. 7a and Fig. 7b. These figures as the power dissipation. As shown in Table 3, it is clear show a comparison of each wireless technology in both that the InvenSense has the lowest power consumption. the sleep mode and transmitting mode. A clear trend This is partially because the supply voltage is 1 V as op- of Wi-Fi consuming significantly more power than the posed to all the other microphones which have a nom- other two technologies can be seen. Zigbee uses the inal voltage of 3.3 V. However, the disadvantage is re- least while BLE is in the middle. quiring a voltage regulator to step down from 3.3 V. This Table 5 puts each technology in perspective in terms would require an excessive amount of power. of the total power being saved by the sleep function- ality of the system. The most apparent savings occur 0:04258 dB = 290:5(x 350) + 314:7 (7) when using Wi-Fi since the active power of the Wi-Fi The highest accuracy is the digital ST microphone. module is so much greater than the sleeping power. The This makes sense as the integrated ADC is configured standalone system consisting of the XBee without a mi- to work specifically for the microphone. Consequently, crocontroller has the lowest overall power. The increase dB Power (mW) Power (mW) Component [Transmit] [Sleep] Microphone 0.35 0.35 Amplifier 0.20 0.20 Threshold 0.07 0.07 NodeMCU 91.84 16.76 Wi-Fi (2.4 GHz) 265.75 0.00 BLE 4.0 49.02 8.47 Zigbee 33.68 0.07 Table 4: Power results by component. 10 300 0 0 Wi-Fi Bluetooth Zigbee Wi-Fi Bluetooth Zigbee (a) Sleep (b) Transmit Figure 7: Wireless power consumption. Wireless Power (mW) Power (mW) Savings (%) Technology [Sleep] [Transmit] Wi-Fi (2.4 GHz) 16.76 357.59 95.3 BLE 4.0 25.23 140.86 82.1 Zigbee 16.83 160.43 89.5 Zigbee (Standalone) 1.00 34.30 97.1 Table 5: Total power results of prototypes. in sleeping power is most likely due to extra leakage power consumption. When looking at the work in [9], current from the use of the ADC on the XBee. the average power consumption claimed is 45 mW. This is based on a periodic interrupt to transition from sleep 5.4. Comparison with similar approach to active, a common technique used to save power in such systems. If a quiet environment is considered, no To prove the relevance of the power savings achieved, significant sound would be reported. In this situation, an in-place solution will be compared and analyzed. the average would still be 45 mW in the work being This will be compared to the Zigbee standalone imple- compared while the presented system will use 1 mW. mentation in this paper since it has the lowest overall Power (mW) Power (mW) This leads to a power savings of over 97%. The paper threshold circuit in each. In the future, large-scale wire- being compared assumes the prototype has a battery size less audio collection systems will be possible, this is one of 2900 mAh which means their prototype can last for step towards a power-efficient system capable of oper- 9 days based on the average power consumption. The ating for months with no human maintenance. amount of improvement shown in this prototype leads Testing a greater variety of MEMS microphones to the system lasting approximately 400 days when no could also potentially improve accuracy and power. De- significant audio is detected. veloping an analog A-weighted filter would make the If a moderately noisy environment is considered, this noise level better represent the human ear when collect- will change the average power of the presented system. ing data. Developing a dynamic threshold circuit to reg- The system presented system will never use more power ulate the duty cycle would help reduce power further. than the system being compared since a 100% duty cy- cle would lead to a power of only 34 mW. This means References that in the worst case, the prototype will last just over 11 days. Based on this result, the improved version pre- References sented in this paper will always outperform that of the [1] J. Khan, M. Ketzel, K. Kakosimos, M. Sørensen, S. S. comparison work. Jensen, Road traffic air and noise pollution exposure assess- Another big advantage of the presented system is that ment - a review of tools and techniques, Science of The Total Environment 634 (2018) 661 – 676. doi:https: it won’t miss significant events, unlike a periodic sam- //doi.org/10.1016/j.scitotenv.2018.03.374. pling system. This is because the threshold circuit will URL http://www.sciencedirect.com/science/ always activate the transmission of data once the noise article/pii/S0048969718311343 threshold has been reached. The envelope circuit will [2] D. Shepherd, K. Dirks, D. Welch, D. McBride, J. Landon, The covariance between air pollution annoyance and noise also mean the sound that triggers the threshold will also annoyance, and its relationship with health-related quality be accounted for when reading the sample. Improving of life, International Journal of Environmental Research and the active standard power consumption of the prototype Public Health 13 (8). doi:10.3390/ijerph13080792. by removing the microcontroller led to a significantly URL https://www.mdpi.com/1660-4601/13/8/ improved power consumption, meaning that in any state [3] C. Xu, X. Huang, J. Zhu, K. Zhang, Research on the con- it will be more efficient than the comparison system. struction of sanya smart tourism city based on internet and big The implementation of the threshold circuit further im- data, in: 2018 International Conference on Intelligent Trans- proved the power by allowing the prototype to sleep in portation, Big Data Smart City (ICITBS), 2018, pp. 125–128. doi:10.1109/ICITBS.2018.00040. most cases, thus improving leading to a much more ef- [4] H. Shu, Y. Song, H. Zhou, Time-frequency performance study ficient prototype overall. on urban sound classification with convolutional neural network, in: TENCON 2018 - 2018 IEEE Region 10 Conference, 2018, pp. 1713–1717. doi:10.1109/TENCON.2018.8650428. 6. Conclusion [5] F. Rong, Audio classification method based on machine learn- ing, in: 2016 International Conference on Intelligent Trans- In this work, a wireless ambient sound collection sys- portation, Big Data Smart City (ICITBS), 2016, pp. 81–84. tem was designed and tested. The novelty is a threshold doi:10.1109/ICITBS.2016.98. circuit that only wakes the digital components upon a [6] D. Christin, Privacy in mobile participatory sensing: Current trends and future challenges, Journal of Sys- threshold being broken. This lowers the average power tems and Software 116 (2016) 57 – 68. doi:https: consumption by 97% when using the lowest power per- //doi.org/10.1016/j.jss.2015.03.067. mutation of the prototype is used for data transmission, URL http://www.sciencedirect.com/science/ article/pii/S0164121215000692 making a full-scale wireless microphone array much [7] K. Shehzad, H. Kang, D. Verma, Y. J. Park, K. Lee, Low-power more feasible in smart cities. When compared to a sys- 10-bit sar adc using class-ab type amplifier for iot applications, tem of similar performance, the presented system low- in: 2017 International SoC Design Conference (ISOCC), 2017, ers power consumption. In comparison to a periodic pp. 224–225. doi:10.1109/ISOCC.2017.8368864. [8] C. Peckens, C. Porter, T. Rink, Wireless sensor networks for sleep cycle, this solution ensures that relevant audio data long-term monitoring of urban noise, Sensors 18 (9) (2018) is collected. This can also be tailored to regulate an en- vironment that is meant to stay below a certain level of [9] V. Risojevic, ´ R. Rozman, R. Pilipovic, ´ R. Cešnovar, P. Bulic, ´ dB. et al., Accurate indoor sound level measurement on a low-power and low-cost wireless sensor node, Sensors 18 (7) (2018) 2351. The work presented is a flexible solution that can [10] C. Mydlarz, J. Salamon, J. P. Bello, The implementation of easily be tailored to other audio collection systems. low-cost urban acoustic monitoring devices, Applied Acoustics Three wireless technologies were used to both measure 117 (2017) 207 – 218, acoustics in Smart Cities. doi:https: the power consumption and find the effectiveness of a //doi.org/10.1016/j.apacoust.2016.06.010. 10 URL http://www.sciencedirect.com/science/ VM1010-EB-R.pdf article/pii/S0003682X1630158X [25] ST Analog Microphone, (retrieved: 2019-04-08). [11] R. M. Alsina-Pagès, U. Hernandez-Jayo, F. Alias, I. Angulo, URL https://www.st.com/resource/en/ Design of a mobile low-cost sensor network using urban buses datasheet/mp23abs1.pdf for real-time ubiquitous noise monitoring, Sensors 17 (1). doi: [26] ST Digital Microphone, (retrieved: 2019-04-08). 10.3390/s17010057. URL https://www.st.com/resource/en/ URL https://www.mdpi.com/1424-8220/17/1/57 datasheet/mp34dt05-a.pdf [12] J. Gubbi, S. Marusic, A. S. Rao, Y. W. Law, M. Palaniswami, [27] J. Zhou, J. Zhao, P. Li, Noise pollution and control measures A pilot study of urban noise monitoring architecture using of rubber factory compressor room, in: 2009 Third Interna- wireless sensor networks, in: 2013 International Conference tional Symposium on Intelligent Information Technology Ap- on Advances in Computing, Communications and Informatics plication Workshops, 2009, pp. 241–242. doi:10.1109/ (ICACCI), 2013, pp. 1047–1052. doi:10.1109/ICACCI. IITAW.2009.35. 2013.6637321. [28] D. Dov, R. Talmon, I. Cohen, Audio-visual voice activity de- [13] J. E. Noriega-Linares, J. M. Navarro Ruiz, On the applica- tection using diffusion maps, IEEE/ACM Transactions on Au- tion of the raspberry pi as an advanced acoustic sensor net- dio, Speech, and Language Processing 23 (4) (2015) 732–745. work for noise monitoring, Electronics 5 (4). doi:10.3390/ doi:10.1109/TASLP.2015.2405481. electronics5040074. [29] Ü. Zubari, E. C. Ozan, B. O. Acar, T. Ciloglu, E. Esen, T. K. URL https://www.mdpi.com/2079-9292/5/4/74 Ates, ¸ D. O. Önür, Speech detection on broadcast audio, in: 2010 [14] I. Hakala, I. Kivela, J. Ihalainen, J. Luomala, C. Gao, Design 18th European Signal Processing Conference, 2010, pp. 85–89. of low-cost noise measurement sensor network: Sensor func- [30] I. Feki, A. B. Ammar, A. M. Alimi, Audio stream analysis for tion design, in: 2010 First International Conference on Sen- environmental sound classification, in: 2011 International Con- sor Device Technologies and Applications, 2010, pp. 172–179. ference on Multimedia Computing and Systems, 2011, pp. 1–6. doi:10.1109/SENSORDEVICES.2010.39. doi:10.1109/ICMCS.2011.5945607. [15] I. Kivelä, C. Gao, J. Luomala, J. Ihalainen, I. Hakala, Design of [31] E. Fallis, P. Spachos, Power consumption and throughput networked low-cost wireless noise measurement sensors (issn of wireless communication technologies for smartphones, in: 1726-5479), International Journal on Sensors & Transducers 9 2018 Global Information Infrastructure and Networking Sym- (2010) 171–190. posium (GIIS), 2018, pp. 1–4. doi:10.1109/GIIS.2018. [16] S. Santini, B. Ostermaier, A. Vitaletti, First experiences using 8635665. wireless sensor networks for noise pollution monitoring, in: [32] Dsd tech hm-10, (retrieved: 2019-06-19). Proceedings of the Workshop on Real-world Wireless Sensor URL http://www.dsdtech-global.com/2017/08/ Networks, REALWSN ’08, ACM, New York, NY, USA, 2008, hm-10.html pp. 61–65. doi:10.1145/1435473.1435490. [33] Zigbee website, (retrieved: 2019-06-19). URL http://doi.acm.org/10.1145/1435473. URL https://www.zigbee.org/ 1435490 [34] Sparkfun xbee s2c, (retrieved: 2019-06-19). [17] L. Filipponi, S. Santini, A. Vitaletti, Data collection in wireless URL https://www.digi.com/resources/ sensor networks for noise pollution monitoring, in: S. E. Niko- documentation/digidocs/pdfs/90002002.pdf letseas, B. S. Chlebus, D. B. Johnson, B. Krishnamachari (Eds.), [35] Nex Tech Speakers, (retrieved: 2019-04-15). Distributed Computing in Sensor Systems, Springer Berlin Hei- URL https://www.thesource.ca/en-ca/ delberg, Berlin, Heidelberg, 2008, pp. 492–497. computers--and--tablets/computer- [18] M. Zappatore, A. Longo, M. A. Bochicchio, Crowd-sensing our accessories/computer-speakers/nexxtech- smart cities: A platform for noise monitoring and acoustic urban 2-0-stereo-pc-speakers/p/108033507 planning, Journal of Communications Software and Systems. [36] Urban Sound Data Sets, (retrieved: 2019-04-08). [19] I. Aspuru, I. García, K. Herranz, A. Santander, Citi-sense: meth- URL https://urbansounddataset.weebly.com/ ods and tools for empowering citizens to observe acoustic com- [37] Monsoon Power Monitor, (retrieved: 2019-01-30). fort in outdoor public spaces, Noise Mapping 3 (1). URL http://www.msoon.com/LabEquipment/ [20] I. Aspuru, A. Santander, I. García, M. González, A. Bartonova, PowerMonitor/ Citi-sense project: Acoustic comfort in the empowerment initia- tive about environmental quality of public spaces, i: Seminario gaa smart cities e nuovi sensori per la misura del rumore ambi- entale, Associazione Italiana di Acustica Pisa 17. [21] I. Kirillov, V. Bulkin, The mobile system of urban area noise pollution monitoring, in: 2015 Second International Scientific- Practical Conference Problems of Infocommunications Science and Technology (PIC S T), 2015, pp. 200–203. doi:10. 1109/INFOCOMMST.2015.7357312. [22] P. Patil, Smart iot based system for vehicle noise and pollu- tion monitoring, in: 2017 International Conference on Trends in Electronics and Informatics (ICEI), 2017, pp. 322–326. doi: 10.1109/ICOEI.2017.8300941. [23] InvenSense Microphone, (retrieved: 2019-04-08). URL https://www.invensense.com/wp-content/ uploads/2015/02/ICS-40310-datasheet- v1.2.pdf [24] PUI Microphone, (retrieved: 2019-04-08). URL http://www.puiaudio.com/pdf/PMM-3738-

Journal

Electrical Engineering and Systems SciencearXiv (Cornell University)

Published: Jan 21, 2020

References