TY - JOUR
T1 - Large-Area Microphone Array for Audio Source Separation Based on a Hybrid Architecture Exploiting Thin-Film Electronics and CMOS
AU - Sanz-Robinson, Josue
AU - Huang, Liechao
AU - Moy, Tiffany
AU - Rieutort-Louis, Warren
AU - Hu, Yingzhe
AU - Wagner, Sigurd
AU - Sturm, James C.
AU - Verma, Naveen
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/4
Y1 - 2016/4
N2 - We present a system for reconstructing-independent voice commands from two simultaneous speakers, based on an array of spatially distributed microphones. It adopts a hybrid architecture, combining large-area electronics (LAE), which enables a physically expansive array (> 1m width), and a CMOS IC, which provides superior transistors for readout and signal processing. The array enables us to: 1) select microphones closest to the speakers to receive the highest SNR signal; 2) use multiple spatially diverse microphones to enhance robustness to variations due to microphones and sound propagation in a practical room. Each channel consists of a thin-film transducer formed from polyvinylidene fluoride (PVDF), a piezopolymer, and a localized amplifier composed of amorphous silicon (a-Si) thin-film transistors (TFTs). Each channel is sequentially sampled by a TFT scanning circuit, to reduce the number of interfaces between the large-area electronics (LAE) and CMOS IC. A reconstruction algorithm is proposed, which exploits the measured transfer function between each speaker and microphone, to separate two simultaneous speakers. The algorithm overcomes 1) sampling-rate limitations of the scanning circuits and 2) sensitivities to microphone placement and directionality. An entire system with eight channels is demonstrated, acquiring and reconstructing two simultaneous audio signals at 2 m distance from the array achieving a signal-to-interferer (SIR) ratio improvement of ∼ 12 dB.
AB - We present a system for reconstructing-independent voice commands from two simultaneous speakers, based on an array of spatially distributed microphones. It adopts a hybrid architecture, combining large-area electronics (LAE), which enables a physically expansive array (> 1m width), and a CMOS IC, which provides superior transistors for readout and signal processing. The array enables us to: 1) select microphones closest to the speakers to receive the highest SNR signal; 2) use multiple spatially diverse microphones to enhance robustness to variations due to microphones and sound propagation in a practical room. Each channel consists of a thin-film transducer formed from polyvinylidene fluoride (PVDF), a piezopolymer, and a localized amplifier composed of amorphous silicon (a-Si) thin-film transistors (TFTs). Each channel is sequentially sampled by a TFT scanning circuit, to reduce the number of interfaces between the large-area electronics (LAE) and CMOS IC. A reconstruction algorithm is proposed, which exploits the measured transfer function between each speaker and microphone, to separate two simultaneous speakers. The algorithm overcomes 1) sampling-rate limitations of the scanning circuits and 2) sensitivities to microphone placement and directionality. An entire system with eight channels is demonstrated, acquiring and reconstructing two simultaneous audio signals at 2 m distance from the array achieving a signal-to-interferer (SIR) ratio improvement of ∼ 12 dB.
KW - Amorphous silicon (a-Si)
KW - critically sampled
KW - flexible electronics
KW - large area electronics
KW - microphone array
KW - source separation
KW - thin-film
KW - thin-film transistors (TFT)
UR - http://www.scopus.com/inward/record.url?scp=84963986401&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84963986401&partnerID=8YFLogxK
U2 - 10.1109/JSSC.2015.2501426
DO - 10.1109/JSSC.2015.2501426
M3 - Article
AN - SCOPUS:84963986401
SN - 0018-9200
VL - 51
SP - 979
EP - 991
JO - IEEE Journal of Solid-State Circuits
JF - IEEE Journal of Solid-State Circuits
IS - 4
M1 - 7361744
ER -