Abstract
The use of one-bit analog-to-digital converters (ADCs) at a receiver is a power-efficient solution for future wireless systems operating with a large signal bandwidth and/or a massive number of receive radio frequency chains. This solution, however, induces high channel estimation error and therefore makes it difficult to perform the optimal data detection that requires perfect knowledge of likelihood functions at the receiver. In this paper, we propose a likelihood function learning method for multiple-input multiple-output (MIMO) systems with one-bit ADCs using a reinforcement learning approach. The key idea is to exploit input-output samples obtained from data detection, to compensate for the mismatch in the likelihood function. The underlying difficulty of this idea is a label uncertainty in the samples caused by a data detection error. To resolve this problem, we define a Markov decision process (MDP) to maximize the accuracy of the likelihood function learned from the samples. We then develop a reinforcement learning algorithm that efficiently finds the optimal policy by approximating the transition function and the optimal state of the MDP. Simulation results demonstrate that the proposed method provides significant performance gains for data detection methods that suffer from the mismatch in the likelihood function.
Original language | English (US) |
---|---|
Article number | 8924916 |
Pages (from-to) | 1663-1676 |
Number of pages | 14 |
Journal | IEEE Transactions on Wireless Communications |
Volume | 19 |
Issue number | 3 |
DOIs | |
State | Published - Mar 2020 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Electrical and Electronic Engineering
- Applied Mathematics
Keywords
- Multiple-input-multiple-output (MIMO)
- likelihood function learning
- one-bit analog-to-digital converter (ADC)
- reinforcement learning
- robust data detection