Incorporating spatial relationship information in signal-to-text processing