Urban air pollution presents significant health risks, requiring effective monitoring, forecasting and controlling strategies. Comprehensive monitoring is often hindered by the limited availability of measurement data. This study introduces a seq2seq model designed to perform operational forecasting of air pollutant concentrations at an unmonitored site using an upwind sensor. The effectiveness of seq2seq model is systematically evaluated through test cases that aim to explore effects of several influencing factors, including network architecture, embedding method, model complexity, and sensor placement. The test cases involve 252 seq2seq model candidates, which are trained and tested on a synthetic dataset established using a validated large eddy simulation (LES) model for the typical street canyon urban setting, ensuring controlled conditions and reproducibility. Additionally, scheduled sampling is used during model training to mitigate error accumulation. Results demonstrate that a decoder-only model is only capable of making flatline predictions, while a well-tuned seq2seq model informed by a strategically placed upwind sensor provides reasonable operational predictions. Despite its simplicity, the linear network, using the positional embedding, a two-layer structure, and the sensor placed 0.17H above the ground, exhibits the best performance among seq2seq models. The study also challenges a priori belief that favors higher sensor locations based on statistical similarity measures, as the sensor at 0.17H enables the best performing model. These findings underscore the potential of seq2seq models to enhance urban air quality monitoring, offering a robust scientific basis for informed urban planning and pollution management strategies.