Behavior Encoder Transformer BETR: A Transformer Encoder for Predicting Agent Behavior

Document Type : Original Article

Authors

1 Electrical Communication Engineering, Port Said University.

2 Electrical Communication and Electronics, Faculty of Engineering, Port Said University

3 Computer and Control Engineering, Port Said University

4 Electrical Engineering Department, Faculty of Engineering, Port Said University, Port Said, Egypt.

Abstract

Anticipating the future behavior of road users stands as one of the most formidable challenges in the

realm of autonomous driving. Achieving a comprehensive understanding of the dynamic driving environment requires an autonomous vehicle to accurately predict the motion of other traffic participants within the scene. As the complexity of motion prediction tasks increases, capturing

intricate spatial relationships, temporal dependencies, and nuanced interactions between agents and map elements becomes crucial. Our proposed hierarchical architecture strategically incorporates transformers, effectively modeling both local and global representations to extract multi-

scale features. Leveraging the potency of transformers, BETR adeptly captures and encodes intricate patterns of agent interactions, spatial dependencies, and temporal dynamics. Demonstrating superior performance in predicting agent behavior compared to conventional methods, BETR proves its efficacy through extensive experiments. Its capacity to adapt to diverse scenarios establishes BETR

as a robust and versatile solution for the intricate task of agent behavior prediction.

Keywords

Main Subjects