It's quite common in conversations for the bot to receive images or voice notes. Considering GPT 4o already has vision and audio, maybe it's a possibility to integrate those capabilities with CloseBot? It would take the bot to a whole other level.