yolo_with_voice-recognition we are using yolo to recognize voice. In here we create mel_spectrogrames and bounding boxes.