### Expected behavior Because it always takes up 3GB of video memory after calling the api, now I will add auto.py ,204 line async def api_category_mask(payload: CategoryMaskRequest = Body(...), .... clear_sem_sam_cache() return result