[논문 리뷰] Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators 3 분 소요 논문 링크: Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators