A term that describes when an AI model is asked to generate a response; it refers to the process of making predictions drawing from new data.