TY - GENERIC DO - 10.20944/preprints202409.1208.v1 UR - http://dx.doi.org/10.20944/preprints202409.1208.v1 TI - Efficient Hybrid Inference for LLMs: Reward-Based Token Modelling with Selective Cloud Assistance T2 - Preprints AU - MS, Adarsh AU - VG, Jithin AU - PS, Ditto PY - 2024 DA - 2024/09/17 PB - Preprints