Let’s say you’re using an app that involves calls to an LLM; maybe it’s for document analysis, customer support, whatever (feel free to give examples).Would you rather be charged $X/mo flat fee, or some variable amount $f(x) based on your actual usage?Which do you prefer and why?
Been out of touch in this area, for a while (since my last “private cloud” project), during which I had available solutions like HCI of the likes of Nutanix, Cisco, HP and an immature (at the time) AWS Outpost, all with their own HA and storage solution.I am now in position to revisit and maybe…