yeah man… but honestly I bought my fully loaded R720 server with 256GB ram and 2x CPUs and even hard drives for $250. On eBay. Rails kit was $50 but with a wall mount rack you don’t need rails.
Make an offer … 96GB ram is huge for an AI server… buy a cheaper GPU like a $300 nvidia 3060 with 12GB vram ike I did and slap it into riser card 3… you’ll need a power cable mod like I had to do but you’ll have a well running AI server on the cheap. With 12GB vram you can load almost all 7B models no sweat and they run descent. The trick is to work with a single model and force it to stay loaded into memory… first query takes a while to come back but subsequent queries come back immediately and spit out pretty fast.