Rethinking Latency Denial-of-Service: Attacking the LLM Serving Framework, Not the Model