Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models
Comments I found a big mistake in the paper that causes significant bias on the results. The residual links are not taken into consideration when computing the transmission. All results about the compressed data size and transmission latency would be affected