Examples: Add text compression example. #9633

Open
wants to merge 8 commits into master
Conversation

@stduhpf (Contributor) commented Sep 24, 2024

This PR adds an example text compression scheme using a language model. The scheme is not optimal, but it comes fairly close.

Performance:

Tested on this example's own source file against classical compression schemes:

| Size (bytes) | Name |
|---:|---|
| 1480 | compress.cpp.qwen2.5-coder-1.5b-q6-k.bin |
| 1487 | compress.cpp.llama3-8b-q4-k-m.bin |
| 1557 | compress.cpp.starcoder2-3b-q8.bin |
| 3872 | compress.cpp.gz |
| 3878 | compress.cpp.bz2 |
| 3908 | compress.cpp.xz |
| 3983 | compress.cpp.7z |
| 3999 | compress.cpp.zip |

Usage:

Compression:

```sh
./compress --mode compress -m path/to/your/model.gguf -f path/to/the/text/file.txt -o output.bin
```

Decompression:

```sh
./compress --mode expand -m path/to/your/model.gguf -f output.bin -o output.txt
```

Drawbacks

- It's very slow compared to traditional compression schemes.
- It needs the exact same setup for compression and decompression (just changing the number of offloaded GPU layers can alter the model's output enough to introduce errors).

How it works

TODO (I'm bad at explaining things, but please read the code)

@ngxson (Collaborator) commented Sep 25, 2024

Is this the same method as: https://arxiv.org/pdf/2306.04050 ?

@stduhpf (Contributor, Author) commented Sep 25, 2024

> Is this the same method as: https://arxiv.org/pdf/2306.04050 ?

Interesting, thanks for sharing. At first glance, this does look similar to what I'm doing; at least the part about the ranks is the same.
The main difference is the compression format: I'm using a bespoke algorithm here, but maybe arithmetic coding (as in the paper) would be better.
Actually, this scheme would be essentially equivalent to arithmetic coding if the token probabilities decreased exponentially with rank (which is not the case in reality, making it less efficient).
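To make that equivalence concrete, here is a short derivation under the hypothetical assumption of an exactly geometric rank distribution (the assumption itself, and the parameter $q$, are illustrative and not taken from the PR):

```latex
% Suppose the probability of the token at rank r (counting from r = 0)
% is exactly geometric:
%   p(r) = (1 - q) q^r,  0 < q < 1.
% An arithmetic coder then spends
%   -log2 p(r) = r log2(1/q) + log2(1/(1-q))
% bits on rank r. For q = 1/2 this is exactly r + 1 bits, so a simple
% prefix code over ranks (one extra bit per rank step) already achieves
% the arithmetic-coding bound.
```

Real rank distributions deviate from geometric, which is why coding ranks with a fixed scheme loses some bits relative to arithmetic coding over the model's actual probabilities.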

@matteoserva (Contributor) commented:
Just for reference, I found an interesting implementation of arithmetic coding using llama_cpp_python:

https://github.com/AlexBuz/llama-zip
