Forum: >>> Magnum BBS <<<

Dark
Log in

Username Password

Basics of LZ77 Algorithm

From [email protected]@21:1/5 to All on Tue Jun 11 08:11:11 2019

Hello,

I am trying to understand how LZ77 algorithm work. From what I read from various sources, I have come to following conclusion:

According to LZ77 compression algorithm, if I encode jump and length using 4 bits, and character as 8 bits, I will use 16bits for each token. If the text I am compressing doesn't have any repetition, I will actually double the size of my input.

I was wondering if I had arrived to correct conclusion, because it doesn't sound right.

Thanks in advance,

Yaşar Arabacı

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Scott@21:1/5 to [email protected] on Tue Jun 11 22:05:38 2019

On Tue, 11 Jun 2019 08:11:11 -0700 (PDT), [email protected] wrote:

I am trying to understand how LZ77 algorithm work. From what I read from va= >rious sources, I have come to following conclusion:

According to LZ77 compression algorithm, if I encode jump and length using = >4 bits, and character as 8 bits, I will use 16bits for each token. If the t= >ext I am compressing doesn't have any repetition, I will actually double th= >e size of my input.

I was wondering if I had arrived to correct conclusion, because it doesn't = >sound right.

It does sound odd at first, but that's more or less right. It's a
fundamental fact of information theory and applies to all (lossless) compression methods. Basically, over the set of all possible messages
of length N, the average length of the corresponding compressed
messages is also N.

So yes, every method will have certain inputs that produce larger
outputs. The trick to practical compression is to find algorithms that
work well with patterns that you find in certain useful inputs, which
AIUI is how LZ was designed. Then you check as you go, and if you have
a chunk that is anti-compressible, you just store it without
compression.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From [email protected]@21:1/5 to [email protected] on Mon Jul 1 10:49:13 2019

On Tuesday, June 11, 2019 at 4:11:12 PM UTC+1, [email protected] wrote:

Hello,

I am trying to understand how LZ77 algorithm work. From what I read from various sources, I have come to following conclusion:

According to LZ77 compression algorithm, if I encode jump and length using 4 bits, and character as 8 bits, I will use 16bits for each token. If the text I am compressing doesn't have any repetition, I will actually double the size of my input.

I was wondering if I had arrived to correct conclusion, because it doesn't sound right.

Thanks in advance,

Yaşar Arabacı

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online
Recent Visitors
- Rixter
  Wed Jul 29 02:00:40 2026
  from Madison, Nc via Telnet
- Centurion
  Tue Jul 28 22:54:59 2026
  from Berea, Ohio via Telnet
- Bob Worm
  Tue Jul 28 16:01:18 2026
  from Wales, Uk via Telnet
- Rixter
  Tue Jul 28 13:42:46 2026
  from Madison, Nc via Telnet
- Krenn
  Tue Jul 28 11:59:57 2026
  from Sydney, Nsw via Telnet
- Rixter
  Tue Jul 28 01:23:48 2026
  from Madison, Nc via Telnet
- Centurion
  Mon Jul 27 22:50:42 2026
  from Berea, Ohio via Telnet
- Ataricrypt
  Mon Jul 27 19:19:17 2026
  from England via Telnet

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	741
Nodes:	16 (2 / 14)
Uptime:	57:02:38
Calls:	12,446
Calls today:	1
Files:	15,192
Messages:	6,537,379