LineLog
LineLog is an implementation of interleaved deltas.
It provides conflict-free stack editing ability. It is used by the absorb
command.
Bytecode
LineLog uses a bytecode format that is interpreted to produce content. There are 5 instructions:
Name | Operand 1 | Operand 2 | Meaning |
---|---|---|---|
JGE | Rev | Addr | Jump to Addr if Current Rev >= Rev |
J | 0 | Addr | Jump to Addr unconditionally |
JL | Rev | Addr | Jump to Addr if Current Rev < Rev |
LINE | Rev | Line | Append the Line + 1 -th line in Rev |
END | - | - | Stop execution |
Instructions are fixed-sized. The opcode takes 2 bits. J
and JGE
share the
same opcode (J Addr
is just JGE 0 Addr
). Operand 1 takes 30 bits. Operand 2
takes 32 bits.
Interpretation
Example
It is easier to understand with an example. Given a file with 3 revisions:
Rev 1
a
b
c
Rev 2: Inserted 2 lines.
a
b
1
2
c
Rev 3: Deleted 2 lines.
a
2
c
It can be encoded in LineLog bytecode like:
# Addr: Instruction
0: JL 1 8
1: LINE 1 0
2: JGE 3 6
3: LINE 1 1
4: JL 2 7
5: LINE 2 2
6: LINE 2 3
7: LINE 1 2
8: END
To check out a specified revision, set Current Rev
to the revision to check
out, then execute the instructions from the beginning.
Here are the steps to check out each revision:
- Rev 0
- Rev 1
- Rev 2
- Rev 3
- Address 0: JL 1 8: Jump to address 8, because Current Rev (0) < 1.
- Address 8: END: Stop execution. The content is empty.
- Address 0: JL 1 8: Do nothing, because Current Rev (1) is not < 1.
- Address 1: LINE 1 0: Append the first line from rev 1 ("a").
- Address 2: JGE 3 6: Do nothing, because 1 is not ≥ 3.
- Address 3: LINE 1 1: Append the second line from rev 1 ("b").
- Address 4: JL 2 7: Jump to address 7, because 1 < 2.
- Address 7: LINE 1 2: Append the third line from rev 1 ("c").
- Address 8: END: Stop. The final content is "abc".
- Address 0: JL 1 8: Do nothing, because Current Rev (2) is not < 1.
- Address 1: LINE 1 0: Append the first line from rev 1 ("a").
- Address 2: JGE 3 6: Do nothing, because 2 is not ≥ 3.
- Address 3: LINE 1 1: Append the second line from rev 1 ("b").
- Address 4: JL 2 7: Do nothing, because 2 is not < 2.
- Address 5: LINE 2 2: Append the 3rd line from rev 2 ("1").
- Address 6: LINE 2 3: Append the 4th line from rev 2 ("2").
- Address 7: LINE 1 2: Append the third line from rev 1 ("c").
- Address 8: END: Stop. The final content is "ab12c".
- Address 0: JL 1 8: Do nothing, because Current Rev (3) is not < 1.
- Address 1: LINE 1 0: Append the first line from rev 1 ("a").
- Address 2: JGE 3 6: Jump to address 6, because 3 ≥ 3.
- Address 6: LINE 2 3: Append the 4th line from rev 2 ("2").
- Address 7: LINE 1 2: Append the third line from rev 1 ("c").
- Address 8: END: Stop. The final content is "a2c".
Checkout and annotate
Note the lines that are not changed across multiple revisions, such as "a" only
occurs once as LINE 1 0
in the bytecode. The LINE
instruction points to the
revision and line that introduces the line. By tracking the operands of LINE
instructions in addition to line contents, LineLog could also produce the
annotate
(also called blame
) result at the same time.
In LineLog, the checkout and annotate operation are basically the same.
Range of revisions
A variation of the interpretation is to treat "Current Rev" as a range, not a
single fixed revision number. More specifically, given an inclusive range from
minRev
to maxRev
, treat JL
as "< maxRev
", JGE
as ">= minRev
". This
can produce all lines that existed in the revision range, in a reasonable order,
like:
rev 1: a rev 1: b rev 2: 1 rev 2: 2 rev 1: c
Linear history
LineLog assumes linear history. The revision comparisons are done using direct integer comparisons. It might be not too difficult to support non-linear history (i.e. with merges) by changing the revision comparisons to consider the graph topology. But that hasn't been attempted due to lack of use-cases so far.
Editing LineLog
LineLog provides a method for editing: replace_lines(brev, a1, a2, b1, b2)
.
It means replacing the line range [a1, a2)
from the current checkout to line
range [b1, b2)
in the given brev
revision. This covers insertion and
deletion too. If a1
equals to a2
, it is an insertion. If b1
equals to
b2
, it means lines from a1
to a2
are deleted in revision brev
.
This is implemented by appending a block that appends the lines from the
brev
, and removes lines from a
. Then change the LINE
instruction for the
a1
line to point to the added block.
# Before # After
# Addr: Instruction # Addr: Instruction
: ... : ...
a1: <a1's LINE> a1 : J len
a1+1: ... a1+1 : ...
: ... : ...
a2: ... a2 : ...
: ... : ...
len: N/A len : JL brev p
: LINE brev b1
: LINE brev b1+1
: ...
: LINE brev b2-1
p : JGE brev a2
: <a1's LINE> (copied)
: J a1+1
To construct LineLog for a file, one needs to run through the contents of revisions
of the file in commit order, calculate diffs for adjacent revisions, and then
feed LineLog the diffs using the replace_lines
method.
Usually replace_lines
is used to edit the latest revision. However, it can
also be used to edit past revisions, if past revisions are checked out. This
is how the absorb
command works under the hood.