Sunday, 7 August 2016

Bob's Bible at 3/8ths complete

Complete is a relative term. Some things may change as new information appears. The appearance is not always expected. But so far it keeps happening.

Current status: These are the chapters I have done to date. Apart from the Psalms, they are all online beginning 4Q 2014. My text is essentially the Leningrad codex with a few tweaks from Aleppo or the DSS. And sometimes I prefer a ketiv over a qere.

Books Chapters Verses Final % 'done' no draft
1 Torah 187 5843 30 16.0% 4796
2 Former Prophets 147 4297 18 12.2% 3679
3 Later Prophets 166 3923 19 11.4% 3566
4 The 12 67 1050 50 74.6% 257
5 Books of Truth 223 4502 188 84.3% 777
6 The 5 Scrolls 39 745 30 76.9% 135
7 Other 100 2791 13 13.0% 2456
929 23151 348 37.5%

So for example (reading line 1). Torah in my data has 187 chapters and 5843 verses. I have 30 chapters online 'done' on this blog. That's 16.0%. Of the verses, I have no draft English for 4796. If I dove into Torah alone at my target pace of 3 hours a day, it would take me 32 weeks to complete. I am not operating at my target pace. I have a bunch of excuses besides the fact that it is summer. I figure I will be at this for another 2 years or so, maybe more like 3.

But we are at 37.5% of the chapters. It's a start. If I count by word where the word has a 'pretty good' guess (automated), I am just shy of 50% done. But I know there are 47,673 words (about 15.6%) that I have no knowledge of yet. Some (many?) of these are place and person names. And some are letter combinations I have not yet encountered. With a more sophisticated algorithm based on parsing, I might guess some of them. When I see them I will know, but coding 'the rules' that I now intuit (at least a little) is more than I want to do at the moment. I have written two algorithms for parsing, a root derivation routine that I no longer use, and a grammatical analysis routine that is in a decrepit state and could be more usable. But it would take a lot of time to tidy it up. Neither routine uses the Masoretic vowels. It would have been better to encode their decisions, perhaps, but I chose not to. It seemed more important to get into the text rather than to follow decisions I could only understand by rote when I started.

15 books are complete (10 of the 12, Psalms, and 4 of the 5 scrolls). These are the books (24) that are incomplete at this time. The second column is completion percentage. It is based on automation on a phrase by phrase basis. Until there is a draft for a verse, word guesses by phrase are useful but not necessarily acceptable. You can see the impact. Only 8 of 36 chapters of 2 Chronicles are 'done' but 50% of the words are already guessed! I do, however, have to look at every verse to finish this project.

GENESIS 45
EXODUS 46
LEVITICUS 35
NUMBERS 47
DEUTERONOMY 58
JOSHUA 39
JUDGES 39
1 SAMUEL 47
2 SAMUEL 37
1 KINGS 52
2 KINGS 51
ISAIAH 34
JEREMIAH 44
EZEKIEL 32
HOSEA 41
ZECHARIAH 67
PROVERBS 26
JOB 84
ESTHER 28
DANIEL 12
EZRA 22
NEHEMIAH 29
1 CHRONICLES 34
2 CHRONICLES 50


No comments:

Post a Comment