Show sutra text #33

shreevatsa · 2023-01-09T03:29:41Z

Hacky code for #26, but it actually seems to work:

Before merging I guess at minimum we should copy over https://raw.githubusercontent.com/ashtadhyayi-com/data/master/sutraani/data.txt so that it's hosted here — maybe convert it to a simple data/sutrapatha.tsv as suggested at #26 (comment) that has entries like

8.4.68	अ अ

— but just pushing this commit for now as a savepoint for when I return to this later (not today). (Or if I don't return 😱 )
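For illustration, a file in the proposed layout could be read back into a lookup table roughly as follows. This is only a sketch: the function name `parse_sutrapatha_tsv` is hypothetical, and the sample row is the single entry quoted above; the real file would have one row per sutra.

```python
import csv
import io


def parse_sutrapatha_tsv(lines):
    """Map sutra codes such as '8.4.68' to their text.

    `lines` is any iterable of text lines, e.g. an open file.
    (Hypothetical helper, not part of the PR.)
    """
    table = {}
    for row in csv.reader(lines, dialect="excel-tab"):
        if not row:
            continue  # tolerate blank lines
        code, text = row[0], row[1]
        table[code] = text
    return table


# The one example row quoted in this comment (code, TAB, text).
SAMPLE = "8.4.68\tअ अ\n"
TABLE = parse_sutrapatha_tsv(io.StringIO(SAMPLE))
print(TABLE["8.4.68"])
```

Only the tab acts as a delimiter, so the space inside the sutra text is preserved.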

(Curious how much faster it will get when we change from array of 4000 items to JS object for faster lookup…)
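A rough Python analogue of that change, to show the shape of the two lookups: a linear scan over a list of roughly 4000 pairs versus a keyed lookup in a dict (the counterpart of a JS object). The entries are synthetic placeholders rather than real sutra codes or text, and any timings depend on the machine, so none are quoted here.

```python
import timeit

# Synthetic stand-ins for ~4000 (code, text) pairs; not real sutra data.
entries = [(f"1.1.{n}", f"text {n}") for n in range(1, 4001)]
by_code = dict(entries)


def find_in_list(code):
    """Linear scan: cost grows with the number of entries."""
    for key, text in entries:
        if key == code:
            return text
    return None


def find_in_dict(code):
    """Hash lookup: roughly constant cost per query."""
    return by_code.get(code)


if __name__ == "__main__":
    worst_case = "1.1.4000"  # last entry, so the scan visits every item
    print("list:", timeit.timeit(lambda: find_in_list(worst_case), number=1000))
    print("dict:", timeit.timeit(lambda: find_in_dict(worst_case), number=1000))
```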

Generated with the following python script:

```
import requests, json
from indic_transliteration import sanscript

data = json.loads(requests.get('https://raw.githubusercontent.com/ashtadhyayi-com/data/master/sutraani/data.txt').text)['data']
print(len(data))
out = {}
for sutra in data:
    name = f"{sutra['a']}.{sutra['p']}.{sutra['n']}"
    text = sutra['s']
    slp1 = sanscript.transliterate(text, sanscript.DEVANAGARI, sanscript.SLP1)
    if slp1 == 'kftyErfRe':
        back = sanscript.transliterate('kftyEr fRe', sanscript.SLP1, sanscript.DEVANAGARI).replace(' ', '')
    elif slp1 == 'urft':
        back = sanscript.transliterate('ur ft', sanscript.SLP1, sanscript.DEVANAGARI).replace(' ', '')
    else:
        back = sanscript.transliterate(slp1, sanscript.SLP1, sanscript.DEVANAGARI)
    assert back == text, (text, slp1, back)
    out[name] = slp1
with open('sutrapatha.json', 'w') as f:
    json.dump(out, f, indent=2)
    f.write('\n')
```

shreevatsa · 2023-01-09T04:42:43Z

The async is kind of weird but it works, and loading sutrapatha.json doesn't block the rest of the app from loading, so this is probably best? Please take a look and see whether it's ready to merge.
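The non-blocking idea, sketched in Python with `asyncio` purely as an analogy (the demo itself is JavaScript, and this is not the PR's code): start the load as a background task, carry on with the rest of the setup, and only wait for the data at the point where it is needed. `load_sutrapatha` is a hypothetical stand-in that sleeps instead of fetching a file.

```python
import asyncio


async def load_sutrapatha():
    """Hypothetical stand-in for fetching sutrapatha.json."""
    await asyncio.sleep(0.01)  # simulates network latency
    return {"1.3.9": "tasya lopaH"}


async def main():
    events = []
    # Kick off the load without waiting for it to finish.
    task = asyncio.create_task(load_sutrapatha())
    # The rest of the app proceeds while the load is still pending.
    events.append("app rendered")
    # Wait only where the sutra text is actually needed.
    table = await task
    events.append(f"sutra text loaded ({len(table)} entries)")
    return events


EVENTS = asyncio.run(main())
print(EVENTS)
```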

Aside: note that transliterating to SLP1 makes it hard (AFAICT) to recover the original Devanagari (which is debatable in the first place) for:

- `kftyErfRe` (turns कृत्यैर्ऋणे into कृत्यैरृणे), and
- `urft` (turns उर्ऋत् into उरृत्)

— see indic-transliteration/indic_transliteration_py#75 (this is the kind of thing that I'm hoping a Rust transliteration library would fix by being "pedantic" and requiring specifying a strategy instead of making ad-hoc choices, but it's probably fine for now).
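To make the ambiguity concrete, here is a toy sketch that does not use indic_transliteration at all. It handles only the three Devanagari code points involved, and `toy_to_slp1` is a hypothetical helper written for this illustration. Two different Devanagari spellings collapse to the same SLP1 string `rf`, so a reverse transliterator has to pick one of them.

```python
RA = "\u0930"              # र
VIRAMA = "\u094D"          # ्
VOCALIC_R = "\u090B"       # ऋ (independent vowel)
SIGN_VOCALIC_R = "\u0943"  # ृ (dependent vowel sign)


def toy_to_slp1(text):
    """Transliterate a string made only of the code points above (toy)."""
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        nxt = text[i + 1] if i + 1 < len(text) else ""
        if ch == RA and nxt == VIRAMA:
            out.append("r")   # consonant with its inherent vowel suppressed
            i += 2
        elif ch == RA and nxt == SIGN_VOCALIC_R:
            out.append("rf")  # consonant carrying the vowel sign
            i += 2
        elif ch == RA:
            out.append("ra")  # bare consonant keeps the inherent 'a'
            i += 1
        elif ch == VOCALIC_R:
            out.append("f")
            i += 1
        else:
            raise ValueError(f"unsupported character: {ch!r}")
    return "".join(out)


with_virama = RA + VIRAMA + VOCALIC_R  # र्ऋ, as in कृत्यैर्ऋणे
with_sign = RA + SIGN_VOCALIC_R        # रृ, what the round trip produces

print(toy_to_slp1(with_virama), toy_to_slp1(with_sign))
```

Both inputs map to `rf`, which is why the first script above inserted a space (`'kftyEr fRe'`, `'ur ft'`) to force the virama form before converting back.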

shreevatsa · 2023-01-09T05:08:37Z

Made a small change, updated screenshot (tested with manual removal of "1.3.9": "tasya lopaH" from sutrapatha.json) — no error messages in JS console:

Generated with:

```py
import requests, json, csv
from indic_transliteration import sanscript

data = json.loads(requests.get('https://raw.githubusercontent.com/ashtadhyayi-com/data/master/sutraani/data.txt').text)['data']
print(len(data))
out = []
for sutra in data:
    name = f"{sutra['a']}.{sutra['p']}.{sutra['n']}"
    text = sutra['s']
    slp1 = sanscript.transliterate(text, sanscript.DEVANAGARI, sanscript.SLP1)
    # if slp1 == 'kftyErfRe':
    #     back = sanscript.transliterate('kftyEr fRe', sanscript.SLP1, sanscript.DEVANAGARI).replace(' ', '')
    # elif slp1 == 'urft':
    #     back = sanscript.transliterate('ur ft', sanscript.SLP1, sanscript.DEVANAGARI).replace(' ', '')
    # else:
    #     back = sanscript.transliterate(slp1, sanscript.SLP1, sanscript.DEVANAGARI)
    # assert back == text, (text, slp1, back)
    out.append((name, slp1))
with open('sutrapatha.tsv', 'w', newline='') as f:
    writer = csv.writer(f, dialect='excel-tab')
    writer.writerows(out)
```
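The missing-entry case tested above (removing "1.3.9" from the data) comes down to a lookup that degrades gracefully instead of raising. A minimal Python sketch of that behaviour follows; the demo itself is JavaScript, and both the helper name `sutra_text` and the placeholder string are assumptions made for illustration.

```python
def sutra_text(table, code, placeholder="(sutra text unavailable)"):
    """Return the text for `code`, or a placeholder if it is missing.

    Hypothetical helper; the placeholder wording is an assumption.
    """
    return table.get(code, placeholder)


# A table with "1.3.9" deliberately left out, mirroring the manual test.
TABLE = {"8.4.68": "a a"}

print(sutra_text(TABLE, "8.4.68"))
print(sutra_text(TABLE, "1.3.9"))
```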

akprasad · 2023-01-09T15:48:07Z

wonderful -- thank you!!

akprasad · 2024-01-24T03:27:48Z

> this is the kind of thing that I'm hoping a Rust transliteration library would fix by being "pedantic" and requiring specifying a strategy instead of making ad-hoc choices, but it's probably fine for now).

I'd love to discuss this with you further now that we have a starter implementation (https://ambuda-org.github.io/vidyut-lipi/).

Show sutra text

fdfad5d

shreevatsa mentioned this pull request Jan 9, 2023

prakriya demo: Set up instructions #32

Closed

shreevatsa added 3 commits January 8, 2023 19:57

Repeat final form at the end, for ambuda-org#25

40b87ba

Use sutrapatha.json

253c7c2

shreevatsa requested a review from akprasad January 9, 2023 04:36

Separate table cell, and error handling when missing text.

281d78f

shreevatsa force-pushed the sutraText branch from 87ee11c to 281d78f Compare January 9, 2023 05:08

akprasad approved these changes Jan 9, 2023

akprasad merged commit 2edcd9b into ambuda-org:main Jan 9, 2023

akprasad mentioned this pull request Jan 9, 2023

prakriya: show final form at the end #25

Closed

shreevatsa mentioned this pull request Jan 25, 2024

vidyut-lipi needs to handle the colon separator in ISO-15919 #103

Open

akprasad mentioned this pull request Jan 26, 2024

Decide on and implement an error-handling policy #105

Open
