Code Monkey home page Code Monkey logo

tfc-khmer-epigraphy's Introduction

License: CC-BY Open Source Love

The repository tfc-khmer-epigraphy

This repository assembles data produced by the project Corpus des inscriptions khmères (before and during the DHARMA project).

Encoding log

We maintain in .md format an encoding log that can be viewed through https://github.com/erc-dharma/tfc-khmer-epigraphy/blob/master/texts/CIK_encoding_log.md.

Repository Static page

You can consult the current state of the work on this repository at https://erc-dharma.github.io/tfc-khmer-epigraphy/.

Encoding intellectual authorship

  • indicate only the name of the original editor if no verification against the stone/estampage has been possible at any stage between the original printed edition and your digital edition
  • indicate only your name if you are responsible both for encoding and for a thoroughly revised reading of the text: if you can honestly claim to have done a more thorough job than your predecessor, even if the difference in terms of meaning of the text is minimal
  • indicate both the original editor’s name and your name in case of doubt
  • indicate Dominic’s, Dominique’s or Arlo's name if you are using a txt file that records a revised edition by one of us, and leave away the name of the original editor if the same condition applies to the relevant person among us as the one applying to you in the second bullet point above
  • leave away your own ONLY if you are really doing no more than applying xml tags and you are not verifying against an estampage (but in principle, this should only happen when no reproduction can be obtained)

Basic GIT guidance

See https://training.github.com/kit/downloads/fr/github-git-cheat-sheet.pdf

Things you always need to do

Command Line (Terminal.app) Users

  • Open Terminal.app.
  • Then make sure you are in your proper working directory.
    • If you don't know where you are, use pwd.
    • To navigate between folders with the Terminal, use command such as
      • cd to change your working folder (cd .. allows you to return in the parent level folder while cd folderName1/folderName2 allow to jump directly into folderName2 in one command line).
      • if you are lost in your folders, use pwd, it will give you your position.
    • type the key TAB once you’ve filled in 3 or 4 of the first letters of a directory name: Terminal will fill in the rest if there’s no other directory beginning with the same letters.
  • Once you’re there, type the following command followed by ENTER:
    • git pull
      • do this every time you start working, to get most recent version
      • also do it every time you are going to do the following steps
  • Once you start making edits to file, you can use
    • git add -A
      • use this every time you end a working session if you have made any changes needing to be recorded
      • makes all files from your copy of the repository ready to be committed
      • so this is a shortcut avoiding the need to type every filename per changed file after git add
    • git commit -m "message describing last changes"
      • this needs to be done at the end of every working session if you have made any changes needing to be recorded
    • git push
      • this needs to be done if you have done git commit, for others to get your changes with a pull
    • git checkout -f
      • this helps to get out of the problem ‘Your local changes to the following files would be overwritten by merge'
      • do git pull after this

If your regular command line screen suddenly changes

  • This may happen if you forget to type -m "..." during a git commit, or if you forget to do a git pull before a git add git commit.
  • Don't do anything else than this: type :q and ENTER
  • It will allow you to quit the strange screen and return to the normal command line screen

Desktop Users

  • Open the Github Desktop
    • If it isn't in your Dock yet, find it under Applications
  • Make sure you are in the right repository under "Current repository".
  • Be sure the Desktop has fetched the changes that could have been made in Github repository online under "Fetch Origin"
  • Once you start making edits
    • They will pill up under "Changes"
      • Write your commit message
      • Click the "Commit to master" button
      • Click on "Pull Origin"

Help and Guides

Validation

All DHARMA XML are validated against Epidoc schema in its latest version as well as against DHARMA ones. You can declare those schemas in the processing instructions of every XML files. To use it with Oxygen, you might need to change the default configuration. In the toolbar, go to the Options > Preferences, a window will open, select XML > XML Parser > RelaxNG. The check box for "Add default attributes values" must be unchecked. Click on Apply. You might need to restart Oxygen for it to be taken into account.

  • DHARMA validation schemas
    • a RelaxNG with embedded Schematron code is available here.
    • a Schematron with embedded Schematron Quick Fixes to complete the first schema is available here Note that both schemas are needed to a complete validation of DHARMA encoding rules

If you use an online validation processing, the processing instructions should be as followed

<?xml-model href="https://raw.githubusercontent.com/erc-dharma/project-documentation/master/schema/latest/DHARMA_Schema.rng" type="application/xml" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="https://raw.githubusercontent.com/erc-dharma/project-documentation/master/schema/latest/DHARMA_Schema.rng" type="application/xml" schematypens="http://purl.oclc.org/dsdl/schematron"?>
<?xml-model href="https://raw.githubusercontent.com/erc-dharma/project-documentation/master/schema/latest/DHARMA_SQF.sch" type="application/xml" schematypens="http://purl.oclc.org/dsdl/schematron"?>
<?xml-model href="http://www.stoa.org/epidoc/schema/latest/tei-epidoc.rng" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="http://www.stoa.org/epidoc/schema/latest/tei-epidoc.rng" schematypens="http://purl.oclc.org/dsdl/schematron"?>

However, if you need to do it locally, you can access all DHARMA RelexNG and Schematron in their latest version in the projection-documentation repository under schema/latest. In this case, you will need to update the @href and provide the path between you current file and the schema itself (either as a standalone file or either as a part of the project-documentation repository if you have cloned it)

Workflow

  • Before working with this repository, make sure you always the latest version through a git pull
  • Create a XML file or edit an existing one.
  • Once, you are done use git to add, commit and push it on the repository.
  • Once you have done your git push, the XML files are validated against the Epidoc and Dharma RelaxNG with a CI service called Travis. This process is made possible with the existing .travis.yml file. Do not delete or modify this file on your own. You can access the resulting log here
  • On the git push, we are also using github Actions service to transforme automatically the XML into a HTML version. We are using Ant in a java environment to do this batch transformation. The steps are declare in the build.xml file, while the automated scenario is provided under .github/workflows/editorial.yml Do not delete or modify those files on your own.
    • Step 1: it will editorially edit all the XMLs in the repository.
    • Step 2: from those edited XMLs, it will create a HTML output and retrieve Zotero data as well.
    • Step 3: it will push those files in the repository under editedxml and htmloutput folders.
    • Step 4: to access those newly created files, make a git pull
    • Step 5: all the HTML are available as a static website thanks to GitHub pages service. Note that the README is the by default page if no XMLs are stored in the repository: https://erc-dharma.github.io/tfc-khmer-epigraphy/

Please note that this process can break easily and requires high quality data. The content log of the pipeline are available under Actions tab of the repository, if you need to find the reason behind an error.

DHARMA XML workflow

License & Attribution

All the edited XML files of this repository are available under the Creative Commons CC-BY 4.0 License, meaning you are free to use it for any purpose, commercial or non-commercial! However, we ask you to mention the project name, funder as well as the members involved in the XML edition of the inscriptions. Feel free to contact the project for more infos at [email protected].

Contributing to this repository

Thanks for your interest in contributing! Currently only members of the project are allowed to contribute to this repository. Contact the researchers in charge of this corpus if you are interested in working with them.

tfc-khmer-epigraphy's People

Contributors

ajaniak avatar arlogriffiths avatar chhomkunthea avatar chloechollet avatar danbalogh avatar dg2018 avatar dominiquesoutif avatar michaelnmmeyer avatar salomepichon avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

tfc-khmer-epigraphy's Issues

Doublons

Duplicated files under texts/xml-provisional/doublon-to-check should be excluded from the repository (by adding this directory in .gitignore, for instance). Otherwise it's impossible to tell which files are supposed to be validated and which are not.

DHARMA_INSCIK00172 - Zotero Validation Issue

Dear @chloechollet,

In your edition of K. 172, you are using the Zotero Short Title Coedes1911_01. It matches 2 entries in Zotero Library: « Études Cambodgiennes : 2. Une inscription du sixième siècle çaka, 3. Une nouvelle inscription du Phnoṃ Bàkheṅ, IK 496, 4. La grotte de Pơṅ Práḥ Thvãr (Phnoṃ Kulèn), 5. Une inscription d’Udayādityavarman I, 6. Des édicules appelés “bibliothèques” », Bulletin de l’École française d’Extrême-Orient 11 (3-4), p. 393-396, 396-398, 398-400, 400-404, 405-406, [Réimpr. 1989, APK 1, p. 3-6, 6-8, 8-10, 10-15, 15-16]. and Etudes cambodgiennes I: la légende de la Nāgī.
Can you see if it is possible to resolve this?

Thank you for your help.
Best

Validation error on K. 1238

Dear @chhomkunthea,

  • The first issue regarding this file is located in this chunck
<lb n="A1"/><g type="symbol" subtype="initialGomutra"/><num value="958">958</num> śaka ṣaṣṭhi roc· <unclear>qā</unclear>ṣādha sauravāra nu vraḥ kamrateṅ· qañ· śri narendravar<unclear>mma</unclear> sruk· <unclear>da</unclear>nlāṅ· khloñ· mukha mahā<ra>nasa <g type="symbol" subtype="egg"/> paṅgaṁ

There is a weird <ra> inside mahā<ra>nasa. Can you let me know what was your intent please?

  • There is also a missing @reason on the <supplied>
<lb n="B3"/>cat<supplied>uḥ</supplied><unclear>ś</unclear>āla mvāya sn<unclear>o</unclear>ṅ· dravya <unclear>ta</unclear> roḥha <unclear>n</unclear>oḥha ti qaña qoy· sre noḥ <unclear>nu</unclear> pada<unclear>ḥ</unclear> noḥha ta kvan· qañ· ta jmaḥ loña śriviṣnu nu qnak· khlo

Let me know if you need my help to handle those cases,
Best,

ZST issue : IT

dear @chloechollet,

Could you resolve the issue regarding "bib:IT" used in the tfc-khmer-epigraphy/texts/xml-provisional/Chloe/DHARMA_INSCIK01085.xml ?

Many thanks,
Best,

Validation error on K. 1241

Dear @chhomkunthea,

there is a validation issue on this file because of <mra> identified as a tag:

<lb n="12"/><gap reason="lost" quantity="4" unit="character"/> bhay· moya knap· mā<unclear>s·</unclear> moya ta vraḥ ka<mra>tāṅ· qañ· ge hvatta nā

Do you need my help to settle this?

Best,

<num> with numbers written in letters

Dear @chloechollet and @chhomkunthea,

In some of the XML files, you have used the tag <num> to encode numbers written in letters. The EG states that only numbers written with arabic numbers or symbols should be encoded.

If you need to keep such an encoding, let me know, I will figure something out for you.

Best,
Axelle

Validation error on K. 333

Dear @chhomkunthea,

The issue regarding this file can be easily settled. There are 2 <supplied> tags used without @reason:

  • XML line 183 for
<p n="A34-36">Équipe de serviteurs de la commune de Carāṃ, <foreign>pramān</foreign> de Qamoghapura <supplied >suit une liste comprenant, dans l’ordre : 1 <foreign>qamraḥ</foreign>, 1 <foreign>gho</foreign>, 1 <foreign>tai</foreign>, 1 <foreign>sī pau</foreign>, 1 <foreign>sī rat</foreign>, 2 <foreign>tai</foreign>, 1 <foreign>gvāl</foreign></supplied> ;</p>
  • XML line 200 for
<p n="B46-47">Équipe de serviteurs de la commune de Travāṅ Kbas : <supplied >suit une liste de serviteurs comprenant, dans l’ordre : 1 <foreign>qamraḥ</foreign>, 1 <foreign>gvāl</foreign></supplied> ;</p>

Let me know if you need help
Best,

Bibliography problem K0410.xml

Chère Axelle,

La présentation des balises <bibl> pour cette inscription en html montre quelques incohérences que je n'arrive pas à comprendre. Compare ce qui se passe correctement pour Aymonier (dans la bibliographie) avec les présentations bizarres de Cœdès (dans la bibliographie également) ou encore de Griffiths & Soutif (dans le commentaire).
Capture d’écran 2020-11-03 à 15 13 15

Merci !

Chloé

Updating encoding : Line numbering

Dear @erc-dharma/khmer-epigraphy,

Daniel has updated the numbering pattern. The content of @n for <lb/>must be unique in the file. See EG, p. 21 for more informations.
Could you please check the following files:

Let me know if you need help figuring out the new numbering pattern.
Best,

K. 1354

Dear @chloechollet,

There are some <persName> inside your encoding of the K.1354. Arlo asked me to inform you that they need to be deleted. However if you want to keep them because of the research interest. I can show you how to handle your versions with Git Branches as well as write you as XSLT to make it easier for you to create those 2 versions: one without <persName> for the project and one with <persName> for your own purpose.

Let me know.

Best,
Axelle

Language code "eng"

code: <div type="translation" xml:lang="eng" source="bib:Goodall2020_01"> est refusé par le schéma. En revanche, si je mets "en" le schéma est accepté. Or, selon EGC app. D, le code à mettre est bien "eng".

DHARMA_INSCIK00293-5 - Zotero Short Title Issue

Dear @chhomkunthea,

In your edition of K 293-5, you have used the Zotero Short Title Coedes1928_01. It matches 2 entries in Zotero Études Cambodgiennes : 19. La date du Bàyon, 20. Les capitales de Jayavarman II », Bulletin de l’École française d’Extrême-Orient 28 (1-2), p. 81-112, 113-123, [Réimpr. 1989, APK 1, p. 95-117, 127-137]. and L’inscription de Tuol Komnap Ta Kin. Can you check this please?

Thanks

encoding of "Numeral symbols other than decimal digits" (EGD 4.2.2)

@michaelnmmeyer — in tfc-khmer-epigraphy, there is a massive number of <num> elements whose contents are made up of symbols other than decimal digits that have not been wrapped in <g type="numeral"> by the responsible encoder(s) as EGD 4.2.2 prescribes. Examples:

  • <num value="1">I</num> should be <num value="1"><g type="numeral">I</g></num>
  • <num value="4">IIII</num>should be <num value="4"><g type="numeral">IIII</g></num>
  • <num value="123">100 20III</num> should be <num value="123"><g type="numeral">100</g><g type="numeral">20</g><g type="numeral">III</g></num>

There are also cases like <num value="80">80</num> which look like they contain decimal digits but where the transliteration is probably a representation of a non-decimal notation system, and so ought to be <num value="80"><g type-numeral>80</g></num> (as in the 123 example above). But there is no way for a machine to tell that these are not decimal units.

@chhomkunthea : do we ever have numbers noted with the decimal system outside of dates in the Khmer corpus? If we do not, then all such cases can automatically be converted to the encoding with <g>. You seem to have ignored EGD 4.2.2 so far. Please re-read it carefully.

Can you process the xml files and apply <g> wherever an algorithm can determine that the contents of <num> is not (explusively) a series of decimal digits?

@danbalogh : please correct me if I have made any mistake in my representation of our encoding rules.

@chloechollet and @salomepichon: please take note of the above if you weren't aware of the rules yet.

Problem with display of III

Hello Axelle,
J'ai un problème avec ce bout de code : sre tl<unclear>o</unclear>ṅ· <num value="3"><g type="numeral">III</g></num> .
Le display est :
Capture d’écran 2021-03-09 à 20 22 44

Or, il faudrait avoir III au lieu de II.

Checking the files after the update of the template

Dear all,

The template have been update is all the files. If possible, check the transformation carefully since all the files were not at the same stage of work. The process had to be not as systematic as the one done on other repository.

I have kept an archiving version of the all your files since I haven't save elements recorded in your <teiHeader>. I haven't migrated the data recorded in the <title> nor <idno> since their nature change depending the file.

The new <teiHeader> model has not been filled in. I can do it for you, but I need to you, if I should record all of you in the <respStmt> and under which role.

Please fill in the <revisionDesc>. I can write an XSLT to help you fill it in, if you feel confident in running them, or run it for you, if you give me the details of your work on the files.

Please note that the default value of the @xml:lang of the <div type="translation"> is English. Change the value "eng" in "fra", if you are translating your inscription into French. See the Appendix D and 9.3 of the Encoding Guide, if you have question on this matter.

Feel free to open issues for your questions or issues regarding this update.

Don't forget to git pull
Best,

revise K1298.xml

I have just made a commit that involves a correction to the txt file for K. 499 and comments on the xml file for K. 1298. See https://github.com/erc-dharma/tfc-khmer-epigraphy/commit/49113ae3e36323f1357067a01a4b316a00aaf489 and https://github.com/erc-dharma/tfc-khmer-epigraphy/blob/master/texts/xml-provisional/Kunthea/K1298.xml.

@chhomkunthea : I have inserted a number of comments <!-- argr: ... --> into your recently committed version of K1298.xml. I have found no serious errors of encoding, but note that you take only the bare minimum from the source txt file. You need from now on to reproduce all philologically relevant observations from the txt files, and you need to encode variant readings systematically. That is a high priority. It will also be good to start making a habit of encode translations and bibliography

@dominiquesoutif : nous allons avoir besoin de renseignements plus systématiques sur le membre du CIK qui a fait le travail dans chaque fichier .txt dit "vérifié par CIK". La logique des éditions EpiDoc demande que soit rendu explicite la prise de responsabilité, et de fait @dg2018 et @arlogriffiths ont d'habitude signé les fichiers dont ils sont responsables. Je présume que c'était toi pour K. 1298. Serait-il possible que tu repasses le corpus txt et insères une indication du membre CIK responsable quand tu le sais et quand cela n'est pas encore indiqué dans le fichier? Cela alègera le travail pour Kunthea (et Chloé). Ne pas oublier de faire un git pull avant d'aborder ce travail, et un git add/commit/push après que tu l'auras bouclé!

@dg2018 et @dominiquesoutif : si vous aussi, vous pouviez aider à encadrer le travail de Kunthea en révisant de cette façon quelques fichiers xml, cela serait une contribution précieuse.

Merging subfolders

Dear all,

Now that we have setup stable file-naming conventions, time is ripe, it seems, to merge all individual subfolders (named by encoders), that is transfer the XML files they contain into "XML-provisional", delete the now empty subfolders, and rename "XML-provisional" simply as "xml".
I guess you will have no difficulty to find your files as the K number is contained in their IDs.
This will greatly facilitate the management of github for Axelle.

Dear Axelle,
I guess it the easy way is that you take care of this action?

Dear Arlo,
Have we you got your green light?

Yours.

Manu

Validation error on K.268

Dear all,

there is a validation error in the K.268, line 153 of the XML, regarding a <seg> with only its closing tag: could you please add the opening one or deleting this lonely closing one depending of your encoding intent ?
<l n="1"><lb n="35"/>rājāṇāṁ dyu <gap reason="lost" quantity="2" unit="character"/></seg> vendrā</l>

ZST issue : Bourdonneau+al_

Dear @salomepichon,

Could you correct your use of Bourdonneau+al_ in the following files:

tfc-khmer-epigraphy/texts/xml-provisional/Salome/DHARMA_INSCIK00183-1.xml
tfc-khmer-epigraphy/texts/xml-provisional/Salome/DHARMA_INSCIK00183-2.xml
tfc-khmer-epigraphy/texts/xml-provisional/Salome/DHARMA_INSCIK00183-3.xml
tfc-khmer-epigraphy/texts/xml-provisional/Salome/DHARMA_INSCIK00188.xml

The reference doesn't match any data in Zotero.
Many thanks

Best,

Wrong encoding inside K. 1247 ?

Dear @chhomkunthea,

I found this strange encoding on K. 1247 at the xml line 135:

<lb n="B17"/>der· tai <num value="2">II</num> ku rok· <num value="1">Iseg</num> pau tai

Should seg really be inside the <num>tag? Can you check it please?

Best,

DHARMA_INSCIK00121 - Zotero Short Title Issue

Dear @chhomkunthea,

In your edition of K 121, you have used the ZST Finot1904_02 but it matches 2 entries in Zotero: Nécrologie: prosper Odend'hal & Notes d'épigraphie: VIII. Inscription de Prah That Kvan Pir. Can you check this please?

Thanks

K0080.xml

Dear @chhomkunthea,

There is a version conflict on your file K0080.xml. It starts with such a pattern <<<<<<< HEAD and says that 2 versions of the same part of the file coexist. Can you check it and keep only the one you want in the file?
Note that there are currently 2 conflicts in the file: one in the apparatus and one in the translation.
Thanks for your help

ZST issue : Cha-em Kaeokhlai

@chloechollet,

Could you add the ZST for <rdg source="bib:Cha-em Kaeokhlai">ssadhisaṇa</rdg> in tfc-khmer-epigraphy/texts/xml-provisional/Chloe/DHARMA_INSCIK01117.xml ?

Thank you,
Best,

revise K0003.xml

dear @chhomkunthea — I have just finished a first round of revision of reading and encoding this inscription, and pused the revised file (as well as a first set of changes to the txt file, although we'll have to update it systematically as we wrok through the xml edition together).

Please make all the changes that are required globally:

  • encoding of āryā with <l n="ab"/> and <l n="cd"/>
  • use of <space/> instead of underscore while avoiding unwanted typed spaces when <space/> falls in mid word

We will go through my other comments in the file during our first "Mekong" session on Monday. If I have time, I will add a draft English translation during the weekend.

Sending this message also to @dominiquesoutif and @dg2018 for their information.

Feedbacks on `*.txt`

Dear all,

I think I was able to align the plain text files.

The line endings have been converted in LF. Some CR might have survived my conversion routine. Let me know and I will convert the file.
Anyway avoid the mixed between, LF, CR and CRLF. Be careful while using your text editor.

Note that all the files in Doublons, Vacants and the K.1406 as well as K.1367 are in ASCII. I haven't change the format since the characters used are recognized (ASCII being a subset of UTF-8) and the use of the file won't necessitated such a precision in the encoding. It works so let's avoid creating new issue with it.

However some files still bear the sign of the forced conversion. When I could identify the original code of the character, I have ask for automatic changes, but in some cases, I couldn't. Could you look into:
K.315, K.364 and K.639 (same pattern)
K.1082
K.780 et K.937 (same pattern, not sure about the identification of the characters.)
K.1034

and let me know what you expect!

DHARMA_INSCIK00003 - missing @source on reading

dear @chhomkunthea,

In your edition of K. 3, you have a reading without the @source, as per EGC it is required. Can you check if it can be added please?

 <app loc="5">
                    <lem>śrīy<unclear cert="low">ā</unclear> me</lem>
                    <rdg>śrī <seg met="+"><gap reason="lost" quantity="1" unit="character"/></seg> me</rdg>

Thanks

Validation error on K 1029

Dear @chhomkunthea,

This validation error can be settled easily too on K. 1029. You need to add @reasonon <supplied>.
For XML line 91

<p n="5-7">Voici ce qu’il <supplied>le roi</supplied> donne du kanloṅ du palais à Mahāvikrāntakesarī et aux serviteurs qui sont <foreign>madhyamaparicāraka</foreign> :</p>

Best,

encoding script class and maturity

Dear @chhomkunthea, @chloechollet and @salomepichon !

I have only just become aware that the specification in the revised EGD under "7.5.5. Script" does not yet seem to have been implemented in our xml files yet.

I suppose in almost all cases a single script class and maturity can be defined for an entire inscription and hence be encoded in <div type="edition">. So I can ask @michaelnmmeyer to make a global change to all our files. (I still need to tell him which values to use.) But I will need from you a list of any biscript inscription or inscription that use a foreign script (Siddhamatrika for instance).

Thanks.

Arlo

Validation error on K. 338

Dear all,

<unclear> can only contain <g> element. The encoding line 88 of the XML in the file K. 338, also identified as <lb n="29"/> isn't conform to Epidoc TEI model : <unclear>tgap· <num value="1">I</num></unclear>.
Could you please resolve this?

Question on K.1428

Dear @chloechollet,

Some changes have been done regarding the encoding, especially on <space>used with @unit='line | page'. The <space>used in K.1428 don't have attribute. Could you check them please?

Let me know if you need help
Best

DHARMA_INSCIK00328 - Missing Lem

Dear @chhomkunthea,

In your edition K. 328, the apparatus entry needs a <lem>to be valid. Can you add one please?

<app loc="7">
                     <note>Après sruk Saveros Pou note <gap reason="lost" quantity="3" unit="character"/>, pourtant, tous les caractères sont lisibles et il semble que rien n'était inscrit après ce mot.</note>
                 </app>

Thanks

Display problem in K0178.xml

  • code: kantrum<unclear>·</unclear><g type="circleMed">.</g> niya<choice><sic>n</sic><corr>m</corr></choice>a
  • display:

Capture d’écran 2021-01-19 à 15 56 55

- Can you sort this out @ajaniak ?

DHARMA_INSCIK00008- Zotero Short Title Isssue

dear @chhomkunthea,

Can you check the Zotero Short Title Malleret1963_01? It matches 2 entries in Zotero Library Notes archéologiques, II: Pierres gravées et cachets de divers pays du Sud-Est de l'Asie; III: Classification et nomenclature des « perles » archéologiques en fonction de la symétrie minérale and L'archéologie du delta du Mékong.

Thanks

use of encoding log

Dear all!

Note that the encoding log was converted to .md format by Axelle a few weeks ago, to give the display you can see here. Just open the md file https://github.com/erc-dharma/tfc-khmer-epigraphy/blob/master/texts/CIK_encoding_log.md in oXygen and fill it in as you were previously doing in txt. The txt file has been moved to the archive.

@dg2018 : I have incorporated your four log entries into the .md file before archiving the .txt.

@chloechollet and @chhomkunthea : if you have been adding entries to the .txt rather than the .md in recent weeks, then we have to check if all your entries have been correctly applied also ion the md. But I don't recall seeing any changes to the .txt in your recent commits.

Please respond with a simple "ok" if you have seen this and require no further guidance.

proper sign for virāma

I see that some (or all?) of Chloé's files use ∙ instead of · (keyboard ABC extended option+shift+9) for virāma.

Please correct all cases in your files. Best to do so one by one, because • may also occur as punctuation sign. At least it does in the files of some other team members.

Please respond to this issue with "done" when you're done.

replace "dDanda" by "ddanda"

@michaelnmmeyer — could you please make a global replacement of the <g> token "dDanda" by "ddanda"?

If you find other tokens in tfc-khmer-epigraphy that seem not to be compliant with the Symbol Taxonomy, as it stands now, please furnish a list.

missing ) at juncture between <unclear> and <seg>

This concerns K1176.xml.

  • code: <l n="d"><unclear>nīla</unclear><seg met="=="><gap reason="lost" quantity="2" unit="character"/></seg> śi<unclear>lā</unclear>may<unclear>ī</unclear></l>
  • display:

Capture d’écran 2021-01-29 à 12 07 02

@ajaniak : I expect a ")" after nīla. Is there a bug in the transformation code?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.