jonbitgood / transcription-blackfoot-matthew
The transcribed Blackfoot Gospel of Matthew (1890) in USFM
Home Page: https://dbs.org
License: Creative Commons Attribution Share Alike 4.0 International
Hi, this is not yet USFM text, though you clearly aimed at the right thing. So far you have tagged only the chapter markers and a couple of others.
None of the verses have been tagged appropriately. I do not know Blackfoot, but I wonder whether the capitalised bits are titles; if so, they should be tagged too.
Please see below: there are a number of inappropriate characters in the files. Some brackets open with a curly bracket and close with a round one, there is a stray square bracket, and there are also Cyrillic characters, lots of control characters, etc.
Columns: character, code point, count, script, Unicode name.
(space) U+0020 10198 Common SPACE
! U+0021 24 Common EXCLAMATION MARK
' U+0027 7784 Common APOSTROPHE
( U+0028 8 Common LEFT PARENTHESIS
) U+0029 11 Common RIGHT PARENTHESIS
, U+002C 1943 Common COMMA
- U+002D 15 Common HYPHEN-MINUS
. U+002E 864 Common FULL STOP
: U+003A 257 Common COLON
; U+003B 331 Common SEMICOLON
? U+003F 170 Common QUESTION MARK
A U+0041 294 Latin LATIN CAPITAL LETTER A
B U+0042 33 Latin LATIN CAPITAL LETTER B
C U+0043 33 Latin LATIN CAPITAL LETTER C
D U+0044 19 Latin LATIN CAPITAL LETTER D
E U+0045 43 Latin LATIN CAPITAL LETTER E
G U+0047 33 Latin LATIN CAPITAL LETTER G
H U+0048 26 Latin LATIN CAPITAL LETTER H
I U+0049 129 Latin LATIN CAPITAL LETTER I
J U+004A 285 Latin LATIN CAPITAL LETTER J
K U+004B 710 Latin LATIN CAPITAL LETTER K
L U+004C 1 Latin LATIN CAPITAL LETTER L
M U+004D 79 Latin LATIN CAPITAL LETTER M
N U+004E 201 Latin LATIN CAPITAL LETTER N
O U+004F 87 Latin LATIN CAPITAL LETTER O
P U+0050 104 Latin LATIN CAPITAL LETTER P
Q U+0051 1 Latin LATIN CAPITAL LETTER Q
R U+0052 11 Latin LATIN CAPITAL LETTER R
S U+0053 195 Latin LATIN CAPITAL LETTER S
T U+0054 119 Latin LATIN CAPITAL LETTER T
U U+0055 15 Latin LATIN CAPITAL LETTER U
W U+0057 3 Latin LATIN CAPITAL LETTER W
X U+0058 2 Latin LATIN CAPITAL LETTER X
Y U+0059 1 Latin LATIN CAPITAL LETTER Y
Z U+005A 12 Latin LATIN CAPITAL LETTER Z
[ U+005B 1 Common LEFT SQUARE BRACKET
a U+0061 12554 Latin LATIN SMALL LETTER A
b U+0062 95 Latin LATIN SMALL LETTER B
c U+0063 61 Latin LATIN SMALL LETTER C
d U+0064 99 Latin LATIN SMALL LETTER D
e U+0065 2370 Latin LATIN SMALL LETTER E
f U+0066 3 Latin LATIN SMALL LETTER F
g U+0067 15 Latin LATIN SMALL LETTER G
h U+0068 483 Latin LATIN SMALL LETTER H
i U+0069 15064 Latin LATIN SMALL LETTER I
j U+006A 1 Latin LATIN SMALL LETTER J
k U+006B 9986 Latin LATIN SMALL LETTER K
l U+006C 205 Latin LATIN SMALL LETTER L
m U+006D 4041 Latin LATIN SMALL LETTER M
n U+006E 6047 Latin LATIN SMALL LETTER N
o U+006F 6519 Latin LATIN SMALL LETTER O
p U+0070 4579 Latin LATIN SMALL LETTER P
r U+0072 292 Latin LATIN SMALL LETTER R
s U+0073 12996 Latin LATIN SMALL LETTER S
t U+0074 10981 Latin LATIN SMALL LETTER T
u U+0075 6061 Latin LATIN SMALL LETTER U
v U+0076 26 Latin LATIN SMALL LETTER V
w U+0077 177 Latin LATIN SMALL LETTER W
x U+0078 1534 Latin LATIN SMALL LETTER X
y U+0079 571 Latin LATIN SMALL LETTER Y
z U+007A 21 Latin LATIN SMALL LETTER Z
{ U+007B 2 Common LEFT CURLY BRACKET
ã U+00E3 1 Latin LATIN SMALL LETTER A WITH TILDE
æ U+00E6 14 Latin LATIN SMALL LETTER AE
ă U+0103 228 Latin LATIN SMALL LETTER A WITH BREVE
Ĕ U+0114 1 Latin LATIN CAPITAL LETTER E WITH BREVE
ĕ U+0115 364 Latin LATIN SMALL LETTER E WITH BREVE
Ĭ U+012C 8 Latin LATIN CAPITAL LETTER I WITH BREVE
ĭ U+012D 5049 Latin LATIN SMALL LETTER I WITH BREVE
ō U+014D 1 Latin LATIN SMALL LETTER O WITH MACRON
Ŏ U+014E 11 Latin LATIN CAPITAL LETTER O WITH BREVE
ŏ U+014F 1499 Latin LATIN SMALL LETTER O WITH BREVE
ś U+015B 1 Latin LATIN SMALL LETTER S WITH ACUTE
Ŭ U+016C 39 Latin LATIN CAPITAL LETTER U WITH BREVE
ŭ U+016D 2143 Latin LATIN SMALL LETTER U WITH BREVE
Ǐ U+01CF 1 Latin LATIN CAPITAL LETTER I WITH CARON
Ǔ U+01D3 1 Latin LATIN CAPITAL LETTER U WITH CARON
Ӑ U+04D0 1 Cyrillic CYRILLIC CAPITAL LETTER A WITH BREVE
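An inventory like the one above can be produced with a few lines of Python. This is only a minimal sketch, not the tool that generated the listing: the function names `char_inventory`, `guess_script`, and `flag_suspicious` are hypothetical. The standard `unicodedata` module does not expose the Unicode Script property, so the script column is approximated here from the first word of the character name.

```python
import unicodedata
from collections import Counter


def guess_script(ch):
    """Approximate the script from the character name (an assumption:
    unicodedata has no Script property, so this is only a heuristic)."""
    name = unicodedata.name(ch, "")
    if name.startswith("LATIN"):
        return "Latin"
    if name.startswith("CYRILLIC"):
        return "Cyrillic"
    return "Common"


def char_inventory(text):
    """Return (char, code point, count, script, name) rows sorted by code point."""
    counts = Counter(text)
    return [
        (
            ch,
            f"U+{ord(ch):04X}",
            counts[ch],
            guess_script(ch),
            # Control characters have no name, so supply a placeholder.
            unicodedata.name(ch, "<unnamed>"),
        )
        for ch in sorted(counts)
    ]


def flag_suspicious(text):
    """Rows for Cyrillic characters and for control characters other than
    tab, newline, and carriage return."""
    return [
        row
        for row in char_inventory(text)
        if row[3] == "Cyrillic"
        or (unicodedata.category(row[0]) == "Cc" and row[0] not in "\t\n\r")
    ]


if __name__ == "__main__":
    sample = "a\u04d0a\x07"  # two Latin a, one Cyrillic A with breve, one BEL
    for row in flag_suspicious(sample):
        print(*row[1:])
```

Running something like this over each file before committing would make look-alike characters such as U+04D0 (which resembles Latin Ă) easy to spot.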
This text is from 1890, so adding a license to it when the text is in the public domain is probably not appropriate. The effort of proofreading, while laudable, is not copyrightable, and the markup is so limited that it has little or no copyrightable value.