Fork of https://github.com/oxigraph/oxigraph.git for the purpose of NextGraph project
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
oxigraph/testsuite/serd-tests/good/UTF-8.ttl

220 lines
14 KiB

@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
<> rdfs:comment """
UTF-8 encoded sample plain-text file
Markus Kuhn [ˈmaʳkʊs kuːn] <http://www.cl.cam.ac.uk/~mgk25/> 2002-07-25
The ASCII compatible UTF-8 encoding used in this plain-text file
is defined in Unicode, ISO 10646-1, and RFC 2279.
Using Unicode/UTF-8, you can write in emails and source code things such as
Mathematics and sciences:
Eda = Q, n , f(i) = g(i),
a²+b³
x: x = x, α ¬β = ¬(¬α β),
c
,
< a b c d (A B),
a-b
2H + O 2HO, R = 4.7 kΩ, 200 mm i=1
Linguistics and dictionaries:
ði ıntəˈnæʃənəl fəˈnɛtık əsoʊsiˈeıʃn
Y [ˈʏpsilɔn], Yen [jɛn], Yoga [ˈjoːgɑ]
APL:
((VV)=V)/V,V
Nicer typography in plain text files:
single and double quotes
Curly apostrophes: Weve been here
Latin-1 apostrophe and accents: '´` ║
deutsche Anführungszeichen
, , , , 34, , 5/+5, ,
ASCII safety test: 1lI|, 0OD, 8B
the euro symbol: 14.95
Combining characters:
STARGΛ̊TE SG-1, a = v̇ = r̈, a b
Greek (in Polytonic):
The Greek anthem:
Σ γνωρζω π τν κψη
το σπαθιο τν τρομερ,
σ γνωρζω π τν ψη
πο μ βα μετρει τ γ.
᾿Απ᾿ τ κκκαλα βγαλμνη
τν Ελλνων τ ερ
κα σν πρτα νδρειωμνη
χαρε, χαρε, ᾿Ελευθερι!
From a speech of Demosthenes in the 4th century BC:
Οχ τατ παρστατα μοι γιγνσκειν, νδρες ᾿Αθηναοι,
ταν τ᾿ ες τ πργματα ποβλψω κα ταν πρς τος
λγους ος κοω· τος μν γρ λγους περ το
τιμωρσασθαι Φλιππον ρ γιγνομνους, τ δ πργματ᾿
ες τοτο προκοντα, σθ᾿ πως μ πεισμεθ᾿ ατο
πρτερον κακς σκψασθαι δον. οδν ον λλο μοι δοκοσιν
ο τ τοιατα λγοντες τν πθεσιν, περ ς βουλεεσθαι,
οχ τν οσαν παριστντες μν μαρτνειν. γ δ, τι μν
ποτ᾿ ξν τ πλει κα τ ατς χειν σφαλς κα Φλιππον
τιμωρσασθαι, κα μλ᾿ κριβς οδα· π᾿ μο γρ, ο πλαι
γγονεν τατ᾿ μφτερα· νν μντοι ππεισμαι τοθ᾿ κανν
προλαβεν μν εναι τν πρτην, πως τος συμμχους
σσομεν. ν γρ τοτο βεβαως πρξ, ττε κα περ το
τνα τιμωρσετα τις κα ν τρπον ξσται σκοπεν· πρν δ
τν ρχν ρθς ποθσθαι, μταιον γομαι περ τς
τελευτς ντινον ποιεσθαι λγον.
Δημοσθνους, Γ ᾿Ολυνθιακς
Georgian:
From a Unicode conference invitation:
Unicode-
, 10-12 ,
. , .
Unicode-,
, Unicode-
, , ,
.
Russian:
From a Unicode conference invitation:
Зарегистрируйтесь сейчас на Десятую Международную Конференцию по
Unicode, которая состоится 10-12 марта 1997 года в Майнце в Германии.
Конференция соберет широкий круг экспертов по вопросам глобального
Интернета и Unicode, локализации и интернационализации, воплощению и
применению Unicode в различных операционных системах и программных
приложениях, шрифтах, верстке и многоязычных компьютерных системах.
Thai (UCS Level 2):
Excerpt from a poetry on The Romance of The Three Kingdoms (a Chinese
classic 'San Gua'):
[----------------------------|------------------------]
(The above is a two-column text. If combining characters are handled
correctly, the lines of the second column should be aligned with the
| character above.)
Ethiopian:
Proverbs in the Amharic language:
Runes:
(Old English, which transcribed into Latin reads 'He cwaeth that he
bude thaem lande northweardum with that Westsae.' and means 'He said
that he lived in the northern land near the Western Sea.')
Braille:
(The first couple of paragraphs of "A Christmas Carol" by Dickens)
Compact font selection example text:
ABCDEFGHIJKLMNOPQRSTUVWXYZ /0123456789
abcdefghijklmnopqrstuvwxyz £©µÀÆÖÞßéöÿ
œŠŸž ΑΒΓΔΩαβγδω АБВГДабвгд
<EFBFBD>ӥɐːאԱ
Greetings in various languages:
Hello world, Καλημρα κσμε,
Box drawing alignment tests:
""" .
<> rdfs:comment """
Two byte Unicode escape: \u00E0
Largest Unicode escape in Turtle: \U0010FFFF
""" .