-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disambiguating <emph> into multiple taggings #7
Comments
Whereas <lb n="161744"/>admiration of Rossini's <emph>Stabat Mater</emph>, a work simply abounding in Is just an instance of |
I like the embedded |
Is it worth indicating that the bare words of such titles as encountered in the episode are also in Latin &c.? I’m looking through ‘Proteus’ here. On an earlier pass for <lb n="030167"/> […] But he must send me <foreign xml:lang="fr">La Vie de Jésus</foreign> by M. Léo Taxil.
<lb n="030196"/>[…] Rich booty you brought back; <foreign xml:lang="fr">Le
<lb n="030197"/>Tutu</foreign>, five tattered numbers of <foreign xml:lang="fr">Pantalon Blanc et Culotte Rouge</foreign>; Gotcha moment aside (!), this is valuable information that we don’t want clipped in the shift to <lb n="030167"/> […] But he must send me <title type="book" xml:lang="fr">La Vie de Jésus</title> by M. Léo Taxil. I note, in passing, that there’s also a case to be made for marking up the remainder of the sentence as |
I like that syntax of embedding the language in the tag. Let's do it. And thanks for catching those mistakes! I've just corrected them, using your suggested syntax. |
I was just finishing the <p><lb n="081039"/>He hummed, prolonging in solemn echo the closes of the bars:
<lb n="081040"/><said who="Leopold Bloom">―<foreign xml:lang="it">Don Giovanni, a cenar teco
<lb n="081041"/>M'invitasti.</foreign></said></p>
[...]
<lb n="081051"/><said who="Leopold Bloom">―<foreign xml:lang="it">A cenar teco.</foreign></said></p>
<p><lb n="081052"/>What does that <foreign xml:lang="it">teco</foreign> mean? Tonight perhaps.
<lb n="081053"/><said who="Leopold Bloom">―<emph>Don Giovanni, thou hast me invited
<lb n="081054"/>To come to supper tonight,
<lb n="081055"/>The rum the rumdum.</emph></said></p>
<p><lb n="081056"/>Doesn't go properly.</p> Really, these instances of <lb n="010005"/><said who="Buck Mulligan">―<foreign xml:lang="la">Introibo ad altare Dei.</foreign></said></p> read <lb n="010005"/><said who="Buck Mulligan">―<quote xml:lang="la">Introibo ad altare Dei.</quote></said></p> I’m happy to make these changes, but I wanted to run the proposal by the group first. I’m sure if we make our encoding decisions clear in the README, tools like your foreign-language analysis can be tailored to catch non-English quotations, right, Jonathan? |
This sounds great. I think |
And yep, this won't make too much of a difference in analyses, since we can just look for |
Here is an interesting example of typographic distinction opening up into multiple possibilities for tagging:
@JonathanReeve switched the inherited
<emph>
tagging for a<foreign xml:lang="it">
. But the italics also render a quotation (not that every quotation is so distinguished!). Gifford has:Is this then
Are there other examples in this vein?
The text was updated successfully, but these errors were encountered: