this post was submitted on 22 Sep 2024
111 points (98.3% liked)


Have fun figuring out how to pronounce them though.

top 16 comments
[–] [email protected] 20 points 1 month ago (1 children)

[Shameless advertisement: we have a linguistics community, [email protected] . I'm the mod there; I apologise for the relative lack of activity nowadays, but everyone is welcome to post this sort of stuff there.]

What Morris Swadesh did was at the same time simpler and greater than that: he created a list of concepts likely to pop up across many different languages, regardless of their time period and area. This is extremely useful to track the relationship between multiple languages, even if you don't speak them.

I'm not sure if he created one for Proto-Indo-European; "Swadesh list" became a generic name for this sort of list, regardless of who compiles it. Plus Morris Swadesh's main interest was Amerindian languages.

[–] [email protected] 3 points 1 month ago

FWIW, the link says he was the initial compiler, but it was revised after his death.

[–] [email protected] 10 points 1 month ago* (last edited 1 month ago) (3 children)

What do all those asterisks and numbers mean? I feel like I'm missing a key to decode that.

Edit: Damn, that little post led to an actual TIL moment. I'm actually going to copy all your answers for future reference, since I enjoy etymology and come into contact with these symbols a lot. Thanks, everyone!

[–] [email protected] 7 points 1 month ago

Since Contramuffin answered most of it, I'll focus on the diacritics.

The acute in *ḱ *ǵ *ǵʰ shows that they aren't the same as *k *g *gʰ. Odds are that the ones with an acute were pronounced with the tongue a bit fronted (palatalised).

The acute over other consonants, plus in *é *ó, is something else entirely. It's the accent - you're supposed to pronounce those consonants with a higher pitch. "Yay, consistency" /s

The macron over *ē *ō is to show that the vowel is loooong.

Those floating ⟨ʰ⟩ refer to aspiration. Aspiration is that "puff" of air that you release when you say ⟨pill, till, kill⟩ but not when you say ⟨spill, still, skill⟩. In English this is not distinctive, but in a lot of other Indo-European languages it is, and the mainstream hypothesis is that it was distinctive in PIE itself.

In the meantime, a floating ⟨ʷ⟩ means that the consonant is pronounced with rounded lips. The difference between something like *kʷ and *kw is mostly that the first one behaves like a single consonant, the other as two.

That ring under some consonants is to highlight that they're syllabic, as if they were LARPing as vowels. It's a lot like writing "button" as "buttn̥".

If you ever see *ə₁ *ə₂ *ə₃ etc., pretend that that "ə" is "h". They're simply different ways to annotate the same stuff.

*H means "this is *h₁ or *h₂ or *h₃, but we have no clue on which".
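For anyone who wants the key above in a copy-pasteable form, here is a rough Python sketch of it. It only knows the marks described in this comment, the function name `explain` and the wording of each note are made up for illustration, and it treats an acute on k/g as palatalisation and an acute anywhere else as the accent, which is a simplification.

```python
import unicodedata

# Notes paraphrase the comment above; the names and wording are illustrative.
COMBINING_NOTES = {
    "\u0304": "macron: long vowel",
    "\u0325": "ring below: syllabic consonant",
}
MODIFIER_NOTES = {
    "\u02b0": "raised h: aspirated",
    "\u02b7": "raised w: pronounced with rounded lips",
}
SUBSCRIPT_DIGITS = "\u2081\u2082\u2083\u2084"


def explain(form):
    """List the notation marks found in a reconstructed form, in order."""
    notes = []
    base = ""  # most recent non-combining character
    # NFD splits precomposed letters into base letter + combining marks.
    for ch in unicodedata.normalize("NFD", form):
        if ch == "*":
            note = "asterisk: reconstructed, not directly attested"
        elif ch == "\u0301":  # combining acute
            if base in ("k", "g"):
                note = "acute on k/g: palatalised"
            else:
                note = "acute elsewhere: accent"
        elif ch in COMBINING_NOTES:
            note = COMBINING_NOTES[ch]
        elif ch in MODIFIER_NOTES:
            note = MODIFIER_NOTES[ch]
        elif ch in SUBSCRIPT_DIGITS:
            note = "subscript number: one of the numbered h-like sounds"
        elif ch == "H":
            note = "H: one of the numbered sounds, unknown which"
        else:
            note = None
        if not unicodedata.combining(ch):
            base = ch
        if note and note not in notes:
            notes.append(note)
    return notes


if __name__ == "__main__":
    for line in explain("*k\u02b7etw\u00f3res"):
        print(line)
```

Normalising to NFD first means it should not matter whether the text you paste uses precomposed letters or separate combining marks.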

[–] [email protected] 14 points 1 month ago* (last edited 1 month ago) (1 children)

Asterisk means that the word has been reverse engineered without any direct evidence backing it up. All proto languages will have asterisks in front of all their words because proto languages are, by definition, languages that were used before anything was written down.

The reverse engineered word is likely to be correct (or at least, as correct as we can be), but in the absence of direct evidence, it's still just guesswork

The numbers you're talking about are because we know that there are different consonants used, but we don't entirely know what sounds those consonants are. So we just write all of the consonants that likely sounded somewhat like the letter h as h1, h2, h3, etc., and repeat for the other uncertain consonants.

So basically h1 definitely sounds different than h2, but as for exactly what they sound like, all we know is that both of them are kinda like h

[–] [email protected] 10 points 1 month ago (1 children)

This is mostly correct so I'll focus on small specific details, OK?

Asterisk means not directly attested. In reconstructions it goes as you say, but you'll also see them before things that you don't expect speakers to use, in synchronic linguistics; for example *me apple eat gets an asterisk because your typical English speaker wouldn't use it.

It is kind of "guesswork" but it follows a very specific procedure, called the comparative method. As in, it is not an "anything goes".

The sounds represented as *h₁, *h₂, *h₃, *h₄ and *H do not necessarily sound like [h]. At this point they're simply part of the notation. For example, a common hypothesis is that *h₁ was [ʔ], which is more like the sound in the middle of "uh-oh" than like [h]. And some argue that the symbols don't even stand for the sounds themselves, but rather for the effect of the sounds on descendant words (the difference is important because, if two sounds had the same effect, they ended up with the same symbol).

[–] [email protected] 2 points 1 month ago

Thanks for the details! I forgot some of the details since I last took a course in linguistics like almost 10 years ago

[–] [email protected] 3 points 1 month ago

The asterisks mean that it's a reconstructed word, not an actually attested word. No PIE writing exists, so all these words are deduced from the descendant words across the vast IE family, not recorded as definitely existing.

The numbers are h1, h2 and h3. We know that PIE had three different h-like sounds (usually called laryngeals). But, funny thing, almost every daughter language lost them (Hittite and its Anatolian relatives kept some traces), so it's hard to tell exactly what each one sounded like. If you see overlapping ripples in a pond, you can work out roughly where and how many things dropped, but you can't guess the exact weight or size.

[–] [email protected] 25 points 1 month ago (4 children)

It is INCREDIBLE how many of these are basically still in use today.

I (1sg) *éǵh₂ == german "ich"

you (2sg) *túh, *te == french "tu" russian ты

we (1pl) *wéy == english

you (2pl) *yū, *yú

who *kʷis (pron.), *kʷod (adj.) == latin "quis, quod"

one *(H)óynos, *(H)óykos, *(H)óywos, *sḗm == spanish "uno"

two *dwóh₁; *dwó == french "deux"

three *tréyes == spanish "tres"

four *kʷetwóres == french "quatorze"... 🤔

I'm no linguist, just uninformed observation.

[–] [email protected] 2 points 1 month ago

We Finns don't even speak an Indo-European language, and we still use some words that clearly come from PIE.

The word for sea is basically exactly the same, depending on the pronunciation. We say "meri", it's marked down as "móri". In Finnish yellow is "keltainen" and PIE says "ǵʰelh₃-".

[–] [email protected] 3 points 1 month ago* (last edited 1 month ago) (1 children)

You forgot the proto-indo-european example for "I'm no linguist, just uninformed observation"

[–] [email protected] 5 points 1 month ago* (last edited 1 month ago)

four *kʷetwóres == french “quatorze”… 🤔

Close: it's French quatre (4), not "quatorze" (14). It goes like this: PIE *kʷetwóres → Latin ⟨quattuor⟩ /kʷattuor/ → Old French ⟨quatre, catre⟩ /kʷatɾə/~/katɾə/ → contemporary French ⟨quatre⟩ /katʁ(ə)/.

French ⟨quatorze⟩ does contain that *kʷetwóres, but it's only the "quator-". The "-ze" is from Proto-Indo-European *déḱm̥ (10). This gets easier to see in Latin, as the word for 14 was ⟨quattuordecim⟩ (literally four-ten).

Note that almost all English words that you used to translate the PIE words are also examples of those PIE words being still in use nowadays - they're direct descendants, for example *kʷis → who, *éǵh₂ → I, etc. In English, German, Swedish and other Germanic languages, this gets a bit obscured due to some old sound change called Grimm's Law. (EDIT: the only exception is the second line - *túh, *te became "thou, thee".)
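To make the Grimm's Law bit concrete, here is a very simplified Python sketch that only looks at the first consonant of a PIE form and returns what it is expected to turn into in Germanic. The table covers just the textbook stop correspondences, the function name `shift_initial` is hypothetical, and it ignores everything else (the palatal series written with an acute, Verner's Law, vowels, later changes inside English), so treat it as an illustration rather than a real reconstruction tool.

```python
# Textbook Grimm's Law correspondences for stops (simplified).
GRIMM = {
    "k\u02b7": "hw",  # *kw (labialised) -> hw
    "p": "f",
    "t": "th",
    "k": "h",
    "b": "p",
    "d": "t",
    "g": "k",
    "b\u02b0": "b",   # aspirated *bh -> b
    "d\u02b0": "d",   # aspirated *dh -> d
    "g\u02b0": "g",   # aspirated *gh -> g
}


def shift_initial(pie_form):
    """Return the expected Germanic outcome of the word-initial stop, or None."""
    word = pie_form.lstrip("*")
    # Try two-character consonants (such as kw or bh) before single letters.
    for length in (2, 1):
        outcome = GRIMM.get(word[:length])
        if outcome is not None:
            return outcome
    return None


if __name__ == "__main__":
    for form in ("*tr\u00e9yes", "*dw\u00f3h\u2081", "*k\u02b7is"):
        print(form, "->", shift_initial(form))
```

So *tréyes gives th- (three), *dwóh₁ gives t- (two), and *kʷis gives hw- (Old English hwā, modern "who"), while Latin and its descendants keep the older consonants.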

[–] [email protected] 7 points 1 month ago (1 children)

In every language, most of the words that most people use have been in use for a very long time; we just pronounce them a bit differently.

*(H)óynos, *dwó, *tréyes for example aren't just uno, deux and tres, they are 1, 2 and 3 in English, French, German, Italian, Greek, Russian, Hindi, Farsi, Kurdish, Tajik etc. Literally the same word, just spoken by different groups of descendant speakers.

Some languages have undergone sound changes that make certain words sound more or less similar to how we think they sounded in PIE.

So even though four, vier, quattuor and tessera sound quite different to us, they are all basically just how we say *kʷetwóres.

[–] [email protected] 3 points 1 month ago (1 children)

1 in Russian is один, I think it's quite different from one/uno/un (especially since the о is pronounced а). 2 and 3 are instead extremely similar (два три). Does it actually still come from the same root?

While not being competent in this subject, I found it very fascinating that Finno-Ugric languages (which are not Indo-European AFAIK) like Finnish or Estonian are so wildly different, so that 1, 2 and 3 are üks, kaks, kolm (in Estonian), for example.

[–] [email protected] 2 points 1 month ago

That example does sound quite different, but Wiktionary has it as from the Proto-Slavic *(j)edinъ, which is "ultimately from *h₁óy(H)nos."

Finnish or Estonian are so wildly different

I know, right?! I strongly remember that the first Uralic language example I heard was the Finnish for Merry Christmas: hyvää joulua - it just sounded so different.