It’s hard to run a business, but the hard part is not honoring your own stupid promotional gimmicks instead of using them to insult your customers and make them feel like complete idiots.
Please fix the bug where trying to connect my account to a Google account just leads me to a completely blank page http://news.ycombinator.com/openid_merge with the word "Unknown." (Sorry, didn't understand where to report bugs.)
URL detection is tricky because people omit the “http://” part of URLs. It used to be that you could look for stuff beginning with “www.”, but nobody uses “www.” anymore, so now you basically have to have a list of TLDs and look for anything ending in “.<tld>”. Now that everybody will be able to register their own TLD, will URL detection even be feasible for humans any more? I’ve been thinking about how we’re going to solve this problem for a while, because we need some kind of notation to replace “.<tld>”. I mean I don’t think people will go back to writing <http://coca-cola>, so when I saw the title of this post I immediately knew what the idea was: let’s write //coca-cola to denote “the URL coca-cola”. (Unfortunately, both Google Chrome and Safari interpret “//coca-cola” as “file:///coca-cola”.)
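To make the TLD-list approach concrete, here is a rough Python sketch. The tiny `KNOWN_TLDS` set and the function name `find_bare_urls` are made up for illustration; a real detector would need the full (and growing) list of TLDs. It shows why a dotless name like coca-cola slips through.

```python
import re

# Hypothetical, deliberately tiny TLD list (an assumption for illustration).
KNOWN_TLDS = {"com", "org", "net", "dk"}

# A token made of two or more dot-separated labels, e.g. "example.com".
TOKEN = re.compile(r"[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")

def find_bare_urls(text):
    """Return dotted tokens whose last label is in KNOWN_TLDS."""
    hits = []
    for match in TOKEN.finditer(text):
        token = match.group(0)
        tld = token.rsplit(".", 1)[1].lower()
        if tld in KNOWN_TLDS:
            hits.append(token)
    return hits
```

A bare "coca-cola" has no ".<tld>" suffix to key on, so this kind of heuristic never sees it, which is the problem described above.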
I don’t know why everybody is talking about the technical meaning of //coca-cola, as that’s kind of irrelevant. We web developers will of course still be typing the full http://coca-cola, as we always have.
I thought this might be an issue when they did generic TLDs, but as far as I can see they are only selling to registrars, so they are not expecting web sites at the top level, there will still be a subdomain, and browsers will probably not go to the top level domains.
Interestingly, http://dk/ has a valid A record to 193.163.102.24 which curl will show is a 301 redirect, but you cannot load it in Chrome or Firefox. dk has MX records too (I remember an old story about someone who had the email address m@dk or similar), but I think generally having any records for the top level is frowned upon now.
Short answer: the latter is more true. The reason why {mi} is unmarked for number is simply that it’s defined that way.
In Lojban, everything is unmarked for number by default. It’s actually quite rare to see things explicitly marked for number, as it’s usually either irrelevant or obvious from context.
The pronoun {mi} is technically unmarked for number, but is restricted to refer to people that the speaker represents, just as you guessed. For example, it would usually be weird or incorrect to use {mi} to mean “we” in the sense of “me and you”, since representing the very people you are talking to is a rare situation — although theoretically you could come up with examples where it would make sense.
So in practice {mi} is usually singular. On the other hand, {do} (which means “you”) is as often plural as it is singular.
There are other pronouns that mean “me and you”, “me and others”, “you and others”, and “me, you, and others” — respectively, {mi’o}, {mi’a}, {do’o} and {ma’a} — which is another reason why the need for plural {mi} seldom arises.
Thanks. I figured grammatical # as such would be discarded as unnecessary but the notion of plurality raises semantic issues that natural language sidesteps by ambiguity.
Disclaimer/Personal Background (trying to be brief): I've often been told I've got an unusual cognitive style (for lack of a better term) and I've often felt very much as if there's an impedance mismatch between how my thoughts are structured and how language operates; in essence, at the word-or-sentence level everything I hear or read is very polyvalent and vague, and only takes on a concrete meaning to me if I get multiple paraphrases of it... putting all the variants into superposition and seeing which parts reinforce or cancel is what shows me the contour of the actual meaning (which itself is not necessarily ever actually "represented in words" so much as "gets the outline of its semantic boundaries painted").
In the abstract this leaves me with an interest in the idea of something like lojban but very mixed initial reactions: it's possible an artificial language with more-precise meanings would eliminate my need for doing verbal interferometry across multiple paraphrases, but on the other hand I have a lifetime's experience feeling very uncomfortable without tons of redundancy and repetition-with-alteration, which seems to be what lojban is trying to eliminate in its use.
Too much info, I'll stop there.
I do have two more questions if you have time.
#1 is historical: what's the process by which the core sets of things like spatial relationships or tenses or shapes or so on came to be enumerated?
EG: if I were doing a language in this form I'd go through all the languages I could get my hands on and try to get good lists of all their fundamental categories (eg: spatial prepositions and "classifiers", like you have in Swahili and Chinese (+ languages with heavy Chinese contact); cf: http://www.jstor.org/pss/413103 ) and then try to factor them into semantic atoms. I'd consider this approach bottom-up (see what's out there, and then try and simplify and unify them) and contrast it with a more top-down approach (trying to derive a finite set of spatial relations ab initio via pure reasoning); it'd also be a good set of "unit tests" for your final set of core concepts, making sure that every one of these words' senses really is expressible in terms of your base concepts.
How did the lojbanists derive their tenses / spatiotemporal prepositions / etc.? Is there a good "history of the design of lojban" that speaks to this?
Question #2: at a practical level how would you decompose "There are dogs in the kitchen" into lojban?
If I had to break it into predicates it'd probably be the conjunction:
- T ~ whatever containment type you have that is ~ "contains within its spatial bounds -- but not structurally -- for an indeterminate time period"
- COUNT(E) > 0
- ENTITY-COLLECTION-TYPE(E,X), where X ~ "collection treated as collection due to spatiotemporal circumstance and descriptive convenience" (EG: E is an entity collection b/c there are label(s) they all share, namely being instance-of dog and contained-in-the-kitchen in the same way; there's no assertion of any other source of entity-identity beyond the circumstances this utterance is describing; contrast to say "baseball team" or "deck of cards", etc., which are entity-collections with a more-persistent and "intentional" identity)
- forall e in E IS-INSTANCE-OF("instance-of-type IoT",e,"dog")
- "instance-of-type" ~ whatever instance-of you have that is ~ "is a concrete instantiation of an abstract type not otherwise specified" (eg: an actual 'dog', not 'Pomeranian')
- + some temporal modifier to explain like "the described circumstance started before I made this utterance and I do not think it has ceased, yet"
...but I'd assume some of the intended distinctions are usually left implicit or inferred; what's a good lojban decomposition?
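For what it's worth, the conjunction above can be roughed out in Python. This is only a sketch under my own assumptions: the `Entity` class, its `kind` and `location` fields, and the function name are hypothetical, and the temporal modifier and the finer containment distinctions are left out entirely.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Entity:
    kind: str      # abstract type the entity instantiates, e.g. "dog"
    location: str  # place that currently contains the entity

def dogs_in_kitchen(world):
    """True if some non-empty collection E of dogs is contained in the kitchen."""
    # E is a collection only by circumstance: its members share a type
    # (instance-of "dog") and a containment relation (in the kitchen).
    E = [e for e in world if e.kind == "dog" and e.location == "kitchen"]
    return len(E) > 0  # COUNT(E) > 0
```

Note how much of the original list (collection type, instance-of type, tense) simply disappears here, which is roughly the "left implicit or inferred" part.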
Something similar was done, but it was explicitly recognised that the purpose of lojban was not to generate "the semantic primes of language." Such an exercise is regarded by some linguists as meaningless, and by others as too difficult. Instead, concepts were listed, and from them a "covering set" was extracted. Similarly tenses, both spatial and temporal.
After the concepts were agreed, each was expressed in each of the (then) six major world languages. The words thus obtained were put through a weighting algorithm to try to find a "word" that had components of each, and that became the lojban word for that concept.
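To give a feel for what such a weighting step could look like, here is a toy Python sketch. It is not the actual lojban procedure: scoring by longest common subsequence, the per-language weights, and all function names are my own assumptions for illustration.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of strings a and b."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]

def score(candidate, source_words):
    """Weighted sum of how much of each (non-empty) source word the candidate shares."""
    return sum(weight * lcs_length(candidate, word) / len(word)
               for word, weight in source_words)

def best_candidate(candidates, source_words):
    """Pick the candidate with the highest weighted overlap score."""
    return max(candidates, key=lambda c: score(c, source_words))
```

Here `source_words` is a list of `(word, weight)` pairs, one per source language, with the weight standing in for however much influence that language is given.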
Thanks for the response, I do appreciate it. I wasn't aware of how active lojban still is (and it's much more accessible to get information on thanks to the internet).
I should point out that I'm fairly familiar with the general range of opinion in the linguistics community (as an undergrad I did dual math / linguistics, which made me at that time quite the rara avis, though it's more common now apparently).
Generally I don't give much credence to the idea of semantic primes (at all, not just in some pragmatic sense) but for stuff like spatial relationships + tenses (+ aspect, mood, etc.) it'd seem not an impossible undertaking (do enough reading in linguistic typology and you start seeing enough "repeats" to think such an enumeration might be possible).
After going through a bit of the grammar and the vocab list on wiktionary it seems like you'd have constant problems with synecdoche, which'd bother me (but perhaps only me, and it's not as though natural languages aren't riddled with similar problems).
I've walked away from this with a much clearer idea of the sense in which lojban is attempting to be a logical language, thanks for your time.
It's heartening to see substantial effort put into engineering language; good luck with your efforts.
There are many people who can converse fairly well in real-time written Lojban (although vocabulary is limited, and some parts of the language are generally not used).
Due to audio-visual isomorphism, being able to converse in written Lojban almost implies being able (theoretically) to converse in spoken Lojban. However, people are generally much less used to speaking and hearing Lojban, so spoken conversations are usually somewhat difficult (definitely far from fluent).
Right. It's been around since 1955 (in spirit at least, in the form of Loglan) and the best anyone can muster is to converse "fairly well" in "real-time" but not in spoken conversation. In other words, it's a language no one can speak. Klingon speakers are even more fluent than Lojban speakers.
I'm not trying to say Lojban is stupid. It's not. It's cool. It fills an interesting niche in the language world. However, I'm not convinced it's speakable. Remember the quote by Kernighan: "Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it." I would argue that speaking a language is at least twice as hard as inventing it or writing it in a non-real-time fashion, so if you create the most complicated language you can still manage to write (however laboriously), you are by definition not smart enough to speak it. Lojban is that language.
Lojban is not designed to be complicated. The important parts of the BNF grammar are about a page, and easily understood in an hour or so of properly directed study.
The hard parts are the vocabulary and the mind-set. It is specifically designed to be different, but even then, there are sufficient similarities that it doesn't have to feel completely alien. Some people present it as such because they think it will attract people, but I know that sometimes familiarity is a better draw.
It can be presented either way.
And people tend to say uncomplicated things in Klingon, whereas most lojban speakers are exploring saying very complicated things - you're not comparing like with like. Further, although gatherings of Klingon speakers are fairly common - piggy-backing on Star Trek conventions - gatherings of lojban speakers are rare. Even so, there are several people who speak it fairly fluently, and the number is growing (in some sense).
It is often criticised (I'm not saying you are doing so) for being something other than what people think it "ought" to be. This is a marketing issue. I think it's interesting, and, like learning Lisp, it has expanded my mind in interesting ways.
Lojban may not be designed to be complicated, but it's designed to be complicated. By that I mean, while the goal might not be "make it complicated," the goals necessitate that it will in fact be complicated. There is no way to remove all ambiguity from language without making it exceedingly complicated.
This is a persistent, incorrect interpretation of what people mean when they say that lojban is intended to be unambiguous.
Parsing lojban is unambiguous. Interpreting lojban quite specifically is not unambiguous.
Given a grammatically correct construction there is a unique parse. That is what is meant by "unambiguous". There are no problems akin to "Machines need to wreck a nice beach."
There are manifold ambiguities, however, in the semantics. When one talks of "lo sutra tavla" there is no indication as to the sense in which the speaker is fast. Perhaps the speaker produces many words per minute, or perhaps the speaker runs past while talking. These ambiguities can be reduced by using more precise expressions. Metaphorical use is frowned upon, so what it does not mean is one who persuades in a fraudulent manner.
For example, we can say "lo gerku" which refers to a dog, or some dogs, but gives no idea of how many. We could say "re le ci gerku", which means "two of the three dogs." More precise.
I can say "mi tavla", which means "I speak" or "I will speak" or "I have spoken" and even leaves the audience, topic and language unspecified. I can say "mi ba tavla" or "mi pu tavla" which are future and past respectively. I can be even more precise if necessary or desirable.
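As a toy illustration of the tense examples above (nothing like a real lojban parser; the `TENSE` table and `tense_of` are hypothetical names, and only the two markers mentioned here are covered):

```python
# The two tense markers mentioned above; anything else is left unspecified.
TENSE = {"pu": "past", "ba": "future"}

def tense_of(sentence):
    """Return the tense marked in a whitespace-separated sentence, if any."""
    for word in sentence.split():
        if word in TENSE:
            return TENSE[word]
    return "unspecified"
```

So "mi tavla" comes back as unspecified, while adding "ba" or "pu" narrows it down, which is the sense in which precision is opt-in.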
... my understanding based solely on reading ABOUT Lojban, I don't know a single Lojban word... aside from Lojban
I think you have fallen into the trap of not reading enough, and mis-interpreting some of what you have read.
I'm not surprised, I think much of the early material written about lojban was written without regard for how it might be mis-interpreted. Politicians today generally say nothing, because everything they do say runs the risk of being taken other than intended. The early lojban writers (writing about lojban, not necessarily in lojban) needed "spin doctors" to ensure that what they said could not be mis-interpreted.
All that aside, lojban is intended to be an expressive language, suitable for communication. Therefore it will be complex, although the complexities are not necessarily those of natlangs. I suspect that we are not that far apart. We agree that:
- lojban is complex
- lojban is not currently suited for general use
- lojban is cool
I further believe that:
- learning lojban (at least beyond "mi tavla") is mind-expanding
- learning the structures of lojban teaches more than just lojban: it teaches about structure, syntax, and monolinguistic assumptions.