Stories
Slash Boxes
Comments

SoylentNews is people

posted by NCommander on Sunday February 16 2014, @10:13PM   Printer-friendly
from the ¡sᴉɥʇ-sǝlpuɐɥ-ʍou-ǝʇᴉs-ǝɥʇ dept.
So, after dealing with a bit of monkeying with the database, I'm pleased to announce that Soylent should (in theory) have support for UTF-8 starting immediately. Now obviously this isn't well tested, so this is your chance to break the site in two, consider the comments below to be "open season" so to speak. I know the comment preview has some issues with UTF-8 (and it only works at all in Plain Text or HTML modes)

For purposes of breakage, anything that breaks the site layout/Reply To/Parent/Moderate buttons, or breaks any comments beyond itself is considered bad. We need to stop those. If you can break it (which shouldn't be hard), you earn a cookie, and I'll get you in the CREDITS file as something awesome.

For comments that are just plain unreadable, moderation will take care of them, and that isn't considered a bug. So go forth and BREAK my minions! ()}:o)↺
 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 4, Informative) by mattie_p on Sunday February 16 2014, @10:24PM

    by mattie_p (13) on Sunday February 16 2014, @10:24PM (#322) Journal

    If I can break something without even trying, surely I'll be able to do it when I do try

    • (Score: 1) by StupendousMan on Monday February 17 2014, @02:03AM

      by StupendousMan (103) on Monday February 17 2014, @02:03AM (#368)
      Try to break it?  Use funny characters?

      bã‚ã‘ã&# 8218;“

      æ°´

      OD.1.3   Ï€Î¿Î»Î» ῶν

      • (Score: 1) by StupendousMan on Monday February 17 2014, @02:05AM

        by StupendousMan (103) on Monday February 17 2014, @02:05AM (#369)
        My post below has several lines of Japanese and Greek letters, pasted into the "Comment" box.

        So, if I try using "Plain old text", the code below cannot be posted.

        If I try "HTML formatted", it can't be posted.

        If I try "code", it IS posted, but the results -- shown in post above this one -- are bad: one can't see the characters properly.

        bã‚ã‘ã‚“

        æ°´

        OD.1.3 πολλ& #225;¿¶Î½
        • (Score: 1) by StupendousMan on Monday February 17 2014, @02:09AM

          by StupendousMan (103) on Monday February 17 2014, @02:09AM (#371)
          And the post above, I tried "Extrans", but that didn't work, either.

          Rats. Can't get Japanese kana or Greek letters.
          • (Score: 1) by omoc on Monday February 17 2014, @06:28AM

            by omoc (39) on Monday February 17 2014, @06:28AM (#453)

            According to the preview, Chinese characters don't work either I think

            我很快ä¹

    • (Score: 1) by yellowantphil on Thursday February 20 2014, @02:53AM

      by yellowantphil (2125) on Thursday February 20 2014, @02:53AM (#3096) Homepage

      I don’t see a comment button other than “reply to this.†Hmm… I wasn’t willing to use unicode in my comment on another page, because the preview looked wrong, but maybe this comment will look fine after I submit it.

      • (Score: 2) by mattie_p on Thursday February 20 2014, @03:33AM

        by mattie_p (13) on Thursday February 20 2014, @03:33AM (#3129) Journal

        Hitting "reply" to the main story is equivalent to "post." But you're welcome to test out almost anything in this thread so we can work the bugs out.

        • (Score: 1) by yellowantphil on Thursday February 20 2014, @04:19AM

          by yellowantphil (2125) on Thursday February 20 2014, @04:19AM (#3156) Homepage

          Ah yes, there is the reply button. Thanks.

          I wonder how someone else managed to post in braille, and I can’t even get quote marks to work. I’ll try HTML.

  • (Score: 5, Interesting) by ticho on Sunday February 16 2014, @10:24PM

    by ticho (89) on Sunday February 16 2014, @10:24PM (#323) Homepage Journal

    Braille:

        ⡌⠁⠧⠑ ⠼⠁⠒ ⡍⠜⠇⠑⠹⠰⠎ ⡣⠕⠌

        ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠙⠑⠁⠙⠒ ⠞⠕ ⠃⠑⠛⠔ ⠺⠊⠹⠲ ⡹⠻⠑ ⠊⠎ ⠝⠕ ⠙⠳⠃⠞
        ⠱⠁⠞⠑⠧⠻ ⠁⠃⠳⠞ ⠹⠁⠞⠲ ⡹⠑ ⠗⠑⠛⠊⠌⠻ ⠕⠋ ⠙⠊⠎ ⠃⠥⠗⠊⠁⠇ ⠺⠁⠎
        ⠎⠊⠛⠝⠫ ⠃⠹ ⠹⠑ ⠊⠇⠻⠛⠹⠍⠁⠝⠂ ⠹⠑ ⠊⠇⠻⠅⠂ ⠹⠑ ⠥⠝⠙⠻⠞⠁⠅⠻⠂
        ⠁⠝⠙ ⠹⠑ ⠡⠊⠑⠋ ⠍⠳⠗⠝⠻⠲ ⡎⠊⠗⠕⠕⠛⠑ ⠎⠊⠛⠝⠫ ⠊⠞⠲ ⡁⠝⠙
        ⡎⠊⠗⠕⠕⠛⠑⠰⠎ ⠝⠁⠍⠑ ⠺⠁⠎ ⠛⠕⠕⠙ ⠥⠏⠕⠝ ⠰⡡⠁⠝⠛⠑⠂ ⠋⠕⠗ ⠁⠝⠹⠹⠔⠛ ⠙⠑
        ⠡⠕⠎⠑ ⠞⠕ ⠏⠥⠞ ⠙⠊⠎ ⠙⠁⠝⠙ ⠞⠕⠲

        ⡕⠇⠙ ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲

        ⡍⠔⠙⠖ ⡊ ⠙⠕⠝⠰⠞ ⠍⠑⠁⠝ ⠞⠕ ⠎⠁⠹ ⠹⠁⠞ ⡊ ⠅⠝⠪⠂ ⠕⠋ ⠍⠹
        ⠪⠝ ⠅⠝⠪⠇⠫⠛⠑⠂ ⠱⠁⠞ ⠹⠻⠑ ⠊⠎ ⠏⠜⠞⠊⠊⠥⠇⠜⠇⠹ ⠙⠑⠁⠙ ⠁⠃⠳⠞
        ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲ ⡊ ⠍⠊⠣⠞ ⠙⠁⠧⠑ ⠃⠑⠲ ⠔⠊⠇⠔⠫⠂ ⠍⠹⠎⠑⠇⠋⠂ ⠞⠕
        ⠗⠑⠛⠜⠙ ⠁ ⠊⠕⠋⠋⠔⠤⠝⠁⠊⠇ ⠁⠎ ⠹⠑ ⠙⠑⠁⠙⠑⠌ ⠏⠊⠑⠊⠑ ⠕⠋ ⠊⠗⠕⠝⠍⠕⠝⠛⠻⠹
        ⠔ ⠹⠑ ⠞⠗⠁⠙⠑⠲ ⡃⠥⠞ ⠹⠑ ⠺⠊⠎⠙⠕⠍ ⠕⠋ ⠳⠗ ⠁⠝⠊⠑⠌⠕⠗⠎
        ⠊⠎ ⠔ ⠹⠑ ⠎⠊⠍⠊⠇⠑⠆ ⠁⠝⠙ ⠍⠹ ⠥⠝⠙⠁⠇⠇⠪⠫ ⠙⠁⠝⠙⠎
        ⠩⠁⠇⠇ ⠝⠕⠞ ⠙⠊⠌⠥⠗⠃ ⠊⠞⠂ ⠕⠗ ⠹⠑ ⡊⠳⠝⠞⠗⠹⠰⠎ ⠙⠕⠝⠑ ⠋⠕⠗⠲ ⡹⠳
        ⠺⠊⠇⠇ ⠹⠻⠑⠋⠕⠗⠑ ⠏⠻⠍⠊⠞ ⠍⠑ ⠞⠕ ⠗⠑⠏⠑⠁⠞⠂ ⠑⠍⠏⠙⠁⠞⠊⠊⠁⠇⠇⠹⠂ ⠹⠁⠞
        ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲

        (The first couple of paragraphs of "A Christmas Carol" by Dickens)

    • (Score: 2, Funny) by Anonymous Coward on Sunday February 16 2014, @10:30PM

      by Anonymous Coward on Sunday February 16 2014, @10:30PM (#326)

      I can't see the naked lady.

      • (Score: 1) by stderr on Sunday February 16 2014, @10:59PM

        by stderr (11) on Sunday February 16 2014, @10:59PM (#332) Journal

        She doesn't appear until chapter 2.

        --
        alias sudo="echo make it yourself #" # ... and get off my lawn!
    • (Score: 4, Insightful) by Nerdfest on Monday February 17 2014, @12:34AM

      by Nerdfest (80) on Monday February 17 2014, @12:34AM (#353)

      All I see is blonde, brunette, redhead ...

      • (Score: 5, Funny) by chromas on Monday February 17 2014, @01:06AM

        by chromas (34) Subscriber Badge on Monday February 17 2014, @01:06AM (#364) Journal

        Actually, it's braille, so you feel the blonde, brunette and redhead.

        • (Score: 0) by Anonymous Coward on Monday February 17 2014, @12:52PM

          by Anonymous Coward on Monday February 17 2014, @12:52PM (#637)

          Her mouth says "no" but her bumps say "⢀⣲⠢⡔".

  • (Score: 1) by Landon on Sunday February 16 2014, @10:28PM

    by Landon (45) on Sunday February 16 2014, @10:28PM (#324) Journal

    test

  • (Score: 1) by Techwolf on Sunday February 16 2014, @10:29PM

    by Techwolf (87) on Sunday February 16 2014, @10:29PM (#325)

    oooooOOOOoooɹɹɹɐɐɐɐ --werewolf greeting in upside down. :-)

    • (Score: 1) by ticho on Sunday February 16 2014, @10:34PM

      by ticho (89) on Sunday February 16 2014, @10:34PM (#330) Homepage Journal

      Alas, mirrored text (done by using ‮) doesn't seem to work, even if it does show up mirrored when pasted in the comment editbox.

    • (Score: 1) by c0lo on Wednesday February 19 2014, @02:58AM

      by c0lo (156) Subscriber Badge on Wednesday February 19 2014, @02:58AM (#2111) Journal

      A character beyond what UNICODE restricts itself: &amp#x20FFFF; - shows like
      Something below the max limit, valid, but unassigned 𰀁 - shows like
      Something that's an invalid UNICODE character -  - shows like
      The REPLACEMENT CHARACTER i.e. � - shows like �

      Now, what the above will do inside the storage, I don't know, I'm just trying to go forth and BREAK those minions!

      (hmmm, the "preview" looks like they are totally kicked out, not replaced by the replacement character [wikipedia.org] - won't cry over the loss of it)

      --
      https://www.youtube.com/watch?v=aoFiw2jMy-0 https://soylentnews.org/~MichaelDavidCrawford
  • (Score: 5, Funny) by mtrycz on Sunday February 16 2014, @10:32PM

    by mtrycz (60) on Sunday February 16 2014, @10:32PM (#329)

    You can't parse [X]HTML with regex. Because HTML can't be parsed by regex. Regex is not a tool that can be used to correctly parse HTML. As I have answered in HTML-and-regex questions here so many times before, the use of regex will not allow you to consume HTML. Regular expressions are a tool that is insufficiently sophisticated to understand the constructs employed by HTML. HTML is not a regular language and hence cannot be parsed by regular expressions. Regex queries are not equipped to break down HTML into its meaningful parts. so many times but it is not getting to me. Even enhanced irregular regular expressions as used by Perl are not up to the task of parsing HTML. You will never make me crack. HTML is a language of sufficient complexity that it cannot be parsed by regular expressions. Even Jon Skeet cannot parse HTML using regular expressions. Every time you attempt to parse HTML with regular expressions, the unholy child weeps the blood of virgins, and Russian hackers pwn your webapp. Parsing HTML with regex summons tainted souls into the realm of the living. HTML and regex go together like love, marriage, and ritual infanticide. The cannot hold it is too late. The force of regex and HTML together in the same conceptual space will destroy your mind like so much watery putty. If you parse HTML with regex you are giving in to Them and their blasphemous ways which doom us all to inhuman toil for the One whose Name cannot be expressed in the Basic Multilingual Plane, he comes. HTML-plus-regexp will liquify the n​erves of the sentient whilst you observe, your psyche withering in the onslaught of horror. Rege̿̔̉x-based HTML parsers are the cancer that is killing StackOverflow it is too late it is too late we cannot be saved the trangession of a chi͡ld ensures regex will consume all living tissue (except for HTML which it cannot, as previously prophesied) dear lord help us how can anyone survive this scourge using regex to parse HTML has doomed humanity to an eternity of dread torture and security holes using regex as a tool to process HTML establishes a breach between this world and the dread realm of c͒ͪo͛ͫrrupt entities (like SGML entities, but more corrupt) a mere glimpse of the world of reg​ex parsers for HTML will ins​tantly transport a programmer's consciousness into a world of ceaseless screaming, he comes, the pestilent slithy regex-infection wil​l devour your HT​ML parser, application and existence for all time like Visual Basic only worse he comes he comes do not fi​ght he com̡e̶s, ̕h̵i​s un̨ho͞ly radiańcé destro҉ying all enli̍̈́̂̈́ghtenment, HTML tags lea͠ki̧n͘g fr̶ǫm ̡yo​͟ur eye͢s̸ ̛l̕ik͏e liq​uid pain, the song of re̸gular exp​ression parsing will exti​nguish the voices of mor​tal man from the sp​here I can see it can you see ̲͚̖͔̙î̩́t̲͎̩̱͔́̋̀ it is beautiful t​he final snuffing of the lie​s of Man ALL IS LOŚ͖̩͇̗̪̏̈́T ALL I​S LOST the pon̷y he comes he c̶̮omes he comes the ich​or permeates all MY FACE MY FACE ᵒh god no NO NOO̼O​O NΘ stop the an​*̶͑̾̾​̅ͫ͏̙̤g͇̫͛͆̾ͫ̑͆l͖͉̗̩̳̟̍ͫͥͨe̠̅s ͎a̧͈͖r̽̾̈́͒͑e n​ot rè̑ͧ̌aͨl̘̝̙̃ͤ͂̾̆ ZA̡͊͠͝LGΌ ISͮ̂҉̯͈͕̹̘̱ TO͇̹̺ͅƝ̴ȳ̳ TH̘Ë͖́̉ ͠P̯͍̭O̚​N̐Y̡ H̸̡̪̯ͨ͊̽̅̾̎Ȩ̬̩̾͛ͪ̈́̀́͘ ̶̧̨̱̹̭̯ͧ̾ͬC̷̙̲̝͖ͭ̏ͥͮ͟Oͮ͏̮̪̝͍M̲̖͊̒ͪͩͬ̚̚͜Ȇ̴̟̟͙̞ͩ͌͝S ̨̥̫͎̭ͯ̿̔̀ͅ

    • (Score: 1) by Pav on Monday February 17 2014, @10:54AM

      by Pav (114) on Monday February 17 2014, @10:54AM (#561)

      So Tony the Pony brings us UTF-8... I would have expected a unicorn. :-/

      • (Score: 0) by Anonymous Coward on Thursday February 20 2014, @09:10AM

        by Anonymous Coward on Thursday February 20 2014, @09:10AM (#3277)

        UTF = Unicorn Tony Form?

    • (Score: 1) by slartibartfastatp on Monday February 17 2014, @03:37PM

      by slartibartfastatp (588) on Monday February 17 2014, @03:37PM (#779) Journal

      Then we get to know why slashdot won't support UTF-8...

    • (Score: 2, Funny) by edIII on Monday February 17 2014, @06:33PM

      by edIII (791) on Monday February 17 2014, @06:33PM (#927)

      Wow. Your comment made Hal look lucid in his last moments :)

      "Daisy...."

      --
      Technically, lunchtime is at any moment. It's just a wave function.
  • (Score: 5, Insightful) by bryan on Sunday February 16 2014, @11:18PM

    by bryan (29) <bryan@pipedot.org> on Sunday February 16 2014, @11:18PM (#337) Homepage Journal

    I know UTF-8 is one of those "features" that many people on slashdot have missed for a long time, but I thought most of that was for simple additions like the euro/pound/yen symbols and such.

    Am I the only one that would prefer not to see non-english text mixed in with the comments?

    Then again, I'd fully support an exception for Klingon. Maybe elvish too.

    • (Score: 1) by clone141166 on Monday February 17 2014, @01:31AM

      by clone141166 (59) on Monday February 17 2014, @01:31AM (#366)

      Yeah it may mean some unreadable posts, but as pointed out in the story text moderation should take care of that. There was even talk of a new mod category, something like "-1 Unintelligible" :P

      • (Score: 1) by turtledawn on Monday February 17 2014, @06:16AM

        by turtledawn (136) <reversethis-{moc ... ta} {nwadeltrut}> on Monday February 17 2014, @06:16AM (#447)

        I rather like that idea for a mod.

        • (Score: 1) by Spook brat on Wednesday February 19 2014, @03:57PM

          by Spook brat (775) on Wednesday February 19 2014, @03:57PM (#2544) Journal
          There's some room for abuse there; 'unintelligible' ranges from "I don't speak that language" to Time Cube to "poster doesn't make cogent argument". Since that last one borders closely on "I don't agree with this" I hope meta-moderation will keep that in check.
          --
          Travel the galaxy! Meet fascinating life forms... And kill them [schlockmercenary.com]
    • (Score: 1) by FatPhil on Tuesday February 18 2014, @11:53AM

      by FatPhil (863) <pc-soylentNO@SPAMasdf.fi> on Tuesday February 18 2014, @11:53AM (#1546) Homepage

      There will be stories about Finns with äs in their names. Swedes with Ås in their names, and maybe, just maybe, stores about icelandic volcanoes with ðs in their name. Whilst I like the idea of everyone agreeing to use English language, that doesn't mean every word they'll be typing will be English.

      Note - this post isn't UTF-8, this is plain old ASCII, I used the &entity; syntax.

      --
      Great minds discuss ideas; average minds discuss events; small minds discuss people; the smallest discuss themselves
    • (Score: 1) by xaxa on Tuesday February 18 2014, @07:49PM

      by xaxa (1489) on Tuesday February 18 2014, @07:49PM (#1848)

      At present, there's nothing stopping someone from writing in German, French, Spanish, or many other languages using Latin letters (more or less). Since it doesn't happen, I don't think we need to worry.

      (But if I post in a quote with an em-dash: — or some “proper†‘quotes,’ maybe an arrow → or temperature (0°C) it won't make a mess.)

      (Nope, looks like it's going to make a mess.)

    • (Score: 1) by M. Baranczak on Wednesday February 19 2014, @02:38AM

      by M. Baranczak (1673) on Wednesday February 19 2014, @02:38AM (#2101)

      Well, why shouldn't we be able to post non-English comments?

      And even if you don't, it would be nice to be able to spell some people's names correctly.

      Lech Wałęsa [wikipedia.org]
      Friðrik Þór Friðriksson [wikipedia.org]
      Jaroslav Hašek [wikipedia.org]

    • (Score: 1) by c0lo on Wednesday February 19 2014, @03:12AM

      by c0lo (156) Subscriber Badge on Wednesday February 19 2014, @03:12AM (#2115) Journal

      Am I the only one that would prefer not to see non-english text mixed in with the comments?

      How do you consider the use of ℃ or ㎞ units: are they proper English or part of the non-english text?

      Oooopps. A copy/paste of these characters from the "Character map" (on Ubuntu/Firefox) straight into the reply text box results in them showing mangled in the preview (no mater if plain-old text or HTML).
      To provide the context: I was enquiring about the use of these and units.

      --
      https://www.youtube.com/watch?v=aoFiw2jMy-0 https://soylentnews.org/~MichaelDavidCrawford
      • (Score: 1) by maxwell demon on Thursday February 20 2014, @08:31AM

        by maxwell demon (1608) on Thursday February 20 2014, @08:31AM (#3264) Journal

        The bytes are probably interpreted by slashcode as latin1 instead of utf8.

        --
        The Tao of math: The numbers you can count are not the real numbers.
  • (Score: 3, Interesting) by Maow on Monday February 17 2014, @02:35AM

    by Maow (8) on Monday February 17 2014, @02:35AM (#379) Homepage

    fooðŒ†bar

    Hmm, not seeing a preview. Using Plain Old Text mode. Switching to HTML next...

    OK, HTML Formatted does give a preview.

    But the 4 horizontally stacked lines do not represent properly in preview.

    So, it's a bug.

    Here's a guide to getting FULL UTF8MB4 support in the DB:

    http://mathiasbynens.be/notes/mysql-utf8mb4 [mathiasbynens.be]

    OH, and now I see preview in Plain Old Text, having added more than the "foo_bar" that was originally present.

    And, to be certain, I removed everything but the "foo_bar" and got ... no preview in Plain Old Text.

    I like my cookies crispy with chocolate chips!

    • (Score: 1) by Popsikle on Monday February 17 2014, @03:10AM

      by Popsikle (77) on Monday February 17 2014, @03:10AM (#386) Homepage

      no can post.

      • (Score: 1) by Popsikle on Monday February 17 2014, @03:13AM

        by Popsikle (77) on Monday February 17 2014, @03:13AM (#390) Homepage

        'The Ross–Littlewood paradox[clarification needed] (also known as the balls and vase problem or the ping pong ball problem) is a hypothetical problem in abstract mathematics and logic designed to illustrate the seemingly paradoxical, or at least non-intuitive, nature of infinity. More specifically, like the Thomson's lamp paradox, the Ross–Littlewood paradox tries to illustrate the conceptual difficulties with the notion of a supertask, in which an infinite number of tasks are completed sequentially.[1] The problem was originally described by mathematician John E. Littlewood in his 1953 book Littlewood's Miscellany, and was later expanded upon by Sheldon Ross in his 1988 book A First Course in Probability.

        • (Score: 1) by Popsikle on Monday February 17 2014, @03:16AM

          by Popsikle (77) on Monday February 17 2014, @03:16AM (#392) Homepage

          "MαgđαlÑи′s ÄαÑκиÑÑs" is a bad bad string about a bad mans darkness.

  • (Score: 1) by weilawei on Monday February 17 2014, @05:39AM

    by weilawei (109) on Monday February 17 2014, @05:39AM (#413)

    Oh bleep. There goes the neighborhood (U+0CCB). ೋ

  • (Score: -1, Redundant) by Anonymous Coward on Monday February 17 2014, @05:46AM

    by Anonymous Coward on Monday February 17 2014, @05:46AM (#420)
    Although I can't think of an example right now, there was moments when the lack of unicode was problematic earlier for some discussions.

    When writing letters and symbols outside of the normal keyboard mapping, what is the most often used method? Is it with AltGr key (like AltGr-M for µ), or with a Compose key, a keycombo to enter unicode char number (ctrl-shift-u ?), or simply cut&paste from a character table application?

    A⃣ B⃣ C⃣

    There will probably be use of lots of unicode in discussions about languages etc later on. I might add that I think UTF-16 would be preferable.

    Let me sing a little song to celebrate this event: ♩♫♬♯♪♩♫♬♯♩♪♫♬♯♪

    I'm not sure it is my web browsers fault or not, but it didn't work if I wrote the characters here directly, only if I entered them html encoded ( ♩♫♬♯♪♩♫ð…Ÿâ™¬â™¯â™ªð…¡ð…žâ™©â™«ð…Ÿâ ¬â™¯â™ªð…¡ð…ž) ��

  • (Score: 1) by iNaya on Monday February 17 2014, @07:11AM

    by iNaya (176) on Monday February 17 2014, @07:11AM (#469)

    I don't seem to be able to post anything with non-Latin characters.

  • (Score: 1) by k8n on Monday February 17 2014, @09:15AM

    by k8n (295) on Monday February 17 2014, @09:15AM (#519)

    Preview doen't work... neither text nor html

    text
    текÑÑ‚
    κείμενο
    课文
    課文
    テキスト
    Õ¿Õ¥Ö„Õ½Õ¿
    mətn
    শিরোনাম
    ტექსტი
    પાઠ
    tèks
    पाठ
    ಪಠà³à²¯
    អážáŸ’ážáž”áž‘
    ì›ë³¸
    ເນື້ອໃນ
    मजकूर
    உரை
    టెకà±à°¸à±à°Ÿà±
    ข้อความ
    văn bản
    Ñ‚ÑкÑÑ‚
    متن
    نص
    טקסט

  • (Score: 1) by Eunuchswear on Monday February 17 2014, @01:48PM

    by Eunuchswear (525) on Monday February 17 2014, @01:48PM (#687) Journal

    If I just type it in I get the traditional "utf8 shows as 8859-1" crap:

    «Les accents ont une fonction en français, ne serait-ce que pour distinguer,
      dans un hôpital psychiatrique, entre les internes et les internés.»
                                                                - Annie Bourret

    --
    Watch this Heartland Institute video [youtube.com]
  • (Score: 1) by VLM on Monday February 17 2014, @02:19PM

    by VLM (445) Subscriber Badge on Monday February 17 2014, @02:19PM (#711)

    ∇.D=p

    ∇.B=0

    ∇xE=-∂B/∂t

    ∇xH= ∂D/∂t+j

    • (Score: 2, Funny) by VLM on Monday February 17 2014, @02:25PM

      by VLM (445) Subscriber Badge on Monday February 17 2014, @02:25PM (#720)

      Maxwells equations always sounded better in their original Klingon.

      Its supposed to be pretty tame stuff, div B equals zero and all that. Net charge of magnetic field in space doesn't exist aka net flow in and out of a closed surface is zero as long as monopoles don't exist, that kind of thing.

  • (Score: 2, Informative) by stormwyrm on Monday February 17 2014, @04:20PM

    by stormwyrm (717) on Monday February 17 2014, @04:20PM (#814) Journal

    Doesn't look like it does. Been trying to type Japanese text but it doesn't seem to work. Japanese text in the subject gets mangled into XML entities.

    --
    Numquam ponenda est pluralitas sine necessitate.
    • (Score: 1) by stormwyrm on Monday February 17 2014, @11:32PM

      by stormwyrm (717) on Monday February 17 2014, @11:32PM (#1176) Journal

      Seems to be still just as broken as it was on the old site. I get the same garbage when I type something like this: Qu'on me donne six lignes écrites de la main du plus honnête homme, j'y trouverai de quoi le faire pendre. That's supposed to be a quote attributed to Cardinal Richelieu, and is my main rebuttal to people who say they have nothing to hide. It works if I force encoding to ISO-8859-1 as I had to before (see my sig for how it should look).

      --
      Numquam ponenda est pluralitas sine necessitate.
    • (Score: 0) by Anonymous Coward on Thursday February 20 2014, @01:07AM

      by Anonymous Coward on Thursday February 20 2014, @01:07AM (#3027)
      <p>&#230;&#8212;&#165;&#230;&#339;&#172;&#232;&#17 0;&#382;&#227;&#174;&#230;&#8211;&#8225;&#229;&#17 3;&#8212;&#227;&#8218;&#8217;&#228;&#189;&#191;&#2 27;&#163;&#227;&#166;&#232;&#166;&#8249;&#227;&#19 0;&#227;&#8212;&#227;&#8218;&#8225;</p>
  • (Score: 2, Interesting) by DarkMorph on Monday February 17 2014, @07:07PM

    by DarkMorph (674) on Monday February 17 2014, @07:07PM (#948)

    Does this in any way suggest that we might have a SoylentNews.jp [soylentnews.jp] in the future or are we abandoning all hope for the Japanese /. crowd that might be interested in migrating or at least additionally visiting SN?

  • (Score: 1) by yellowantphil on Sunday February 23 2014, @01:38AM

    by yellowantphil (2125) on Sunday February 23 2014, @01:38AM (#5034) Homepage

    E = mc²

    F = T∇St

    I don’t think it’s working for me, but it is for other people…

    • (Score: 1) by yellowantphil on Sunday February 23 2014, @01:52AM

      by yellowantphil (2125) on Sunday February 23 2014, @01:52AM (#5037) Homepage

      UTF-8: Café, soupçon

      HTML entities: Café, soupçon

      Id rather not HTML-encode everything I type, but at least Duck Duck Go gives me a handy table of HTML entities [duckduckgo.com].

      My quotation marks are disappearing, when encoded as HTML.

  • (Score: 1) by PrinceVince on Monday February 24 2014, @01:27AM

    by PrinceVince (2801) on Monday February 24 2014, @01:27AM (#5429)

    Isn't setting up a UTF-8 capable front-end and database a pretty basic task these days; something do get done after following a few tutorials and articles?

    You create your database with the right settings (e.g. utf8_general_ci collation in MySQL) and make sure that your page scripts don't garble the content entered via the form. Recent versions of PHP and Python can do that just fine, never used Perl though.

  • (Score: 1) by Reziac on Friday March 07 2014, @02:25AM

    by Reziac (2489) on Friday March 07 2014, @02:25AM (#12407) Homepage

    Nothing to do with comments, but rather with contents of the article box:

    The links [ /dev/random ] [ The Main Page ]
    on THIS page work.

    However, the links [ Soylent ] [ The Main Page ]
    on other pages do not work for me.

    If I turn off CSS, then these links work. So it's the CSS, not the links themselves.

    [SeaMonkey 2.5 with JS turned off, here.]

    Cripes, you'd think I could come up with a better bug than that. :D

    --
    And there is no Alkibiades to come back and save us from ourselves.
  • (Score: 1) by kbahey on Wednesday March 12 2014, @12:55AM

    by kbahey (1147) on Wednesday March 12 2014, @12:55AM (#14960) Homepage

    The text below is in Arabic, entered from Firefox on Linux.

    It does not display correctly for some unknown reason:

    العربية