Back

If your email is not recognized and you believe it should be, please contact us.

  • You must be logged in to reply to this topic.Login

Grep for Non-Western characters combined with Western characters (Unicode)

Return to Member Forum

  • Author
    Posts
    • #95658
      Lyezs °
      Member

      Hello,

      I start with a main document in English. Therefor I have several Paragraph styles made in directories. When the main document is ready in lay-out I take that as a start to place other languages like DE, NL, FR, … For them I use the same paragrahp styles but change the langues. At the end I have 1 directory for each languages in my paragraph styles. Sometimes I also have Chinees, Arabic and Japanees text.

      Now I was wondering if there is a grep that I could use that recognises the Western characters (unicode) from the non Western, including registeremarks, any digit, punctuation marks and even Brackets. (Idealy the same grep for all the languages (Chinees, Arabic and Japanees))

      Fyi – the fonts for the different languages are:
      CN: Adobe Song
      Arabic: Adobe Arabic
      Japanees: Meiryo
      Western: Helvetica Neue LT Std

    • #95661
      Lyezs °
      Member

      For the CN-EN version I’m almost there I believe.
      I’ve found a grep on this site

      [^\x{3000}-\x{efff}\x{4E00}-\x{9FD5}\x{3300}-\x{33FF}][^.,;:?!]+

      It works fine untill their are punctuation or a space between two words. The part after the white space or for ex. ‘?’ he can’t recognise the Chinese characters anymore.

      You can look into a basic file with the grep here:
      https://indd.adobe.com/view/091c9dde-4eee-4faa-93b6-0a8b775a37d3

Viewing 1 reply thread
  • The forum ‘General InDesign Topics (CLOSED)’ is closed to new topics and replies.
Forum Ads