擅长:python、mysql、java
<p>命令:</p>
<pre><code>$ echo سالەم ورتا ءۇي سوزمەن الما ۇل مەنىڭ اتىم شالقار | ./arabic-to-latin
</code></pre>
<p>输出:</p>
^{pr2}$
<p>要使用文件而不是stdin/stdout:</p>
<pre><code>$ ./arabic-to-latin input_file_with_arabic_text_in_utf8 >output_latin_in_utf8
</code></pre>
<p>其中<code>arabic-to-latin</code>文件:</p>
<pre><code>#!/usr/bin/perl
use strict;
use warnings;
use utf8;
use open qw(:std :utf8);
#XXX normalization
sub replace_word {
my ($word) = @_;
$_ = $word;
if (/ء|ە|ك|گ/) { # g, k, e, or hamza in the word
tr/اوىۇ/әѳiü/; # soft
} else {
tr/اوىۇ/aoiu/; # hard
}
tr/سلەمرتزنشق/slêmrtznxқ/;
s/ءüي/üy/g;
s/ڭ/ng/g;
$_;
}
while (my $line = <>) {
$line =~ s/(\w+)/replace_word($1)/ge;
print $line;
}
</code></pre>
<p>要使<code>arabic-to-latin</code>文件可执行:</p>
<pre><code>$ chmod +x ./arabic-to-latin
</code></pre>