python - Python translator: how to replace only one whole word

Question

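I am trying to build a translator in Python (on an S60 device). What I need is to replace only one whole word without touching the other words. Here is an example:

Original: "The brown fox jumps over the dog named brownie." I want to replace the word "brown" with "deathlesi" (ignore why). The result should be: "the deathlesi fox jumps over the dog named brownie". Instead, it also changes the "brown" in "brownie", so the result is: "the deathlesi fox jumps over the dog named deathlesiie".

Because I am trying to replace every word, it sometimes falls into a never-ending paradox. Example: "I am stupid". I am trying to change "I" to "ium", and this is what happens: "iumumumumumumumumumumumumumumumumumumum....am stupiuiuiuiuiuiuiuiuiuiuiuiuiuiuiuiuiuim..". It basically changes every "I" in the string and does not stop until there is no "I" left.

Any help? Thanks!

Edit: I have already tried "stringhere".replace(), but in some cases, such as a lowercase "i", it also replaces the "i" in "stupid".

Here is another example: "People are getting excited at the giant hare." Replacing "are" with "iume" should give "People iume getting excited at the giant hare." Instead it also replaces the "are" in "hare", which gives "People iume getting excited at the giant hiume".

Suppose I put the sentence's words in an array and translate them. That is my current approach: basically turn each word into an array entry and translate each of them. Then do this: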

# Each entry has the form "translation:original word"
translated_sentence = ["particulus:people", "iume:are", "geus:getting", "exchantus:excited", "d:at", "qun:the", "gesas:giant", "hsont:hare"]
sentence = "People are getting excited at the giant hare."
for i in translated_sentence:
    element = i.split(":")
    # Plain substring replace: this is what also rewrites the "are" inside "hare"
    sentence = sentence.replace(element[1], element[0])

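It still outputs "particulus uime geus exchantus d qun gesas huime" (instead of "hsont").

I just figured it out. I simply split the string into an array and preserve the formatting by cleaning the current word and calling string.replace() on the original word.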

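# clean() and translate() are the asker's own helpers (not shown here)
def translate_sentence(text):
    result = ""
    for i in text.split(" "):
        cleaned = clean(i)  # removes the punctuation and stuff, leaving the raw word
        translated = translate(cleaned)  # returns the translated word
        # replace inside the original token so its punctuation is kept
        result = result + i.replace(cleaned, translated) + " "
    return result

translate_sentence("The quick brown fox jumps over the lazy dog.")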
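The loop above depends on two helpers that the post does not show. A minimal sketch of how they might look, assuming `clean()` just strips leading and trailing punctuation and `translate()` is a dictionary lookup; the `TRANSLATIONS` table and both helper bodies are hypothetical, not the asker's actual code:

```python
import string

# Hypothetical word list: original word -> translated word.
TRANSLATIONS = {"brown": "deathlesi", "are": "iume", "i": "ium"}

def clean(word):
    """Strip leading/trailing punctuation, leaving the raw word."""
    return word.strip(string.punctuation)

def translate(word):
    """Case-insensitive lookup; unknown words come back unchanged."""
    return TRANSLATIONS.get(word.lower(), word)

def translate_sentence(text):
    out = []
    for token in text.split(" "):
        cleaned = clean(token)
        if cleaned:
            # Swap only the raw word so the punctuation around it is kept.
            token = token.replace(cleaned, translate(cleaned), 1)
        out.append(token)
    return " ".join(out)

print(translate_sentence("The brown fox jumps over the dog named brownie."))
print(translate_sentence("People are getting excited at the giant hare."))
```

Because each token is looked up exactly once, already translated text is never fed back into another replacement, which sidesteps the runaway "iumumum..." effect described in the question.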

score 2 · Accepted Answer

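This sounds like a job for a regular expression:

import re
x = "The brown fox jumps over the dog named brownie."
# [^\w\s]+ stands in for punctuation; Python's re module has no POSIX classes such as [:punct:]
newstring = re.sub(r"(\s+|[^\w\s]+|^)brown(\s+|[^\w\s]+|$)", r"\1deathlies\2", x, flags=re.IGNORECASE)

This produces:

>>> print(newstring)
The deathlies fox jumps over the dog named brownie.

Or:

x = "People are getting excited at the giant hare."
newstring = re.sub(r"(\s+|[^\w\s]+|^)are(\s+|[^\w\s]+|$)", r"\1iume\2", x, flags=re.IGNORECASE)

Which produces:

>>> print(newstring)
People iume getting excited at the giant hare.

The first capture group, (\s+|[^\w\s]+|^), matches whitespace, punctuation, or the start of the string; the second group, (\s+|[^\w\s]+|$), matches whitespace, punctuation, or the end of the string.

When the replacement is made, \1 and \2 put the captured punctuation or whitespace back around the replacement text, which keeps things tidy.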
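The two-group idea can be wrapped in a small reusable function. This is only a sketch: the name `replace_word` is made up for illustration, `[^\w\s]+` is used as an assumed stand-in for "punctuation" (Python's `re` does not support POSIX classes like `[:punct:]`), and a function replacement is used in place of `\1...\2` so that replacement text starting with a digit or containing backslashes cannot be misread as a group reference.

```python
import re

def replace_word(text, word, replacement):
    # Group 1: whitespace, punctuation, or start of string.
    # Group 2: whitespace, punctuation, or end of string.
    pattern = r"(\s+|[^\w\s]+|^)" + re.escape(word) + r"(\s+|[^\w\s]+|$)"
    # Put both captured separators back around the new word.
    return re.sub(
        pattern,
        lambda m: m.group(1) + replacement + m.group(2),
        text,
        flags=re.IGNORECASE,
    )

print(replace_word("The brown fox jumps over the dog named brownie.", "brown", "deathlies"))
print(replace_word("People are getting excited at the giant hare.", "are", "iume"))
```

`re.escape(word)` keeps a search word that contains regex metacharacters from being interpreted as a pattern.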

P.S.

If you are lazy, just make the capture groups (\W+|^) and (\W+|$)...
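One caveat with the separator-capturing groups is worth illustrating: each match consumes the separator that follows the word, so an immediately repeated word is skipped. The sketch below compares the `\W` form with the zero-width word boundary `\b`, which is a different technique from the one in this answer; the sample sentence is made up for the comparison.

```python
import re

text = "brown brown brownie"

# Separator-capturing form: the first match swallows the space after it,
# so the second "brown" has no separator left in front of it and is skipped.
consuming = re.sub(r"(\W+|^)brown(\W+|$)", r"\1deathlies\2", text)

# Word-boundary form: \b matches a position, not a character,
# so nothing is consumed and no groups need to be put back.
boundary = re.sub(r"\bbrown\b", "deathlies", text)

print(consuming)  # deathlies brown brownie
print(boundary)   # deathlies deathlies brownie
```

Both forms leave "brownie" alone; they differ only when the same word appears twice in a row with a single separator between the occurrences.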

score 1 · Accepted Answer

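Since you only want to replace the first occurrence, you just need a way to keep track of it. There are several ways to do that. It can be as simple as this: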

def replacer(original, looking_for, replace_with):
    ''' A straightforward way... '''
    # The third argument limits the number of replacements to one
    return original.replace(looking_for, replace_with, 1)
    # Regex equivalent: re.sub(re.escape(looking_for), replace_with, original, count=1)

The number is how many occurrences you want replaced. If there are two occurrences and you pass 2, both matches are replaced.
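A short illustration of the count argument of `str.replace`; the sample sentence is made up for the example:

```python
text = "brown fox, brown dog, brown cat"

# Only the first occurrence is replaced.
one = text.replace("brown", "deathlesi", 1)
# The first two occurrences are replaced; the third is left alone.
two = text.replace("brown", "deathlesi", 2)

print(one)  # deathlesi fox, brown dog, brown cat
print(two)  # deathlesi fox, deathlesi dog, brown cat
```

Note that the count applies to substring matches, so a "brown" inside "brownie" would be counted as an occurrence too.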

Strings are immutable, so you have to assign the new string to a name. Every call to replace produces a new string.
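A tiny sketch of what that means in practice, using the question's example sentence:

```python
original = "I am stupid"

# The return value is thrown away here, so nothing visible changes.
original.replace("I", "ium")

# Keeping the result requires binding it to a name.
changed = original.replace("I", "ium")

print(original)  # I am stupid
print(changed)   # ium am stupid
```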

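If you do not want to use the built-in, you can also write a loop that finds the Nth occurrence.

I would suggest making your post shorter (I mean fewer words, more syntax highlighting). Format it. Correct me if I have not read your post correctly.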
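Such a loop could look roughly like the following. This is a sketch, not part of the original answer: the function name `replace_nth` and the choice to return the text unchanged when there are fewer than n occurrences are assumptions made for illustration.

```python
def replace_nth(text, old, new, n):
    """Replace only the n-th (1-based) non-overlapping occurrence of old."""
    if not old or n < 1:
        return text
    start = 0
    pos = -1
    for _ in range(n):
        pos = text.find(old, start)
        if pos == -1:
            # Fewer than n occurrences: leave the text as it is.
            return text
        start = pos + len(old)
    return text[:pos] + new + text[pos + len(old):]

print(replace_nth("brown fox, brown dog, brown cat", "brown", "deathlesi", 2))
```

Like `str.replace`, this works on substrings, so it does not by itself distinguish "brown" from the start of "brownie".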

score 0 · Accepted Answer

Just call the string's replace method:

"I am stupid".replace("I", "ium")

answered 2012-04-26T03:48:02.450

score 0 · Accepted Answer

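You want to replace only words that match exactly, which is not what string.replace() does.

Replace "are", but do not replace "hare".

If that is the case:

Edited

As @Niall said, a regex search and replace is the best tool for your task.

Alternatively, if you are just starting to learn Python and regular expressions feel too complicated, just use str.split() to split the string into words and then loop over the words.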

def simply_replace(string, search, replace):
    words = string.split(' ')
    for i in range(len(words)):
        if(words[i].lower() == search):
            words[i] = replace
    return ' '.join(words)

>>> simply_replace("I am stupid", 'i', 'ium')
'ium am stupid'
>>> simply_replace("The brown fox jumps over the dog named brownie.", 'brown', 'deathly')
'The deathly fox jumps over the dog named brownie.'
>>> simply_replace("People are getting excited at the giant hare.", 'are', 'ium')
'People ium getting excited at the giant hare.'

score 0 · Accepted Answer

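I do not have Python at hand right now, but how about writing a function that converts the string into a list? You can split on the whitespace, so the list would be [The, brown, fox, jumps...]. Then do a .replace.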
