Google Translate via web scraping in R

Submitted by 我的梦境 on 2019-12-04 02:11:35

Question


I have a list of 1,000 texts in Russian and want to translate them into English in R. I know there are R packages for Google Translate, but they require an API key, and the Google API is now paid. In Excel VBA, I have a macro that visits the Google Translate website and performs the translation. See the URL and parameters below:

getParam = "Прием (осмотр, консультация) врача-инфекциониста первичный"
translateFrom = "ru"
translateTo = "en"

URL = "https://translate.google.pl/m?hl=" & translateFrom & "&sl=" & translateFrom & "&tl=" & translateTo & "&ie=UTF-8&prev=_m&q=" & getParam

Can the same thing be done in R?


Answer 1:


Here is a solution:

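library(RCurl)
library(XML)

getParam <- "Прием (осмотр, консультация) врача-инфекциониста первичный"
translateFrom <- "ru"
translateTo <- "en"

# Percent-encode the whole query as UTF-8, not only the spaces
search <- curlEscape(enc2utf8(getParam))

URL <- paste0("https://translate.google.pl/m?hl=", translateFrom, "&sl=", translateFrom, "&tl=", translateTo, "&ie=UTF-8&prev=_m&q=", search)

# Download the page as a single string
page <- getURL(URL)

# Parse the HTML text into an R tree
tree <- htmlTreeParse(page, asText = TRUE)

body <- tree$children$html$children$body

# The translation is selected by its position in the page, so these
# indices depend on the page layout and may need adjusting
body_text <- body$children[[5]]$children[[1]]

print(body_text)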

You can find more information about web parsing from this question.
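
The question mentions a list of 1,000 texts, while the code above translates a single string. A minimal sketch of how the same request could be repeated over a vector is shown below. The names build_translate_url, translate_all, fetch and pause are hypothetical and not part of the original answer. fetch stands for any function that takes a URL and returns the translated string (for example, a wrapper around the getURL and htmlTreeParse steps above), and the one-second pause between requests is an assumption, not a documented limit.

```r
# Build the request URL for one text. URLencode() comes with base R (utils);
# reserved = TRUE also encodes characters such as "&" and "#".
build_translate_url <- function(text, from = "ru", to = "en") {
  query <- utils::URLencode(enc2utf8(text), reserved = TRUE, repeated = TRUE)
  paste0("https://translate.google.pl/m?hl=", from, "&sl=", from,
         "&tl=", to, "&ie=UTF-8&prev=_m&q=", query)
}

# Call `fetch` once per text and collect the results.
# `fetch` is a hypothetical callback: URL in, translated string out.
# A failed request yields NA instead of stopping the whole run.
translate_all <- function(texts, fetch, from = "ru", to = "en", pause = 1) {
  out <- character(length(texts))
  for (i in seq_along(texts)) {
    url <- build_translate_url(texts[i], from, to)
    out[i] <- tryCatch(fetch(url), error = function(e) NA_character_)
    if (i < length(texts)) Sys.sleep(pause)  # wait between requests
  }
  out
}

# Offline demonstration with a stand-in fetcher (no network access):
# it returns the encoded query part of the URL in upper case.
fake_fetch <- function(url) toupper(sub(".*q=", "", url))
print(translate_all(c("a b", "c"), fake_fetch, pause = 0))
```

Passing the fetcher as an argument keeps the loop independent of how the page is downloaded and parsed, so the position-based extraction can be replaced without touching the batch logic.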



Source: https://stackoverflow.com/questions/46551298/google-translate-via-web-scraping-r
