Posts

European Association of Machine Translations (EAMT) Prague, Czech Republic – May 28-31

Welocalize is proud to sponsor the European Association of Machine Translation (EAMT) Conference which is taking pace in Prague, Czech Republic on May 29. Welocalize’s Vice President of Language Technology Solutions and noted language automation thought-leader, Olga Beregovaya will be attending the event, to advance the dialogue on hot topics such as neural linguistic processing, machine learning and language automation tools.
Olga will also share the latest findings in new projects and innovation studies taking place at Welocalize. For more information visit http://www.eamt.org/news/news_eamt_2017.php 

 

Welocalize Highlights Top Globalization and Localization Events of 2016

Welocalize takes part in a number of globalization and localization events around the world in North America, Europe, as well as Asia. We have provided highlights of some of the best industry conferences, forums and exhibitions in 2016 that we took part in, with recommendations for 2017 events that benefit anyone involved in globalization and localization activities.

LOCALIZATION WORLD & LOCLEADERS FORUMS 2016: TOKYO (APRIL), DUBLIN (JUNE), MONTREAL (OCTOBER)

Welocalize proudly sponsored and exhibited at all the Localization World 2016 conferences which took place in Tokyo in April, Dublin in June, and Montreal in October.

As we look forward into 2017, the Welocalize teams will be meeting up with clients and colleagues in Shenzhen in February, Barcelona in June and Silicon Valley in October.

For more information about these events, visit www.localizationworld.com.

To coincide with the Localization World events, Welocalize hosted LocLeaders Forums in the same locations. LocLeaders Forums are exclusive events for global brands and localization leaders to participate in an exchange of ideas about trending topics and challenges in the industry. They are highly regarded in the globalization industry and recognized as trusted, open forums that stimulate and generate ideas to help increase global business activities. Take a look at our popular Welocalize LocLeaders Magazines, which contains collective words and experiences from LocLeaders participants.

READ MORE: LOCLEADERS MAGAZINE DUBLIN 2016: EXPANDING YOUR GLOBAL REACH

READ MORE: LOCLEADERS MAGAZINE MONTREAL 2016: THE GLOBALIZATION JOURNEY

If you would like to learn more about the 2017 LocLeaders events, please contact marketing@welocalize.com.

LOCLEADERS LOCAL 2016

Welocalize hosted six LocLeaders Local Forums in 2016, which took place in Houston, Boston, New York City, Portland, Silicon Valley and Stuttgart, Germany. LocLeaders Local are special hosted dinners and peer panel discussions that focus on challenges faced by global brands in particular regions and sectors. This past year, we used the events as an opportunity to introduce Adapt Worldwide to attendees.  Adapt Worldwide, a Welocalize multilingual digital marketing agency under the theme of “Bridging the Digital Divide between Marketing and Localization.”

The LocLeaders Local format always encourages open, transparent conversation about language services and processes between industry professionals.  The local events proved incredibly successful, with client and industry experts demanding more regular events at a local level. We are excited to roll out more LocLeaders events in 2017. Please watch for our upcoming event announcements.

Find out more about what LocLeaders Germany panellist and SFDC Sales Director, Christian Weih has to say about Welocalize LocLeaders Local 2016 Germany – Technically Speaking

READ MORE: LocLeaders Local Forum 2016 – Stuttgart, Germany

WATCH: Bridging the Digital Divide between Marketing and Localization

TAKE THE DIGITAL QUIZ:  http://www.adaptworldwide.com/digital-marketing-quiz/

TAUS EVENTS 2016 – ROUNDTABLE AT VIENNA (MARCH), INDUSTRY LEADERS FORUM AND QE SUMMIT AT DUBLIN (JUNE), ANNUAL CONFERENCE AT PORTLAND (OCTOBER)

Welocalize attended key events organized by TAUS in Europe and North America in 2016. Program Manager, Lena Marg from Welocalize participated as a panel speaker at the TAUS Roundtable in Vienna. Welocalize VP of Language Technology Solutions Olga Beregovaya hosted a session and panel discussion at the TAUS Industry Leaders Forum in Dublin, focusing on the topic of innovative business and pricing models in translation. Olga, together with Welocalize Director of Client Solutions Darin Goble, contributed to the discussion in TAUS Annual Conference and QE Summit held in Portland in October 2016.

READ MORE: Welocalize Discusses MT and Quality at 2016 TAUS Events in Dublin

READ MORE: Welocalize Presents at Localization Game Changers 2016 TAUS Events in Portland

For more information, visit www.taus.net

GLOBALIZATION AND LOCALIZATION ASSOCIATION (GALA), LANGUAGE OF BUSINESS CONFERENCE 2016 – NEW YORK

Experts from Welocalize came together at the GALA 2016 Conference, hosting a series of meets to share some insights into one of our 2016 acquisitions, Adapt Worldwide, a Welocalize multilingual digital marketing agency, as well two Welocalize Life Sciences acquisitions, Nova and Global Language Solutions. Park IP Translations contributed to the discussion on topics related to regulated industries and interpretation. Tuyen Ho, Vice President of Corporate Development at Welocalize, presented “Abolish the Per Word Pricing Model” as part of the corporate strategy section of the GALA 2016 conference program held in March.

READ MORE: Welocalize to Present at GALA 2016

LOCALIZATION PROJECT MANAGERS ROUND TABLE – CALIFORNIA

Held in May 2016 in Berkeley, California, Welocalize shared industry expertise and lead discussions at The Localization Institute’s Localization Project Managers Round Table. Samantha Henderson, Welocalize Enterprise Program Senior Director, led and moderated the session “PM Responsibilities,” as part of the agenda for the event. The event gathered attendees with extensive experience in localization project management, sparking discussions on advanced topics such as stakeholder management, metrics and KPIs, agile content development, automation and quality management.

READ MORE: Welocalize Discusses Project Management Responsibilities at Localization Institute Round Table Event

EAMT CONFERENCE 2016

Welocalize sponsored The European Association for Machine Translation (EAMT) 2016. Olga Beregovaya, Welocalize’s VP of Language Technology Solutions, attended the European Association of Machine Translations (EAMT) Conference in May. Olga shared some insights on trending topics such as neural linguistic processing, machine learning, language automation tools and the latest findings in new projects and innovation studies taking place at Welocalize.

READ MORE: Welocalize at EAMT 2016

CONTENT MARKETING WORLD

Adapt Worldwide, a Welocalize multilingual digital marketing agency, exhibited alongside Welocalize at the Content Marketing World 2016, which  took place in Cleveland, Ohio. Together, teams from Adapt Worldwide and Welocalize met with leading global brand and content marketers. Our team of experts shared best practices and tips for creating next generation content, as well as deploying successful multilingual strategies with transcreation, digital content, SEO, mobile marketing and cultural adaption programs. General Manager and Co-Founder of Adapt Worldwide Huw Aveston, together with Ann Rockley and Charles Cooper, shared his knowledge on the language of digital marketing.

READ MORE: Welocalize’s Global Force Shares Next Generation Multilingual Content Solutions at Content Marketing World 2016

SCHOOL OF ADVANCED TECHNOLOGIES FOR TRANSLATORS

Welocalize sponsored and presented at the School of Advanced Technologies for Translators (SATT) 2016 that took place in Trento, Italy. This event is a two-day educational program for professional translators to stay up to date with the latest technologies and how they can be utilized in commercial solutions. Tanja Schmidt, Welocalize MT program manager and member of Welocalize Technology Solutions Team, shares some of her highlights from the event in this follow-up blog.

READ MORE: School of Advanced Technologies for Translators 2016

BRAND2GLOBAL

Brand2Global 2016 took place in Menlo Park, California. Leading global brand experts and professionals gathered to discuss trending topics in global marketing. Senior representatives from Welocalize and Adapt Worldwide participated in the event to help brand managers and leaders learn more about driving global digital strategies. Huw Aveston presented “Speaking the Same Language”, sharing some insights on bridging the divide between localization and marketing.

READ MORE: Welocalize and Adapt Worldwide Sponsor and Present at Brand2Global 2016 in Silicon Valley

AMTA 2016

The 12th biennial conference of the Association for Machine Translation in the America (AMTA) took place in Austin, Texas. AMTA have appointed Welocalize’s Olga Beregovaya as the new AMTA President and Welocalize Program Manager, Elaine O’Curran as Treasurer. The conference benefits everyone involved in machine translation, such as researchers, commercial users, and government users. Alex Yanishevsky, Welocalize Senior Manager, Globalization Technology Strategists, presented “I Ate Too Much Cake Beyond Domain-Specific MT Engines” at the AMTA 2016 conference.

Read more about the event highlights in a post by Elaine O’Curran, Welocalize Highlights From AMTA 2016 Conference.

The Welocalize team will be on the road again throughout 2017 and we hope to see you at the various events we’re participating in this year. Keep watch of our events posts, so we can connect. Come and say hello and let’s keep the conversation going!

Neural Machine Translation is the Next Big Thing

BulbWelocalize Senior Computational Linguist, Dave Landan, writes about the trends in machine translation (MT), neural machine translation (NMT) and takes us through the evolution of MT. He shares insights on how Welocalize is using cutting-edge innovation and technologies in its language tools solutions and MT programs.

It’s been almost nine years since Koehn et al. published Moses: Open Source Toolkit for Statistical Machine Translation1  in 2007, which fundamentally changed the way machine translation (MT) was done. But this was not the first fundamental shift in MT, and it looks like it won’t to be the last. To ensure our clients receive world-class levels of innovation in the area of language technology, we are working with what we are pretty sure will be the next big thing in MT. More about that to follow, but first a little context about how MT has evolved.

Brief History of MT

The field of MT began in earnest in the 1950s, first with bilingual dictionaries that permitted only word-by-word translation.  Translations by this method are seldom fluent. They are easily tripped up by polysemous words which are words with more than one meaning like “bank” or “Java,” and are often very difficult to understand by someone who doesn’t know what the intended meaning is beforehand.

From this beginning, the Next Big Thing was the introduction of rule-based machine translation (RBMT).  First there was direct RBMT, which used basic rules on top of the bilingual dictionaries.  Those helped with word order problems, but still didn’t address the other problems.  Next, we saw the introduction of transfer RBMT, which added more rules to deal with morphology and syntax to address those problems.  These systems can give performance that is quite good, but because of the richness of language, the systems are often incomplete in vocabulary coverage, syntactic coverage, or both.  RBMT is also expensive because it requires humans (linguists) to write all the rules and maintain the dictionaries that the systems use.  Still, due in part to the high cost of computing resources, RBMT dominated the field between 2000 and 2010.  There are still companies that offer good RBMT solutions today, often hybrid solutions combining RBMT with SMT.

Statistical Machine Translation (SMT)

Thanks to increased computing power at a lower cost and some pioneering research from IBM around 1990, work on statistical machine translation (SMT) began to take off in the late-1990’s and early-2000’s. In 2007, Moses was earmarked as the next big thing in MT; however, it wasn’t until 2010-2012 that it became the foundation upon which nearly every commercial SMT system was based.  SMT shifted the focus from linguists writing rules to acquiring aligned corpora, which are required to train SMT systems.  SMT has limitations as well. Language pairs that have different word order are particularly tricky and unless you have vast amounts of computing resources, modeling long-term dependencies between words or phrases is nearly impossible.

There have been incremental improvements to SMT over the past several years, including SMT using hierarchical models, and the introduction of linguistic meta-data for grammar-informed models. Nothing has come along that had such a huge impact as the jump from word-by-word to RBMT, or from RBMT to SMT, until now.

Neural Machine Translation (NMT)

Over the past two years, researchers have been working on using sequence-to-sequence mapping with artificial neural networks to develop what’s being called neural machine translation (NMT).  Essentially, they use recurrent neural networks to build a system that learns to map a whole sentence from source to target all at once, instead of word-by-word, phrase-by-phrase, or n-gram-by-n-gram.  This eliminates the problems of long-term dependencies and word-ordering, because the system learns whole sentences at once.  Indeed, some researchers are looking at extending beyond the limitations of the sentence to whole paragraphs or even documents. Document-level translation would theoretically eliminate our need for aligned files and allow us to train on transcreated material, which is unthinkable in any system available today.

NMT has shortcomings as well. Neural networks require a lot of training data, on the order of one million sentence pairs, and there’s currently no good solution to translating rare or unseen words and out of vocabulary (OOV) words.  There have been a few proposals on how to address this problem, nothing firm yet.  At Welocalize, we’re actively pursuing ideas of our own on how to fix the OOV problem for client data and we’re also working on how to overcome the amount of client data necessary to train a good NMT system.

The other major shift is that in order to train large neural networks efficiently, this requires a different set of hardware.  SMT requires a lot of memory to store phrase tables and training can be “parallelized” to work better on CPUs with multiple cores.  NMT on the other hand requires high-end GPUs (yes, video cards) for training.  We’ve invested in the infrastructure necessary to do the work and we’re working hard to get this exciting new technology ready for our clients to use.  Our early results with a variety of domain-specific data sets are very promising.

We’re not alone in our excitement. Many talks and posters at MT conferences are dedicated to advancement and progress in NMT. Google and Microsoft are both working on ways to use NMT in their translation products, with a special interest in how NMT can significantly improve fluency in translation between Asian and European languages. Watch this space in the weeks and months to come for updates on our progress with this exciting technology.

Dave

Dave Landan is Senior Computational Linguist at Welocalize. David.Landan@welocalize

Welocalize is a bronze sponsor at EAMT 2016. Click here for more information.

Read Welocalize & Trend Micro MT Case Study: MT Suitability Pilot Shortens Translation Times & Reduce Costs

1 Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open source toolkit for statistical machine translation. In Proceedings of the ACL-2007 Demo and Poster Sessions, Prague, Czech Republic.

 

Welocalize Sponsors 19th European Association for Machine Translation Conference

Frederick, Maryland – May 26, 2016 – Welocalize, global leader in innovative translation and localization solutions, is a bronze sponsor at the upcoming 19th Annual Conference of the European Association for Machine Translation (EAMT) taking place in Riga, Latvia, May 30 – June 1, 2016, at the University of Latvia.

Senior members of Welocalize Language Technology Solutions team will be attending the event and taking part in discussions related to machine translation and language automation. The team will be sharing expert insights on the latest topics including neural linguistics processing, machine learning and language automation tools.

“The EAMT conference is such an important global event for the community of people who are involved in MT and translation tools, both commercially and academically,” said Olga Beregovaya, Welocalize’s VP of language technology solutions at Welocalize. “As a leading global language service provider, the language technology and translation automation strategy is very important and we look forward to sharing the latest findings on new projects and innovation studies taking place at Welocalize.”

The annual EAMT conference features presentations and descriptions of the latest MT technology and will include real-life enterprise use case studies. It attracts over 150 delegates from across Europe and brings together key players from the localization industry, as well as significant universities and research centers. For more information about the EAMT conference 2016, visit http://eamt2016.tilde.com/

About Welocalize – Welocalize, Inc., founded in 1997, offers innovative translation and localization solutions to help global brands reach audiences around the world in more than 175 languages. We provide translation and localization services, talent management, language tools, automation and technology, quality and program management. Our range of managed language services include machine translation, digital marketing, validation and testing, interpretation, staffing and enterprise translation management technologies. We specialize in consumer, technology, manufacturing, learning, oil and gas, travel and hospitality, marketing and advertising, finance, legal and the life sciences industry language solutions. With more than 800 employees worldwide, Welocalize maintains offices in the United States, United Kingdom, Germany, Ireland, Spain, Italy, Romania, Poland, Japan and China. www.welocalize.com

eamtAbout EAMT – The European Association for Machine Translation (EAMT) is an organization that serves the growing community of people interested in MT and translation tools, including users, developers, and researchers of this increasingly viable technology. http://www.eamt.org/

Welocalize Reviews Top Localization Events of 2015

iStock_000045192190_MediumWelocalize was on the road, taking part in a number of 2015 globalization and localization events in North America and Europe. We thought it would be a good time to reflect at some event highlights, along with several great references that capture details about some of our favorite industry forums.

WELOCALIZE LOCLEADERS FORUM: LOCLEADERS BERLIN 2015 (JUNE), LOCLEADERS SILICON VALLEY 2015 (OCTOBER)

Welocalize held two LocLeaders Forum events in 2015: LocLeaders Forum 2015 Silicon Valley: Game Changers and LocLeaders Forum 2015 Berlin: Elevating Results.

Welocalize LocLeaders Forum events are incredibly popular, attracting globalization and localization leaders from global brands and leading industry experts. Attendees and Welocalize senior management gather to openly exchange ideas, discuss challenges and collaborate about solutions. The format for the day is a combination of presentations, peer panel discussions and open floor Q&A sessions. Welocalize CEO, Smith Yewell hosted both events and at LocLeaders Silicon Valley, he delivered his crystal ball forecast for the industry in his presentation, How To Predict the Future. It which describes his vision around predictive analytics and how it will fundamentally change the localization industry and the way we do business.

Game Changers CoverWe publish the LocLeaders Magazine after each event, which is a widely read publication, containing featured blogs from attendees and articles about localization topics discussed during the day. It is a must read for all localization professionals

READ: LOCLEADERS MAGAZINE SILICON VALLEY 2015: GAME CHANGERS!

READ: LOCLEADERS MAGAZINE BERLIN 2015: ELEVATING RESULTS!

LOCLEADERS LOCAL EVENTS: BOSTON AND DENVER

11008622_10152913810947283_2284498713384384521_nWelocalize launched the LocLeaders Local Forums in 2015, with two events. The inaugural events took place in Boston and Denver. We invited special guests to kick-off the networking and idea exchanges. Nancy Anderson of EMC was our featured guest in Boston. Molly Wendell Smith, who was also featured at our LocLeaders Forum in Berlin, shared valuable tips in building influence at the Denver LocLeaders Local.  Read more about building influence in Molly’s post, How to Build, Influence and Leverage Your Network.

LOCALIZATION WORLD EVENTS: LOCWORLD28 BERLIN AND LOCWORLD29 SILICON VALLEY

IMG_7989The world’s globalization and localization leaders descended on Shanghai, Berlin and Santa Clara to attend Localization World 2015 events and to discuss significant topics affecting the industry and global business. Welocalize always fully supports the Localization World event series, as sponsor and exhibitor. Many of the Welocalize leadership team and Welocalize clients took part in presentations and discussions at all LocWorld events. Smith Yewell, Welocalize CEO, provided the keynote at the Welocalize Silicon Valley event.

Read more: Highlights from LocWorld28 Berlin

Read more: Welocalize Inside LocWorld29 and LocLeaders Silicon Valley 2015

2015 TAUS EVENTS: TAUS QUALITY EVALUATION SUMMIT, INDUSTRY LEADERS FORUM, ROUNDTABLE AND ANNUAL CONFERENCE

Olga TAUS Haus bandYet again, TAUS organized some brilliant events and 2015 was no different. Welocalize attended key TAUS events in Europe and North America.  It was also where stories will be told about the TAUS HAUS Band’s sold out concert!

The TAUS Quality Evaluation Summit took place in Dublin in May and members of the Welocalize Language Tools team presented content on dynamic content and the use of MT technology. At the TAUS Industry Leaders Forum, held in Berlin in June, Smith Yewell, Welocalize CEO and Olga Beregovaya, VP of Language Tools, attended and participated in a number of discussions relating to translation automation. Read more in Olga’s blog, Four Key Highlights from TAUS Industry Leaders Forum 2015.

Welocalize leadership led industry discussions at the TAUS Roundtable, which took place in Washington D.C. in October and the TAUS Annual Conference, San Jose, held in October. The Roundtable discussions defined key strategies for translation in the 21st century and the theme of the TAUS Annual Conference was to drive change and innovation within the localization and translation industry. For more information see TAUS Website.

GLOBALIZATION AND LOCALIZATION ASSOCIATION (GALA), THE LANGUAGE OF BUSINESS CONFERENCE 2015

GALA_logoHeld in Sevilla, Spain, Welocalize shared industry insights at this important GALA industry conference in the presentation, “Localizing for Travel: Diverse Solutions for Diverse Needs”. The presentation generated many fascinating discussions, centered on the use of innovative translation technology, machine translation (MT) and post-edited machine translation (PEMT) to translate low impact content. Click here to read more and download the presentation.

CONTENT MARKETING WORLD

picture perfect cmworld 2015 1 - CopyWelocalize exhibited at the Content Marketing World event, in Cleveland, Ohio, to meet with leading global brand and content marketers. This event was alive with fresh new ideas and techniques for developing compelling marketing content. Welocalize took on the selfie movement and talked about picture perfect content at the “everything orange” event. Conversations centered around localization of content, in all formats and types, to provide a multinational, multilingual customer experience. Welocalize shared insights and best practices for localizing global marketing content through language services and also gave away printed copies of the Welocalize eBook, Reaching Global Audiences: How to Localize Your Marketing Strategy.

BRAND2GLOBAL

brand2global whiteThis year’s Brand2Global event which took place in London this past October. Many leading global brand experts, Welocalize leadership, friends from Traffic Optimiser and clients took part in discussions on how to expand a brand’s international footprint and increase return on content. As sponsor and exhibitor, Welocalize experts were on hand to help attendees to grow their brand globally. www.brand2global.com

NYU SCHOOL OF PROFESSIONAL STUDIES MT EVENT, NEW YORK

Rachel Lord NYU event 2Welocalize and Park IP representatives were honored to be present for an event held by the NYU School of Professional Studies entitled: Machine Translation (MT), CAT (Computer Assisted Translation) Tools and the Shifting Landscape of the Translation Industry. Welocalize Language Tools experts, Elaine O’Curran and Park IP Translations (a Welocalize company), Rachel Lord, joined a panel of six for a discussion on the merits and advantages of Computer Assisted Tools (CAT) tools within the translation workflow.

EAMT CONFERENCE 2015

The Welocalize Language Tools team presented at the EAMT Conference 2015 in beautiful Antalya, Turkey, in May, sharing knowledge of advanced language technology and MT. Olga Beregovaya, Welocalize VP of Language Tools presented “What we want, what we need, what we absolutely can’t do without – an enterprises user’s perspective on MT technology and stuff around it”. Senior Computational Linguist Dave Landan presented “Streamlining Translation Workflows with Welocalize StyleScorer.” Click here to read more about Welocalize involvement in EAMT conference and to view both presentations.

AMTA MT SUMMIT XV, MIAMI, FLORIDA

The 15th Biennial Conference of the Association for Machine Translation in the Americas (AMTA) took place in November in Miami, Florida, USA. It is an international conference that, every two years, brings together researchers, developers and users of MT technology in industry and government. Experts from Welocalize Language Tools team took part and delivered two presentations on MT to attendees of this important MT event:

Presentation by Elaine O’Curran, Language Tool Expert and Program Manager: MT Quality Evaluations: From Test Environment to Production

Presentation by Alex Yanishevsky, Senior Manager: How Much Cake to Eat: The Case for Targeted MT Engines

Read more in the blog Welocalize Spotlights at AMTA MT Summit XV

The best events for Welocalize are when we can meet you, on the road, face-to-face. Welocalize will be out on the road once again in 2016, with exciting events planned throughout the year. Let’s connect! Where will you be in 2016?

Welocalize Language Tools Team Highlights EAMT 2015 Conference

The Welocalize Language Tools Team recently presented at the 2015 EAMT Conference in Antalya, Turkey.  Olga Beregovaya, Welocalize VP of Language Tools and Automation was the invited guest speaker at the conference.  She presented, What we want, what we need, what we absolutely can’t do without – an enterprise user’s perspective on machine translation technology and stuff around it,with the main objective of promoting collaboration between academia and field users. Olga also presented with Welocalize Senior Computational Linguist Dave Landan “Streamlining Translation Workflows with Welocalize StyleScorer, as part of the project and product description poster session.

In this blog, Olga Beregovaya, Dave Landan and Dave Clarke, Principal Engineer for the Language Tools Team, share their insights from the 2015 EAMT Conference.

HIGHLIGHTS FROM OLGA BEREGOVAYA

Olga Beregovaya gives her impressions of EAMT 2015 and highlights her favorite presentations from the user track.

As a global language service provider, the language technology and translation automation strategy is very important. The EAMT conference and associated conferences are excellent forums to attend as the team can share real-life MT production experiences and learn more about the latest innovations and research projects. As always, there were many interesting research papers and posters at EAMT, all delivered by highly-talented colleagues in the field of MT and all describing very innovative and promising approaches.

Welocalize EAMT Poster Presenation 2015I was proud of Welocalize’s own poster presentation, describing work by colleague Dave Landan,  Streamlining Translation Workflows with StyleScorer. Capturing and evaluating the style of both training corpora and target text has traditionally been one of the biggest challenges in the industry. The tool Dave has created allows us to compare style of the input text and the available training data, and build the most relevant MT engine, and also to assess the stylistic consistency of the target text and its adherence to the client’s style guide.

The poster presented by Mārcis Pinnis, Dynamic Terminology Integration Methods in Statistical Machine Translation, was very interesting for the team. Integrating terminology in a linguistically aware way is a major pain point for domain adaptation of SMT engines. Speaking as a program owner, this poster presentation was particularly relevant to our work.

Another very relevant presentation was the paper delivered by Laxström et al, called Content Translation: Computer-assisted translation tool for Wikipedia articles. This presentation talked about a tool created by Wikipedia to promote translation and post-editing of machine-translated articles by Wikipedia users. Community translation is more important for Wikipedia than for any other organization in the world. As content democratization is the key paradigm shift of the modern times, such tools that enable a “casual translator” to contribute and make content available globally have become an essential component of the global content universe.

Finally, Joss Moorkens and Sharon O’Brien presented an excellent poster called Post-Editing Evaluations: Trade-offs between Novice and Professional Participants. Building an efficient and productive supply chain for  post-editing, that would be open to new tools and new ways of working, is an essential component of an LSP MT program success. Joss and Sharon compare the perception of MT output and a new CAT environment by experienced translators and by novice users.

HIGHLIGHTS FROM DAVE LANDAN

Dave Landan, Computational Linguist at Welocalize and EAMT 2015 presenter identified two presentations he found particularly interesting.

This year’s EAMT conference started strong with several interesting talks and papers on a range of topics.  While there were many strong research papers, I would like to mention two that stood out for me. Bruno Pouliquen presented findings on linear interpolation of small, domain-specific models with larger general models. At Welocalize, we hope to try these methods with our own data, and we are optimistic about the possibilities!  The other research paper that stood out for me was by Wäschle and Riezler. This paper presented innovations around using fuzzy matches from monolingual target language documents to improve translations. I am excited about expanding our collaborations with the academic community.

HIGHLIGHTS FROM DAVE CLARKE

Dave Clarke, Principal Engineer at Welocalize is a regular participant at EAMT. One topic that was touched on many times at EAMT 2015 was the evolution of CAT tools and their impact on productivity. He shared the following perspective.

From a technical or tools perspective, the EAMT conference provided considerable insight into how translation tools could and should evolve. One such insight was provided by the best paper award winner, “Assessing linguistically aware fuzzy matching in translation memories,” by Tom Vanallemeersch and Vincent Vandeghinste from the University of Leuven. The algorithms typically used in CAT tools to calculate fuzzy match values from translation memories have little or no linguistic awareness. They are firmly established as stable units in our industry word currency. This paper implemented and tested alternative fuzzy match algorithms that identify potentially useful matches, based on their linguistic similarities. The results were gathered from tests carried out with translation master’s degree students measuring translation time and keystrokes. The results strongly suggest the potential for unlocking further productivity from existing resources.

The other presentation that stood out for me was “Can Translation Memories afford not to use paraphrasing?” by Rohit Gupta, Constantin Orasan, Marcos Zampieri, Mihaela Vela and Josef Van Genabith.

More MT productivity and quality can be achieved with incremental and specialized improvements; however, it will be a cumulative process. Importantly, NLP can drive ‘intelligent’ aids to productivity, including auto-suggest/complete, advanced fuzzy matching and automatic repair and others, within a translator’s working environment. Not all will benefit every user. CAT tool platforms may now evolve so that these innovations can be quickly absorbed into the environment with little cost or effort. This leads to how each translator can maximize their own productivity with the combination of aids that best suits their style of work. We even saw a project from ADAPT in the early stages of developing a platform for CAT tool designers that allows the fast definition and measuring of data during testing of prototype productivity-enhancement functions.

To echo the words of the outgoing EAMT President, Professor Andy Way, it was good to see researchers really getting to grips with specific, known problems. It was encouraging to see more focused work on such errors that we know first-hand to have a particular impact on productivity, for example, improvements in terminology selection, new methods to improve choice of preposition and more. It was also encouraging to see the increase in research presented with supporting data gained from end-user evaluation rather than the automatic evaluation metric staples that have long been the norm. In fact, ‘BLEU scores’ almost, just almost, became a dirty… bi-gram.

“Overall, EAMT 2015 was a great conference, attended by extremely talented people, and we should not forget to mention in beautiful Antalya, Turkey, where the conference was held this year,” Olga Beregovaya.

View Olga Beregovaya’s EAMT presentation, “What we want, what we need, what we absolutely can’t do without – an enterprise user’s perspective on machine translation technology and stuff around it” below.

For more information about Welocalize’s MT program, weMT, click here.

Click the link to see Dave Landan and Olga Beregovaya’s EAMT poster presentation, Streamlining Translation Workflows with StyleScorer: EAMT_POSTER 2015 by Welocalize.

Welocalize EAMT Poster Presenation 2015

Welocalize to Present at 18th European Association for Machine Translation Conference

Frederick, Maryland – May 7, 2015 – Welocalize, global leader in innovative translation and localization solutions, will share industry insight and expertise at the 18th Annual Conference of the European Association for Machine Translation (EAMT) taking place in Antalya, Turkey, May 11-13, 2015, at the WOW Topkapi Palace.

“I am very excited to be taking part as an invited speaker at this year’s EAMT 2015 Conference in Turkey,” said Olga Beregovaya, VP of language tools and automation at Welocalize. “EAMT is an important international conference for the MT community. It is where experts, thought leaders and users of machine translation can meet and share research, findings and new tools to help their language technology strategy.”

Featured Welocalize presentations at the 18th Annual Conference of the European Association for Machine Translation:

  • Welocalize VP of Language Tools and Automation, Olga Beregovaya will deliver her keynote, “What We Want, What We Need, What We Absolutely Can’t Do Without – An Enterprise User’s Perspective on Machine Translation Technology and Stuff Around It” at 9:30 – 10:00am on Tuesday, May 12.
  • Olga Beregovaya along with Welocalize Senior Computational Linguist Dave Landan will be presenting “Streamlining Translation Workflows with Welocalize StyleScorer” as part of the poster project and product description session on Tuesday, May 12.

For more information about the EAMT 2015 conference, visit http://www.eamt2015.org.

About Welocalize – Welocalize, Inc., founded in 1997, offers innovative translation and localization solutions helping global brands to grow and reach audiences around the world in more than 157 languages. Our solutions include global localization management, translation, supply chain management, people sourcing, language services and automation tools including MT, testing and staffing solutions and enterprise translation management technologies. With over 600 employees worldwide, Welocalize maintains offices in the United States, United Kingdom, Germany, Ireland, Italy, Japan and China. www.welocalize.com

EAMT Conference 2014: Welocalize Language Tools Team Overview

Laura CasanellasThe Welocalize Language Tools team attended and presented at the 2014 EAMT Conference in Croatia. In this blog, Laura Casanellas, Welocalize Language Tools Program Manager and presenter at EAMT, provides her highlights and insights from her Welocalize colleagues who took part in the conference.

Just like Trento in 2012 and Nice in 2013, the Welocalize Language Tools Team participated in the Annual Conference of the European Association for Machine Translation (EAMT). The conference took place June 16 – 18 in the city of Dubrovnik, Croatia and four members of the Welocalize Language Tools team attended:

Olga Beregovaya, VP of Language Tools, and Dave Landan, Pre-sales Support Engineer, presented a project poster on “Source Content Analysis and Training Data Selection Impact on an MT-driven Program Design with a Leading LSP.”

Lena Marg, Training Manager and I delivered our presentation “Assumptions, Expectations and Outliers in Post-Editing.”

We take the EAMT conference and associated conferences (International, Asian and American) seriously, as most of the important developments that are currently taking place around machine translation (MT) are presented and followed up in those forums.

As a global language services provider (LSP), Welocalize adds value to the EAMT conference by being able to share real-life MT production experiences, demonstrated through thorough analysis of large and varied quantities of actual data. We are privileged in that we work in a real scenario where some of the new technologies around natural language processing (NLP) and MT can be tested in depth.

EAMT 2014 Poster Presentation WelocalizeIn their poster: Source Content Analysis + Training Data Selection Impact – EAMT POSTER by Welocalize, Olga and Dave stressed the idea of the importance of preparing the training corpus in advance and matching it to the specifics requirements of the content that subsequently will be translated. To give an example, many translation memories come from different projects created at different points in time. They may contain inconsistencies or the sentences in these translation memories can simply be too long or may contain a lot of “noisy” data. They need to be cleaned up before they can be used as engine training assets. Going deeper into the possibilities of automatic data selection and matching it with the source content, Olga and Dave spoke about our suite of analytic applications, divided between proprietary tools like Candidate Scorer, Perplexity Evaluator, StyleScorer and others that are being developed as part of an industry partnership with CNGL: Source Content Profiler and TMT Prime.

Olga Beregovaya’s impressions about the EAMT Conference and Welocalize’s role within it are very positive. “Overall, the great thing about the conference was the applicability of the new generation of academic research in real live production scenarios. Many of the academic talks were relevant for the work on MT adaptation and customization that we do at Welocalize. Today, we need to cover more and more domains and content types so the domain and sub-domain adaptation is becoming the key area of our R&D. This means that we benefit greatly from academic and field research around data acquisition for training SMT systems and the relatively new developments around using terminology databases to augment the SMT training data. Not all of our clients come to us with their legacy translation memories, and while there is some public corpora available, we still need to rely on us acquiring and aligning data ourselves.”

Dave found two presentations he attended particularly interesting that focused on common pain points within the industry. “The challenges of using MT with morphologically rich languages are well-known, and we were happy to see interesting research in possible ways to overcome those challenges. We also found a talk on gathering training data from the web very interesting. The presenters discussed using general and specific data to train separate engines which could be weighted and combined to give improved results in cases of sparse in-domain training data. Indeed there were several innovations from academia that we are looking forward to incorporating into our bleeding-edge MT tools and processes.”

In our presentation, Lena and I focused on different challenges in a real MT production scenario: the necessity of forecasting future post-editing effort, with an emphasis on post-editors behavior, and their personal and cultural circumstances, as an important variable of the MT + PE equation. As part of a large LSP, we have been able to gather large amount of data and focus on the quality of a number of MT outputs related to different languages and content types. Our presentation elaborated on our findings around correlations between different types of evaluation methods (automatic scoring, human evaluations and productivity tests). We obtained interesting findings around the adequacy score in our human evaluation tests and the productivity gains contained on the post-editing effort. We will continue gathering data and investigating around this area.

Another topic that was touched upon during the conference was the area of quality. Lena and Olga both shared their perspectives:

“After closely following the QTLaunchpad project for several months, it was particularly interesting to see and discuss results from their error annotation exercises using MQM earlier in the year. Welocalize took part in these exercises by providing data and annotator resources. The findings of this exercise are contributing to further advances both in quality estimation and quality evaluation, fine-tuning metrics further for better inner-annotator agreement, etc. These discussions also provided some immediate take-aways for our approach to evaluation.” – Lena Marg

“The other area of high relevance to us is Quality Evaluation. Again, it is great to see so many research projects dealing with predicting MT quality and utility. While it still may be challenging to deploy such quality estimation systems in-production as various CAT tools and TMS systems have their own constraints around metadata-driven workflows, it is very encouraging to know that this research is available.” – Olga Beregovaya

“A general theme of the EAMT Conference was the question of how to increase cooperation between the translation and the MT research community. In this context, Jost Zetsche’s keynote speech was important in pointing out that translators should take an active interest in providing constructive feedback on MT and on how they work, to ensure new advances in MT developments are truly benefiting them. And yet, with the presence of some interested freelance translators, translation study researchers and a handful of LSPs presenting on MT, it would seem that progress has already been made in bringing the two sides together.” – Lena Marg

Stradun la nuit_dubrovnikThe EAMT Conference was a great opportunity to meet professionals, academics and researchers who work in the field of MT. The Welocalize team members were able to exchange ideas around the current pressing challenges surrounding MT technology and we still had time to admire the beautiful surroundings of historical Dubrovnik.

Laura
Laura.casanellas@welocalize.com
Laura Casanellas is program manager on the Welocalize Language Tools team.

Welocalize to Present at 17th Annual European Association for Machine Translation Conference

FREDERICK, MD – June 16, 2014 – Welocalize, global leader in innovative translation and localization solutions, will share industry insight and expertise at the 17th Annual Conference of the European Association for Machine Translation (EAMT) taking place in Dubrovnik, Croatia, June 16 – 18, 2014.

Senior members of the Welocalize Language Tools Team will be taking part in a number of presentations and discussions related to machine translation (MT) and automation at this year’s EAMT conference.

“Welocalize is excited to participate at this year’s EAMT 2014 Conference in Dubrovnik,” said Olga Beregovaya, VP of language tools and automation at Welocalize. “As more content is created every day, the demands for language services related to machine translation deployments is growing exponentially. EAMT is an important international conference where thought leaders and experts in machine translation can collaborate through shared research and innovations to advance our industry and meet the escalating demands.”

Featured Welocalize presentations at EAMT 2014:

For more information about the EAMT 2014 conference, visit http://hnk.ffzg.hr/eamt2014/papers.html and to find out more general information about EAMT, visit http://www.eamt.org.

About Welocalize – Welocalize, Inc., founded in 1997, offers innovative translation and localization solutions helping global brands to grow and reach audiences around the world in more than 125 languages. Our solutions include global localization management, translation, supply chain management, people sourcing, language services and automation tools including MT, testing and staffing solutions and enterprise translation management technologies. With over 600 employees worldwide, Welocalize maintains offices in the United States, UK, Germany, Ireland, Japan and China. www.welocalize.com