IBM Data Science Community: Text Extensions for Pandas

Loading Events
  • This event has passed.

IBM Data Science Community: Text Extensions for Pandas

February 24 @ 2:00 pm - 3:00 pm EST

💬 Text Extensions for Pandas

Most areas of Python data science have standardized Pandas DataFrames for representing and manipulating structured data in memory. Natural Language Processing, though, not so much. In this presentation, we’ll explain why you should be using Pandas for NLP. Pandas DataFrames make every phase of NLP easier, from creating new models to evaluating their effectiveness to building applications that integrate those models. We’ll talk about our open source library, Text Extensions for Pandas, which adds special data types and library integrations specifically geared to NLP use cases. We’ll also explain how these extensions connect to some basic NLP concepts, and then we’ll finish with an example of using Pandas to build an NLP application.

Learn more:

👨🏻‍💻 Speaker bio

Fred Reiss is a Principal Research Staff Member at IBM Research and Chief Architect at IBM’s Center for Open-Source Data and AI Technologies (CODAIT). He is also one of the authors of the Text Extensions for Pandas library. Fred received his Ph.D. from U.C. Berkeley in 2006 and immediately IBM Research, joining the CODAIT center in 2015. Fred has written multiple peer-reviewed papers in the areas of natural language processing, database systems, and machine learning.

🦄 Join your local IBM Community meetup team

Want to get involved in your local IBM Community meetup? Join our Slack:

💻 Dial-in instructions

Join link:

Webinar number: 145 534 7300

Webinar password: GWyKsfBN239 (49957326 from phones)

Join by video system
You can also dial and enter your webinar number.

Join by phone
1-844-531-0958 United States Toll Free
1-669-234-1178 United States Toll
Access code: 145 534 7300