NLP Data Engineering Intern

Chicago, IL
Internship
iManage U
Student (College)

What is iManage U?

iManage U provides students the chance to experience a dynamic, rapid growth technology company firsthand. iManage will provide a structured program which delivers project-based activities, improved knowledge of business fundamentals, tackling complex problem solving, collaboration, team building, and some fun experiences along the way!  This year, our paid internship program will kick-off on Monday, June 8th and will run through Thursday, August 13th.  

This internship will be based out of our downtown Chicago office, with activities requiring in-person presence.

Goals of the Program:

  • iM Making An Impact: Leave your mark on your team by owning and completing assigned projects
  • iM A Mentee: Learn from teammates across departments & gain perspectives from a diversity of people
  • iM A Connector: Meet & connect with as many interns and iManage employees as possible
  • iM Inspired: Learn from our leadership team and ask questions during our lunch and learns
  • iM Social: Enjoy intern events, and everything iManage has to offer this summer

Being an NLP Data Engineering intern at iManage means…

You are excited about transforming unstructured text into meaningful insights that power AI and machine learning solutions. You thrive at the intersection of data engineering and natural language processing and are eager to contribute to the pipelines and datasets that fuel generative AI applications, agentic systems, and other NLP-driven capabilities across iManage.

As an NLP Data Engineering Intern on the AI and knowledge engineering team, you will get hands-on experience designing, building, and optimizing text data pipelines that power AI/ML and Generative AI solutions for our customers. You’ll collaborate with knowledge engineering, applied AI, and product teams to help prepare, enrich, and integrate document data. Your contributions will be essential to enabling intelligent, AI-powered features across the iManage platform.

iM Responsible For…

  • Performing exploratory analyses on large text corpora and developing preprocessing pipelines for training and evaluation data
  • Supporting the design of automated workflows for text normalization, deduplication, language identification, PII redaction, and metadata enrichment
  • Assisting with building automated data validation processes to ensure accuracy and consistency of NLP datasets
  • Contributing to dataset curation, prompt dataset preparation, labeling coordination, and text quality validation to support model fine-tuning, semantic search, and Gen AI evaluations
  • Partnering with the Applied AI team to understand data requirements and help build data interfaces for machine learning systems
  • Learning and applying data lineage best practices and data privacy, security, and governance principles
  • Maintaining highest quality standards through processes that identify and correct mistakes and inconsistencies
iM Qualified Because I have…
  • Current enrollment in a Master’s, or PhD program in Computer Science, Data Engineering, Data Science, Applied Mathematics, Computational Linguistics, or a related quantitative field
  • Proficiency in Python and experience using it to extract, structure, classify, and analyze text data
  • Foundational understanding of NLP concepts such as tokenization, embeddings, and semantic search
  • Familiarity with standard NLP libraries such as SpaCy, HuggingFace Datasets, or NLTK
  • Solid knowledge of data structures, algorithms, and statistics
  • Proficiency with Git and collaborative development workflows
  • A passion to learn and improve, and an eagerness to share knowledge with colleagues
  • Problem-solving, creativity, curiosity, and a collaborative mindset
Bonus points if you have..
  • Exposure to Microsoft Azure services such as Fabric, ADLS, AI Foundry, or Azure ML
  • Experience with data pipeline orchestration or workflow automation tools like Databricks
  • Familiarity with knowledge graphs or semantic data modeling

Don't meet every qualification listed above? Studies show that women and people of color are less likely to apply to jobs unless they meet all qualifications. At iManage, we are committed to building a diverse and inclusive environment, and encourage everyone to show up as their full authentic selves. We welcome those that come with a growth mindset and a hunger for learning; so, if you are excited about this role but your past experience doesn't align perfectly with every qualification we encourage you to apply anyways!  

About iManage

iManage is dedicated to Making Knowledge WorkTM.  Over one million professionals across 65+ countries rely on our intelligent, cloud-enabled, secure knowledge work platform to uncover and activate the knowledge that exists inside their business content and communications.   
We are continuously innovating to solve the most complex professional challenges and enable better business outcomes; Our work is not always easy but it is ambitious and rewarding.  

So we’re looking for people who love a challenge. People who are happiest when they’re solving problems and collaborating with the industry’s best and brightest. That’s the iManage way. It’s how we do things that might appear impossible. How we develop our employees’ strengths and unlock their potential. How we find meaning in everything we do.  

Whoever you are, whatever you do, however you work. Make it mean something at iManage.

Learn more at: www.imanage.com     

Please see our privacy statement for more information on how we handle your personal data: https://imanage.com/privacy-policy/     

 #LI-DNI

Share

Apply for this position

Required*
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Human Check*