Informatica Developer: Remove Duplicate Records

Marketplace

Explore Listings
Explore Listings
The Informatica Marketplace is an open platform for quality-vetted, cloud or on-premise data integration solutions.

By Category

Custom Solutions

Workflows

Developer Tools

Mapping & Mapplets

By Products

B2B Data Exchange

BI & Analytics

Cloud Application Integration

Data Governance

Data Quality

Master Data Management

IpaaS Recipies

Discover one-click Integration setup for GenAI apps & business processes
Partners
Partners
Explore how our trusted partners can help drive innovation for your business

Cloud Ecosystem Partners

Speed up cloud initiatives on your preferred hyperscaler

Technology Partners

Expand platform features with cloud data warehouses & ISVs

Global System Integrators

Find a consulting partner to support your next project

Channel Partner

Locate channel partners by region

Become a Partner

Partner Support Form
Informatica.com
More
- Manage your Success Plans and Engagements, gain key insights into your implementation journey, and collaborate with your CSMs
  
  Manage your Success Plans and Engagements, gain key insights into your implementation journey, and collaborate with your CSMs
  
  Customer Experience Accelerators
  
  Accelerate your Purchase to Value by engaging with Informatica for Customer Success
  
  My Engagements
  
  All your Engagements at one place
- A collaborative platform to connect and grow with like-minded Informaticans across the globe
  
  A collaborative platform to connect and grow with like-minded Informaticans across the globe
  
  Product Communities
  
  Connect and collaborate with Informatica experts and champions
  
  Discussions
  
  Have a question? Start a Discussion and get immediate answers you are looking for
  
  User Groups
  
  Customer-organized groups that meet online and in-person. Join today to network, share ideas, and get tips on how to get the most out of Informatica
  
  Get Started
  
  Community Guidelines
- Troubleshooting documents, product guides, how to videos, best practices, and more
  
  Troubleshooting documents, product guides, how to videos, best practices, and more
  
  Knowledge Base
  
  One-stop self-service portal for solutions, FAQs, Whitepapers, How Tos, Videos, and more
  
  Support TV
  
  Video channel for step-by-step instructions to use our products, best practices, troubleshooting tips, and much more
  
  Documentation
  
  Information library of the latest product documents
- Rich resources to help you leverage full capabilities of our products
  
  Rich resources to help you leverage full capabilities of our products
  
  Trainings
  
  Role-based training programs for the best ROI
  
  Certifications
  
  Get certified on Informatica products. Free, Foundation, or Professional
  
  Product Learning Paths
  
  Free and unlimited modules based on your expertise level and journey
  
  Experience Lounge
  
  Self-guided, intuitive experience platform for outcome-focused product capabilities and use cases
- Library of content to help you leverage the best of Informatica products
  
  Library of content to help you leverage the best of Informatica products
  
  Tech Tuesdays Webinars
  
  Most popular webinars on product architecture, best practices, and more
  
  Product Availability Matrix
  
  Product Availability Matrix statements of Informatica products
  
  SupportFlash
  
  Monthly support newsletter
  
  Support Documents
  
  Informatica Support Guide and Statements, Quick Start Guides, and Cloud Product Description Schedule
  
  Product Lifecycle
  
  End of Life statements of Informatica products
  
  Pulse
  
  Monitor the status of your Informatica services across regions
  
  Events
  
  Change Request Tracking
  
  Marketplace
  
  Trust Platform

Informatica Developer: Remove Duplicate Records

Posted by: Sriraman Premkumar

Informatica Developer Client mapping example showing how to remove duplicate rows from the source data.

Data Integration Mappings & Mapplets

Download now

Overview
Features
Resources
Support

Overview

Duplicate records are occasionally found in source data. Due to primary key constraints on a target database, only one version of a duplicate source record should be loaded into the target. This mapping illustrates one alternative for removing duplicate records when the source has a primary key that can be used for grouping.The mapping illustrates the concept of using the functionality within an Aggregator transformation to remove duplicate records from a source and load this data into a target table of the same structure. Implementation Guidelines :

Selects all rows from NIELSEN table which contains duplicate rows. The source rows are ordered by the primary key. e.g. STATE_TAX_ID.
The aggregator after the source qualifier is configured for ?Sorted Input? and the STATE_TAX_ID port is selected as the "Group By" port in the transformation. In general, the number of "Group By" ports must correspond to the "Number of Sorted Ports" indicated in the Source Qualifier.
The Informatica server, by default, returns the last row in a group if no aggregate function is specified. If two records with the same value for STATE_TAX_ID enter the Aggregator, only one record will be returned by the Aggregator. As a result, duplicate source records are eliminated.

Informatica Developer: Remove Duplicate Records

Overview

Features

Resources

Support

Recommended Products