Leading distributor of Chemicals, Life Science and Laboratory Products
Purpose of the project
- To cleanse, classify and enrich product data from the Customer’s Inventory
- To collect, cleanse, classify and enrich product data from competing distributors
- To design and develop a software application for field deployment that allows cross-referencing between Customer items and competing items in the market
Volume
- 1,000,000 Items
Type of items
- Antibodies, Reagents, Bio-Reagents, Laboratory Equipment, Laboratory Supplies, etc.
Process
- All items were classified to the customer taxonomy. Domain experts suggested new categories and attributes for items that did not have a relevant class in the existing schema.
- Data was cleansed and normalized, including Manufacturer Name and Supplier Name.
- Mandatory attribute values such as Manufacturer Name, Supplier Name and Part No. were extracted from Item Descriptions.
- Similarly, product-specific attributes were extracted into templates provided by the customer. Item Descriptions were enriched by visiting Manufacturer websites and sourcing additional information for Customer-distributed items.
- Competitor websites were scraped, and matching Product Data was collected and cleansed.
- Two comparable sets of data were created: Items distributed by the Customer and Items distributed by competitors.
- A software tool based on J2EE architecture was designed, with provision to assign weights to attributes and with comparison logic written to allow cross-referencing between products.
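
The weighted comparison described in the last step can be illustrated with a minimal sketch. It is written in Python for brevity rather than in the J2EE stack the project used, and it is not the actual tool: the attribute names, the weights, the normalization rule and the 0.7 threshold are all assumptions made for illustration.

```python
import re

# Hypothetical attribute weights (integers to avoid rounding surprises).
# The case study only states that attributes could be given weightage.
WEIGHTS = {"manufacturer": 3, "part_no": 5, "pack_size": 2}


def normalize(value):
    """Lower-case a value and drop every non-alphanumeric character.

    Assumed normalization rule, so that 'Sigma-Aldrich' and
    'SIGMA ALDRICH' compare as equal.
    """
    return re.sub(r"[^a-z0-9]", "", str(value).lower())


def match_score(customer_item, competitor_item, weights=WEIGHTS):
    """Return the weighted share (0.0 to 1.0) of attributes that agree."""
    total = sum(weights.values())
    matched = 0
    for attr, weight in weights.items():
        a = customer_item.get(attr)
        b = competitor_item.get(attr)
        # An attribute missing on either side never counts as a match.
        if a is not None and b is not None and normalize(a) == normalize(b):
            matched += weight
    return matched / total if total else 0.0


def cross_reference(customer_item, competitor_items, threshold=0.7):
    """Return (score, item) pairs at or above the threshold, best first."""
    scored = [(match_score(customer_item, c), c) for c in competitor_items]
    hits = [pair for pair in scored if pair[0] >= threshold]
    return sorted(hits, key=lambda pair: pair[0], reverse=True)
```

In this sketch a customer item would be passed to `cross_reference` together with the list of competitor items. Raising the weight of an attribute such as `part_no` makes agreement on that attribute count for more in the final score, which is the effect the weightage provision is described as having.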
Key Success Factors
- Domain Expertise coming from experienced professionals with strong Life Science and Chemical Industry background.
- Dedicated team of 120 professionals processing 80,000 items per month.
- Strong process definition and an on-site project manager.
- Ability to handle large datasets and manage production facilities using Unilog’s proprietary software applications.