Paxata is a privately owned software company headquartered in Redwood City, California. It develops self-service data preparation software that gets data ready for data analytics software. Paxata's software is intended for business analysts, as opposed to technical staff. It is used to combine data from different sources, then check it for data quality issues, such as duplicates and outliers. Algorithms and machine learning automate certain aspects of data preparation and users work with the software through a user-interface similar to Excel spreadsheets.

Paxata
Company typePrivate[1]
IndustryEnterprise analytics software
FoundedJanuary 2012; 12 years ago (2012-01)
Headquarters
Redwood City, CA[2]
Area served
Worldwide
Key people
  • Prakash Nanduri (Co-Founder & CEO)[3]
  • Dave Brewster (Co-Founder & CTO)[3]
  • Chris Maddox (Co-Founder & VP of Business Development)[3]
  • Nenshad Bardoliwalla (Co-Founder & Chief Product Officer)[3]
ProductsThe Paxata suite of self-service data preparation software

The company was founded in January 2012 and operated in stealth mode until October 2013. It received more than $10 million in venture funding before being acquired by DataRobot.[4][5]

History

edit

Paxata was founded in January 2012.[6] It initially raised $2 million in venture capital.[7] The company came out of stealth mode in October 2013.[6] Simultaneously with its public release, Paxata announced an $8 million funding round led by Accel Partners.[6][8] Adoption of the software grew quickly.[6][9] In March 2014, In-Q-Tel acquired an interest in the startup.[10] It raised an additional $18 million in funding in September 2015.[11] It also began working with Cisco to jointly develop the Cisco Data Preparation suite of software and services.[12]

Software

edit

Paxata refers to its suite of cloud-based data quality, integration, enrichment and governance products as "Adaptive Data Preparation."[8][13][14][15] The software is intended for business analysts, who need to combine data from a variety of sources, then check the data for duplicates, empty fields, outliers, trends and integrity issues before conducting analysis or visualization in a third-party software tool.[15][16] It uses algorithms and machine-learning to automate certain aspects of data preparation.[15][17] For example, it may automatically detect records belonging to the same person or address, even if the information is formatted differently in each record in different data sets.[17][18]

The software has a spreadsheet-based user interface.[15][18] Patterns and anomalies in the data are color-coded in the spreadsheet. Then users are provided with instructions on how to resolve data quality issues or to supplement the data with contextual information.[14] Data sets and related quality issues can also be addressed in a collaborative environment through the "Paxata Share" feature.[18] It runs on Apache Spark.[11][19]

According to analyst firm Ovum, the software is made possible through advances in predictive analytics, machine learning and the NoSQL data caching methodology.[15] The software uses semantic algorithms to understand the meaning of a data table's columns and pattern recognition algorithms to find potential duplicates in a data-set.[15][7] It also uses indexing, text pattern recognition and other technologies traditionally found in social media and search software.[20] One of the software's users is dairy producer Danone, which uses the software so that business staff can create their own reports on merchandising, supply chain and product data, without the IT department.[21]

 
Paxata's spreadsheet-based user interface

Reception

edit

In its 2014 report "Cool Vendors in Data Integration and Data Quality", Gartner praised Paxata for developing a "business-user-friendly" data quality product that does not use code.[14] Ventana Research said its spreadsheet-based user interface "should resonate well with business analysts," who are resistant to move away from familiar Excel-like programs.[18] Gartner also said Paxata was recognized in the report due to its automated, algorithm-based features and how it tracks any changes made to the data.[14]

Ventana Research said Paxata was in a "noisy marketplace".[18] According to Gartner, while Paxata is an early entrant into the market, many startups and large corporations are making investments in developing similar competing products.[14] According to Gigaom and IT Business Edge, one way Paxata differs is that it automatically merges multiple data-sets into a single table, so it can be easily imported into a visualization or analysis tool.[7][22]

Gartner said Paxata will have a difficult time finding a compelling pricing model, when many data discovery tools that it supplements provide some similar features.[14] In contrast, Ventana said Paxata's pricing was "a pretty small amount" compared to the amount of time users can save.[18]

References

edit
  1. ^ "Paxata: Company Profile". Bloomberg L.P. Retrieved September 28, 2014.
  2. ^ "Contact us". Paxata. Archived from the original on October 27, 2014. Retrieved September 28, 2014.
  3. ^ a b c d "Paxata Leadership". Paxata. Archived from the original on August 12, 2014. Retrieved September 28, 2014.
  4. ^ Vizard, Michael (2019-12-19). "DataRobot Acquires Paxata to Extend AI Platform". RTInsights. Retrieved 2020-03-18.
  5. ^ "DataRobot is acquiring Paxata to add data prep to machine learning platform". TechCrunch. 12 December 2019. Retrieved 2020-03-18.
  6. ^ a b c d Blattberg, Eric (October 28, 2013). "Paxata grabs $8M to help data scientists skip the dirty work". VentureBeat. Retrieved June 19, 2014.
  7. ^ a b c Harris, Derrick (October 28, 2013). "With $10M from Accel, Paxata wants to make data prep a breeze". Gigaom. Retrieved June 19, 2014.
  8. ^ a b Woodie, Alex (October 28, 2013). "Paxata Debuts Data Quality Tools at Strata". Datanami. Retrieved June 19, 2014.
  9. ^ McStravick, Alan (February 12, 2014). "Paxata: streamlining data analytics". SiliconAngle. The SiliconAngle Network. Retrieved June 19, 2014.
  10. ^ Wait, Patience (March 7, 2014). "In-Q-Tel Invests in Data-Prep Platform Paxata". InformationWeek. UBM Tech. Retrieved June 19, 2014.
  11. ^ a b Harris, Derrick (September 9, 2015). "This startup raised $18 million to make data analysis less of a chore". Fortune. Retrieved October 13, 2015.
  12. ^ "Cisco Makes Move Into Data Preparation Space". eWeek.com. September 30, 2015. Retrieved October 13, 2015.
  13. ^ Forrest, Conner (March 4, 2014). "Startup Paxata automates the dirty work of big data". TechRepublic. CBS Interactive. Retrieved June 26, 2014.
  14. ^ a b c d e f Thoo, Eric; Friedman, Ted; Judah, Saul; Sallam, Rita L.; Edjlali, Roxane (April 24, 2014). "Cool Vendors in Data Integration and Data". Gartner. Retrieved June 19, 2014.
  15. ^ a b c d e f Baer, Tony (October 28, 2013). "Paxata puts a business-user face on data preparation". Ovum. Archived from the original on January 12, 2015. Retrieved June 19, 2014.
  16. ^ Baer, Tony (December 13, 2013). "On the Radar: Paxata". Ovum. Archived from the original on January 12, 2015. Retrieved June 13, 2014.
  17. ^ a b Fitzgerald, Michael (February 11, 2014). "Is Your Company Running a Data Dump?". InformationWeek. UBM Tech. Retrieved June 19, 2014.
  18. ^ a b c d e f Cosentino, Tony (January 29, 2014). "Paxata Give Analysts Valuable Time Back for Analytics". Ventana Research. Archived from the original on June 20, 2014. Retrieved June 19, 2014.
  19. ^ "Paxata Applies Data Governance Controls to Big Data". IT Business Edge. April 23, 2015. Retrieved August 20, 2015.
  20. ^ Woodie, Alex (January 24, 2014). "Automating the Pain Out of Big Data Transformation". Datanami. Retrieved June 19, 2014.
  21. ^ Feretic, Eileen (March 26, 2014). "Dannon Speeds Up Data Preparation and Analysis". Baseline. Bradbourne Publishing. Retrieved June 22, 2014.
  22. ^ Vizard, Mike (November 27, 2013). "Paxata Rises to the Challenge of Big Data Preparation". IT Business Edge. QuinStreet. Retrieved June 19, 2014.
edit
  NODES
Note 1
USERS 4