When non-organic (man-made) fabric was introduced into fashion, there were a number of harsh warnings about using polyester and man-made synthetic fibres, including their flammability.
In creating non-organic data sets, should we also be creating warnings on their use and flammability? Let’s look at why synthetic data is used in industries such as Financial Services, Automotive as well as for new product development in Manufacturing.
Synthetic Data Defined
Synthetic data can be defined as data that is artificially developed rather than being generated by actual interactions. It is often created with the help of algorithms and is used for a wide range of activities, including as test data for new products and tools, for model validation, and in AI model training. Synthetic data is a type of data augmentation which involves creating new and representative data.
Why is it used?
The main reasons why synthetic data is used instead of real data are cost, privacy, and testing. Let’s look at more specifics on this:
- Data privacy. When privacy requirements limit data availability or how it can be used. For example, in Financial Services where restrictions around data usage and customer privacy are particularly limiting, companies are starting to use synthetic data to help them identify and eliminate bias in how they treat customers – without contravening data privacy regulations.
- Data availability. When the data needed for testing a product does not exist or is not available to the testers. This is often the case for new releases.
- Data for testing. When training data is needed for machine learning algorithms. However, in many instances, such as in the case of autonomous vehicles, the data is expensive to generate in real life.
- Training across third parties using cloud. When moving private data to cloud infrastructures involves security and compliance risks. Moving synthetic versions of sensitive data to the cloud can enable organisations to share data sets with third parties for training across cloud infrastructures.
- Data cost. Producing synthetic data through a generative model is significantly more cost-effective and efficient than collecting real-world data. With synthetic data, it becomes cheaper and faster to produce new data once the generative model is set up.
Why should it cause concern?
If real dataset contains biases, data augmented from it will contain biases, too. So, identification of optimal data augmentation strategy is important.
If the synthetic set doesn’t truly represent the original customer data set, it might contain the wrong buying signals regarding what customers are interested in or are inclined to buy.
Synthetic data also requires some form of output/quality control and internal regulation, specifically in highly regulated industries such as the Financial Services.
Creating incorrect synthetic data also can get a company in hot water with external regulators. For example, if a company created a product that harmed someone or didn’t work as advertised, it could lead to substantial financial penalties and, possibly, closer scrutiny in the future.
Conclusion
Synthetic data allows us to continue developing new and innovative products and solutions when the data necessary to do so wouldn’t otherwise be present or available due to volume, data sensitivity or user privacy challenges. Generating synthetic data comes with the flexibility to adjust its nature and environment as and when required in order to improve the performance of the model to create opportunities to check for outliers and extreme conditions.
Hitachi announced their plans to acquire US based software development company GlobalLogic for an estimated USD 9.6 billion, including debt repayment. The transaction is expected to close by end of July, after which GlobalLogic will function under Hitachi’s Global Digital Holdings.
GlobalLogic was founded in 2000, and the Canada Pension Plan Investment Board and Swiss investment firm Partners Group have 45% of ownership; with the remainder owned by the company’s management.
Hitachi’s Business Portfolio Expansion
The acquisition of GlobalLogic is a part of Hitachi’s move to focus and extend the range of Hitachi’s digital services business. As Hitachi aims to expand from electronics hardware to concentrate on digital services, they are looking to benefit from GlobalLogic’s range of expertise – from chips to cloud services. Silicon Valley-based GlobalLogic has a presence in 14 countries with more than 20,000 employees and 400 active clients in industries including telecommunications, healthcare, technology, finance and automotive. This will also expand Hitachi’s network outside Japan by providing them access to a global customer base and will boost their software and solutions platforms, including Hitachi IoT portfolio and data analytics.
The GlobalLogic deal follows another big acquisition of ABB’s power grid business by Hitachi in July 2020 to focus on clean energy and distributed energy frontiers. This makes Hitachi one of the largest global grid equipment and service providers in all regions.
Hitachi is also planning to divest parts of their portfolio such as Hitachi Metals, their chemical unit and their medical equipment business.
Ecosystm Comments
“Hitachi’s move to acquire GlobalLogic is very interesting and is in line with the growing trend of global Operation Technology (OT) vendors riding the wave of Industry 4.0 and ‘Product as a Service’ models – essentially, to move up the margin ladder with more digital services added on to their already established equipment business. Siemens, Schneider Electric, Panasonic, ABB, Hitachi and Johnson Controls are some of the prominent vendors who have taken pole positions in their respective industry domains, in this race to digitally transform their businesses and business models. Last year, Panasonic made a very similar move, taking a 20% equity stake in Blue Yonder, a leading supply chain software provider.
With rapid advancements in computing and communications (5G), it is now possible to converge the IT (Information Technology supporting enterprise information flows), the OT (Operational Technology – machine level control of the physical equipment), and the ET (Engineering Technology in the Product Design and Development space such as CAD, CAM, PDM etc.) domains. Three worlds that were separate till now. The convergence of these three worlds enables high impact use cases in automation, product, process, and business model innovation in almost all sectors, such as autonomous vehicles, energy efficient buildings, asset tracking and monitoring, and predictive and prescriptive maintenance. For the OT vendors therefore, it becomes critical to acquire IT and ET capabilities to become successful in the new cyber physical world. Most OT vendors are choosing to acquire these capabilities through strategic partnerships (such as Siemens with Atos and SAP; Panasonic with Blue Yonder) or acquisitions (such as Hitachi and GlobalLogic) rather than develop such capabilities organically in completely new domains.“