The Network, Not the Node
How Data and Analytics are Critical to Action Against Illicit Finance
📅 June 5, 2025
📅 June 5, 2025
“Illicit actors’ objective is to dilute the capability to identify the sources of their funds. Given we know that they operate as part of a network, we need to target networks rather than individual accounts. That’s how you can actually move the needle to disrupt criminal enterprises and take effective action. This is the objective we must have in mind when we are implementing data analytics and the advanced technology that builds on it.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
This article draws key observations and findings from a conversation between Seb Drave, Director of Customer Solutions and Innovation at Harbr, which has a mission to build next-generation data businesses, and the Institute for Financial Integrity’s Associate Managing Director, Catherine Woods.
“Data analytics” is a field that encompasses multiple disciplines focusing on leveraging data to make the most of the insights that can be gained. Previously referred to using terms such as exploratory data analysis, data mining, or applied statistics, more recently the discipline has come to be referred to as data science. This can be attributed to the integration of research and development approaches and experience to the enterprise workforce, and the recognition that as data analytics techniques and technology have advanced, they enable organizations to use mathematically and statistically-driven methods to gain increasingly accurate insights from less complete data. Data analytics intersect with machine learning and artificial intelligence including generative AI.
“You can’t predict what you don’t have experience of, and you can’t explain and understand things you don’t measure.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
An organization that implements a data-driven approach to its business ensures it has situational awareness of what is happening within that business and in the wider market. Well-designed models built on appropriate and well-structured data can be used to advance workflows, optimize products and services, improve client interactions, and enhance decision-making. Without them, organizations are more likely to be reactive, less efficient with their resources, and only able to resolve issues after they have occurred – rather than gain early insights that can be actioned sooner.
In the financial crime context, data analytics enhance the ability to detect and take action against illicit activity. They also enable productivity gains and an increase in effectiveness. For example, by minimizing the number of false positive alerts, they reduce the resources required for validation and assessment. Commercial benefits can also be achieved too. For example, data analytics can be used to identify which vendors or suppliers present the most risk, are most cost-effective or provide the best performance outcomes.
“If you’re not capturing data and using that to make your decisions, then your business decisions and processes are being run on perceptions and best guesses. It’s like shooting in the dark: occasionally you might hit something, but a lot of the time you won’t. In an environment where resources are constrained and there’s competition for market share, you want to make sure that you are getting the maximum from the resource that you have and doing the best that you can for your clients.”
– Catherine Woods, Associate Managing Director, Institute for Financial Integrity
An organization seeking to implement a data strategy and analytical capacity can take the following actions to maximize successful outcomes.
The first step is to establish a data strategy. This includes both technical requirements – such as data architecture and security – as well as governance and organizational elements – such as roles, responsibilities, and processes for managing data. The strategy ensures data is managed consistently and appropriately throughout the organization in accordance with its strategic objectives.
In alignment with the strategy, data should be captured, monitored, and evaluated to inform and optimize operational processes, products and client interactions, and decision-making, to name just a few potential applications.
To gain support for data initiatives, it is most effective to identify a specific problem that can be solved use data-centric methodologies. Resources can then be focused to acquire the necessary data, apply relevant analytics, then demonstrate a quantifiable improvement in the original process or problem. Rapidly demonstrating measurable improvement over current operations will encourage support and adoption by the rest of the organization.
In contrast, many data initiatives begin by attempting to acquire and integrate all organizational data sources, an approach which is resource intensive, lengthy, and complex – and where measurable benefits are not evident for extended periods.
“There is undoubtedly information outside your organization that will make your capabilities more powerful. But there’s also a lot of information that may make your capabilities more prone to noise and if used in an unconstrained manner, they may overwhelm your systems, processes, and the people working through this.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
While internal data is a key data resource, it can be enhanced by well-considered selection and integration of external data sources. Well-established examples of external sources used within financial crime compliance for due diligence and investigations include watchlists of adverse news, sanctions lists, and corporate registry information.
“The greatest and most complete view of the world will come with organization’s sharing and working together both at the operational level and potentially the data level as well. When anyone in one of those organizations is thinking about what is the most valuable external data, I wouldn’t necessarily limit that to commercial providers of certain types of information or particular public sources. Potentially, your peer group organizations are also going to have very kind of valuable resources from the industry as a whole.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
In some contexts – financial crime being a key example – some the most valuable data is held in organizations where there are significant restrictions on sharing information. For example, governments and law enforcement hold information about investigations and networks, while financial institutions hold details on client profiles and suspicious activities. While restrictions on sharing are imposed for valid reasons such as data privacy, the effect is that each organization has a more limited view of the problem set and usually only has visibility of part of an end-to-end transactions flow or illicit network. In the context of data analytics, this means that each institution has fewer data points to train its models, with a particular shortage of “confirmed positives” such as validated suspicious activities.
To improve data capabilities, organizations should identify what data can be shared and how to balance regulatory and internal policy objectives. This may include incremental sharing of data points, starting with less sensitive information or attributes, or it may include using technology solutions to maintain confidentiality and ensure that each organization maintains custody of its own data.
By finding solutions, the performance of participating institutions can be enhanced, and most importantly so can effective action against financial crime.
Deep Dive: Privacy-Enhancing Technologies and Operational Checks and Balances
“Hopefully as an organization your controls are working effectively and during onboarding, you are preventing businesses and individuals from gaining access to your organization and misusing your products and services. If that’s working well, it means confirmed cases of actual illicit activity will be quite small. This has implications when training a model to ensure that the limited dataset does not impact its accuracy and reliability.”
– Catherine Woods, Associate Managing Director, Institute for Financial Integrity
To effectively train models, for example to enable machine learning, a set of validated outcomes is required. In many use cases, these validated outcomes are plentiful. Examples include a hospital using large volumes of patient records to enhance models to diagnose diseases, or an airline using aircraft sensors and service records to improve fleet maintenance schedules.
However, the financial crime context is different and validated outcomes are scarce. If a model is being trained to identify activity indicative of financial crime, the dataset of “confirmed positives” may be based on suspicious activity reports. However, within an institution, these known positives are rare and there may be systematic biases (such as defensive filings of SARs/STRs). The result may be that there may be insufficient data to train an accurate model, or that the model’s learning may be skewed by the few examples that are available.
While data sharing between institutions provides one way to augment these datasets (described above), another solution is “synthetic data”. A carefully constructed synthetic dataset can augment real data to enhance model development. Firstly, it can “boost the signal”, referred to as “up sample”, the amount of financial crime activity within the system, while still having a large population of normal transactional patterns. Secondly, synthetic data can enable a feedback loop of build-train-validate, which can be difficult to do in the real world where confirmed positives are infrequent and may not always be representative.
With a more comprehensive and well-constructed dataset, models can be trained for greater accuracy and effectiveness, enhancing confidence in their use.
Deep Dive: Synthetic Data
Deep Dive: Regulators Encouraging Innovations Such as Synthetic Data
“Data democratization is one of the key elements that allows you to foster a multidisciplinary ethos within a team… It also enables more technical members of a team to build, test, and deploy repeatable capabilities for use by other members of the team.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
Data democratization refers to providing the skills and tools to enable data consumers, such as financial crime investigators, to more effectively interact with and access data. In a siloed model, capabilities and access to extract data and meaning from it are held by specialist data scientists, working in separate teams from those using the data to achieve operational outcomes. This can be described as a separation between “deployment and production” and “consumption and outcomes”. If more staff can be provisioned with and confidently utilize more advanced analytical capabilities, there will be a greater return on investment.
“This is a partly about increasing the kind of capability of the investigators and analysts, but also making the data more accessible so that they are more effectively able to access it directly and run queries… As an investigator or intelligence analyst, that would be extremely valuable because when you are following a lead, being able to query a data set directly rather than to refer it to a separate team and then it comes back and then you ask another question, would increase the effectiveness of everyone involved. And of course, the other advantage is that you have a common data picture and potentially the scope to more effectively feedback the outcomes that you’re identifying, rather than them being captured in a spreadsheet on a team share drive somewhere.”
– Catherine Woods, Associate Managing Director, Institute for Financial Integrity
Deep Dive: Data Democratization
“You could ask AI to generate an investigative case summary for someone who is not a financial criminal in any way. And the AI could build an incredibly compelling view of the world that you would read, but ultimately it would be totally false and not rooted in reality. From the point of view of anti-money laundering and counter terrorist financing, utilizing those AI capabilities in a very controlled way, ensuring that only the appropriate kind of the appropriate context is fed to those models, is necessary.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
In the context of financial crime investigations, use of technologies such as machine learning and other artificial intelligence, must be carefully considered. Over recent years, we’ve seen the development of technologies such as data analytics and data science, graph analytics, synthetic data, and technologies to enable sharing while protecting privacy and custody. In the next few years, we are likely to see these different capabilities unifying together to enable rapid progress towards solutions which are more powerful and more targeted at the problems they intend to solve. In the financial crime context, these will empower intelligence functions to execute the mission in a way that hasn’t previously been possible, and at scale.
Deep Dive: AI Agents
Humans will continue to fulfil a critical role, particularly in contexts where the consequences of inaccuracy are so significant. They will train models, validate their performance, and tune their outputs. Humans will evaluate the results of the models – such as potentially suspicious findings – and ensure they stand up to scrutiny, before taking action on real customers with real lives.
“Even with machine learning, it is incredibly important to have the human in the loop. It is especially important in the financial crime use case, and that will continue to be the case even with generative AI because of the high stakes, the nature of the problem space, and the kinds of determination you’re making around members of your client base.”
– Seb Drave, Director of Customer Solutions and Innovation, Harbr
Author
Catherine Woods is an Associate Managing Director at the Institute for Financial Integrity where she leads content development on countering narcotics and fentanyl trafficking, illicit procurement networks and export controls, emerging technologies including digital assets, and other financial integrity domains. For more information about our courses and services, please contact info@finintegrity.org.
This site uses cookies. By continuing to browse the site, you are agreeing to our use of cookies.
Accept settingsHide notification onlySettingsWe may request cookies to be set on your device. We use cookies to let us know when you visit our websites, how you interact with us, to enrich your user experience, and to customize your relationship with our website.
Click on the different category headings to find out more. You can also change some of your preferences. Note that blocking some types of cookies may impact your experience on our websites and the services we are able to offer.
These cookies are strictly necessary to provide you with services available through our website and to use some of its features.
Because these cookies are strictly necessary to deliver the website, refusing them will have impact how our site functions. You always can block or delete cookies by changing your browser settings and force blocking all cookies on this website. But this will always prompt you to accept/refuse cookies when revisiting our site.
We fully respect if you want to refuse cookies but to avoid asking you again and again kindly allow us to store a cookie for that. You are free to opt out any time or opt in for other cookies to get a better experience. If you refuse cookies we will remove all set cookies in our domain.
We provide you with a list of stored cookies on your computer in our domain so you can check what we stored. Due to security reasons we are not able to show or modify cookies from other domains. You can check these in your browser security settings.
These cookies collect information that is used either in aggregate form to help us understand how our website is being used or how effective our marketing campaigns are, or to help us customize our website and application for you in order to enhance your experience.
If you do not want that we track your visit to our site you can disable tracking in your browser here:
We also use different external services like Google Webfonts, Google Maps, and external Video providers. Since these providers may collect personal data like your IP address we allow you to block them here. Please be aware that this might heavily reduce the functionality and appearance of our site. Changes will take effect once you reload the page.
Google Webfont Settings:
Google Map Settings:
Google reCaptcha Settings:
Vimeo and Youtube video embeds:
You can read about our cookies and privacy settings in detail on our Privacy Policy Page.
Privacy Policy