I read an interesting article a while back, and am going to break down some concerns about probability and AI.
Both are critical to discuss as we roll forward into an untamed future where unregulated AI intersects not just with business practices, but with the human condition, and even with potential human existence. Here I'll discuss the use of big data and AI to make decisions about life without the knowledge, regulation, or oversight to do it ethically.
An existential example of this absence is in the field of reproductive technology. AI is being used to evaluate a person’s life before birth. Polygenic screening within IVF treatments is a fascinating study of how unregulated AI, with incomplete data and opaque algorithms, is influencing deeply personal, and truly existential, decisions.
Unregulated IVF
On April 1, 2025, the New York Times published an opinion piece, Should Human Life Be Optimized? It's a fascinating article, starting with the genetic defect her mother suffered from, which spurred her to pursue genetic screening of embryos' DNA for a variety of conditions. From that emotional account of screening for a single genetic switch for blindness, it moves nearly seamlessly into polygenic screening, describing it as a risk profile for conditions such as heart disease.
What is Polygenic Screening, Really?
To understand ethical concerns around polygenic embryo screening, we have to start with what these predictions are actually based on, and how fragile that foundation is.
Polygenic Risk Score (PRS)
Now, if you want to actually look up polygenic screening, you will want to search for Polygenic Risk Score (PRS). A PRS is the sum of the effects of Single Nucleotide Polymorphisms (SNPs), which are small differences in a DNA sequence that vary from person to person. One SNP has minimal impact, but adding many together produces a score that can be used to estimate an individual's risk for a condition1.
This means we are stacking thousands of tiny genetic differences to generate a profile. It’s probabilistic, not predictive; built on patterns, not certainties.
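As a minimal sketch of the idea, a PRS is conceptually just a weighted sum: allele counts multiplied by per-SNP effect sizes. The SNP IDs, weights, and genotype below are invented for illustration; real scores aggregate thousands to millions of variants whose weights are estimated from GWAS data.

```python
# Minimal illustration of a polygenic risk score (PRS) as a weighted sum.
# The SNP names, effect sizes, and genotypes are made up for clarity; real
# scores aggregate thousands to millions of variants from GWAS summary data.

# Effect size (weight) per risk allele, as estimated by an association study
effect_sizes = {"rs0001": 0.12, "rs0002": -0.05, "rs0003": 0.30}

# Number of risk alleles this individual carries at each SNP (0, 1, or 2)
genotype = {"rs0001": 2, "rs0002": 1, "rs0003": 0}

# PRS = sum over SNPs of (allele count x effect size)
prs = sum(genotype[snp] * effect_sizes[snp] for snp in effect_sizes)
print(f"PRS = {prs:.2f}")  # a relative score, not a diagnosis
```

The score only ranks an individual relative to others in the reference data; it says nothing certain about any one person's outcome.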
Single Nucleotide Polymorphisms (SNP)
There are more than 10 million SNPs within the human genome, and their interactions influence a variety of outcomes, like height, tumor risk, or even psychiatric conditions. Because SNPs interact in complex ways, predicting outcomes from them is inherently uncertain. Stacking these millions of SNPs, along with their variations and mutations, alters gene expression in ways that affect how tall you can grow, your likelihood of developing medical conditions, and more.
This is very exciting technology, a huge leap forward in our understanding of how we are made and how all our genes stack to make us whole. But it is in its infancy, and we have over 10 million SNPs. Consider a simplified model with just 10 million independent SNPs, each with 2 possible alleles (versions of a gene): the number of possible combinations is too large to even begin to fathom.
2^10,000,000 ≈ 10^3,010,299
While this formula is a simplification, the point remains: the sheer scale of genetic variation makes prediction practically impossible.
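For readers who want to check that figure, here is the back-of-the-envelope arithmetic, assuming the simplified model above of 10 million independent biallelic SNPs.

```python
import math

# Back-of-the-envelope arithmetic behind the figure above: assume 10 million
# independent SNPs, each with 2 possible alleles (a deliberate simplification).
n_snps = 10_000_000
exponent = math.floor(n_snps * math.log10(2))  # order of magnitude of 2**n_snps
print(f"2^{n_snps:,} ~ 10^{exponent:,}")       # 2^10,000,000 ~ 10^3,010,299
```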
And here’s one more key data point: if you add together the participants of three of the most comprehensive publicly available genome sequencing datasets, gnomAD v4, the UK Biobank, and the 1000 Genomes Project, they collectively represent only about 0.15% of the European population2. That means probabilities are being generated from an incredibly small and incomplete dataset.
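For transparency, here is the rough arithmetic behind that ~0.15% figure, using the approximate participant counts and European-ancestry fractions listed in footnote 2; overlap between the datasets is ignored, so this estimate is, if anything, generous.

```python
# Rough arithmetic behind the ~0.15% figure, using the approximate participant
# counts and European-ancestry fractions given in footnote 2. These are coarse
# estimates, and overlap between datasets is ignored, so treat the result as an
# upper bound on coverage rather than a precise measurement.
datasets = {
    "UK Biobank":   (500_000, 0.95),
    "1000 Genomes": (2_504,   0.24),
    "gnomAD v4":    (800_000, 0.83),
}
european_population = 750_000_000

european_participants = sum(n * frac for n, frac in datasets.values())
coverage = european_participants / european_population
print(f"~{european_participants:,.0f} participants of European descent")
print(f"~{coverage:.2%} of the European population")  # roughly 0.15%
```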
We Don’t Have All the Information Yet.
Clearly, a fundamental problem is that the data being used to build these predictive models is incomplete. SNPs come from limited datasets, and companies are using those incomplete samples to build statistical probabilities about complex human traits. The incomplete foundation makes the outputs, the “predictions”, fundamentally unreliable.
On top of that, the SNPs are then added and combined to create the PRS. The risk score is therefore a set of numbers drawn from one incomplete dataset, combined with relationships drawn from other incomplete datasets, and used to predict complex human traits. Building predictions on incomplete datasets is foundationally unsound.
A well-cited Nature Genetics paper, Common SNPs explain a large proportion of heritability for human height, states that “SNPs identified to date explain only ~5% of the phenotypic variance for height”3. Even for something as measurable as height, our best models still explain only a small fraction; the study focused on height because it is one of the most identifiable, quantifiable, and best understood polygenic traits to date. What does this imply about our ability to predict more complex traits?

If height is one of the best defined and best understood polygenic traits we have, we must accept that combining traits that are even less defined and less understood into an aggregate probability score is risky at best and misleading at worst.
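To see what “explains ~5% of the phenotypic variance” means for a single prediction, here is a minimal sketch. The ~7 cm standard deviation for adult height is my own ballpark assumption, not a figure from the cited papers.

```python
import math

# What "explains ~5% of the phenotypic variance" means in practice for one
# prediction. The ~7 cm standard deviation for adult height is an assumed
# ballpark, not a figure from the cited papers.
height_sd_cm = 7.0
variance_explained = 0.05

# Under a simple linear model, the unexplained spread around the prediction:
residual_sd = height_sd_cm * math.sqrt(1 - variance_explained)
print(f"spread around the genetic prediction: ~{residual_sd:.1f} cm "
      f"(vs ~{height_sd_cm:.1f} cm with no genetic information at all)")
```

In other words, knowing the identified SNPs barely narrows the range of plausible heights.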
Carefully Selected as an Embryo
Returning to the NY Times article, Orchid is described as “… offering what is essentially a risk profile on each embryo’s propensity for conditions such as heart disease, for which the genetic component is far more complex.” This is a very simplified way of describing the SNPs and PRS discussed above4.
The article also states Orchid can screen for conditions like obesity, autism, intellectual ability, and height. However, it glosses over a critical point: the United States has little to no regulation in the IVF field. These PRS-based screenings have made the U.S. a destination for global fertility patients, not because the science is more advanced, but because the regulatory barriers are fewer.
Today, Preimplantation Genetic Testing (PGT) is used in over half of all IVF procedures. Several American companies now offer PGT-P, which incorporates Polygenic Risk Scores. This is happening even though, in adults, the results of these scores remain ethically controversial and scientifically uncertain.
Most other countries have recognized concerns regarding PRS. Many have banned or strictly limited the use of non-medical predictive scores in IVF. Several European nations prohibit the use of polygenic embryo screening for traits like intelligence or psychiatric risk, citing ethical and scientific concerns5. In contrast, the lack of oversight in the United States has fueled a booming and largely unregulated market. It is a market driven by hope, but based on data that does not justify that level of confidence.
Big Data
Continuing with Orchid as our example, we know they run modeling techniques on both partners’ DNA to map inheritance patterns.
Advanced statistical models are used, but they add complexity and reduce transparency for the prospective parent. Unless the parent is already a geneticist or a computer scientist, they are unlikely to understand Orchid’s description of its statistical modeling6, which includes Monte Carlo simulations, Bayesian probability, recombination modeling, and more, to predict embryo outcomes. These aren’t simple calculators; they are big data aggregators running probabilistic scenarios on incomplete datasets. Their complexity challenges any principles of transparency and informed consent that we would want from responsible AI. At the end of this process, hopeful future parents receive an embryo report, allowing them to make an informed decision about which embryo to prioritize for transfer. But are they informed? Given the complexity of the models and the opacity of the language used to describe them, the answer is almost certainly no. Informed consent becomes nearly impossible under these conditions.
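To give a feel for what this kind of modeling involves, here is a toy Monte Carlo sketch that simulates embryo scores from two parents’ genotypes. This is emphatically not Orchid’s pipeline; the SNP count, effect sizes, and genotypes are invented, and recombination, linkage between SNPs, and any Bayesian updating are all omitted.

```python
import random

# Toy Monte Carlo sketch of simulating embryo PRS outcomes from parental
# genotypes. NOT any company's actual pipeline: effect sizes, genotypes, and
# SNP count are invented; recombination, linkage, and Bayesian updating are
# omitted. Each simulated embryo inherits one random allele per SNP from each
# parent, and the inherited risk alleles are weighted and summed into a score.
random.seed(0)
n_snps = 1_000
effect_sizes = [random.gauss(0, 0.05) for _ in range(n_snps)]
mother = [(random.randint(0, 1), random.randint(0, 1)) for _ in range(n_snps)]
father = [(random.randint(0, 1), random.randint(0, 1)) for _ in range(n_snps)]

def simulate_embryo_prs() -> float:
    """One simulated embryo: one random allele from each parent at every SNP."""
    score = 0.0
    for i in range(n_snps):
        inherited = random.choice(mother[i]) + random.choice(father[i])
        score += inherited * effect_sizes[i]
    return score

# Run many simulated embryos to get a distribution of possible scores.
scores = sorted(simulate_embryo_prs() for _ in range(10_000))
print(f"median simulated PRS: {scores[len(scores) // 2]:.2f}")
print(f"middle 90% of outcomes: {scores[500]:.2f} to {scores[9500]:.2f}")
```

Even in this toy version, the output is a spread of possible scores, not a verdict; the real systems layer far more machinery on top of the same fundamentally probabilistic idea.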
This entire process is built on large datasets and intricate modeling methodologies. It’s interesting, and it promises all sorts of great things; but it is all probabilistic. Consider what it means to tell someone to select an embryo that is 50% less likely to develop schizophrenia, when the base risk for the condition is already under 1 percent, the prediction is drawn from data representing less than 0.15 percent of the population, and the models themselves have wide margins of uncertainty.
The numbers may imply precision, but the science and data behind them does not.
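To make that contrast concrete, here is the arithmetic behind the schizophrenia example, a minimal sketch using the roughly 1% baseline stated above.

```python
# The arithmetic behind the "50% less likely" example above. A relative risk
# reduction sounds dramatic, but against a ~1% baseline the absolute change is
# well under a percentage point, and smaller than the model's own uncertainty.
baseline_risk = 0.01          # ~1% lifetime risk, as stated above
relative_reduction = 0.50     # "50% less likely"

selected_risk = baseline_risk * (1 - relative_reduction)
absolute_change = baseline_risk - selected_risk
print(f"baseline risk:   {baseline_risk:.1%}")
print(f"selected embryo: {selected_risk:.1%}")
print(f"absolute change: {absolute_change:.2%}")  # half a percentage point
```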
AI Studies
Several recent studies highlight how artificial intelligence is being used to improve polygenic risk prediction.
- Deep Neural Networks have shown that they can improve estimation of polygenic risk scores for breast cancer.
- Non-linear machine learning models incorporating SNPs and PRS improve polygenic prediction in diverse human populations.
- AI-based multi-PRS models outperform classical single-PRS models.
Beyond the excitement that AI can boost the performance of polygenic prediction, each of these studies uses traits that are among the best known and best defined, such as height, breast cancer risk, and cholesterol. They didn’t all choose identical prediction sets, but each stayed within a set of well defined, well studied traits.
As discussed earlier, even for a trait like height, only about 5% of the variation we observe in people’s height can be explained by the SNPs identified so far. That means even the most advanced tools remain limited. Applying machine learning and AI to these best-known examples allows for more effective testing and training, which leads to more consistent outputs. This success does not automatically carry over to traits that are more complex or poorly understood.
Make no mistake, it’s incredibly exciting that some of these machine learning tools can increase the accuracy of probabilistic risk estimates for breast cancer or high blood pressure, but it’s still just a probability. These studies show AI can improve prediction slightly for well understood conditions, but the models remain dependent on incomplete data from small data groups7. The excitement around AI’s predictive power must be tempered by the reality that these are still just probabilities, not certainties.
Increasing probability is not the same as achieving accuracy, and when considering embryo selection, that distinction matters.
Responsible AI and Global Regulation
All of this brings us back to the ethical foundation of AI use. Responsible AI is supposed to be transparent, fair, inclusive, and accountable8. Yet the use of PRS in embryo selection in the U.S. is none of these. It is opaque, unregulated, built on incomplete data, and applied in deeply personal, high-stakes decisions.
By contrast, the European Union and United Kingdom have laws in place that prohibit the use of polygenic screening in IVF for non-medical traits or poorly validated conditions. They have drawn clear ethical boundaries that can be seen in multiple countries, and from multiple governing bodies.
Conclusion: Building the Tech, Ignoring the Guardrails
What we are seeing are two diverging futures: the U.S. builds the technology, while the EU builds the guardrails. Without urgent alignment between innovation and ethical governance, we risk letting unproven AI systems make decisions that shape human life before it even begins. This reflects a complete abdication of governance. As described in this paper, there is an unreasonable amount of information a person would have to know to be well enough informed to understand these tools and what their outputs mean.
Handing a person who wants to be a parent a sheet of paper with predictions based on incomplete data, and then having them choose which life to pursue? This paper hasn’t even begun to scratch the surface of the deeper ethical questions: what happens when that child isn’t the tallest, the smartest, or whatever trait they were selected for? Do parents raise that child as if they had purchased a pre-made pizza, wondering where the toppings they ordered are? These are probabilistic outcomes applied to a human existence, used to decide whether one cell cluster is of greater value than another. The problems here lie much deeper than the selection or avoidance of potential heart disease.
The USA stands out among countries with advanced in vitro fertilization for its lack of regulations governing PGT-P. Often, its ethical and legal guardrails are shaped by international influences, like the GDPR-inspired California and Colorado privacy laws. Here’s hoping the states start looking at their IVF industries and begin to explore how to support future families.
At some point, we may reconsider whether what is being offered by these clinics is truly “probabilistic” at all. When the data behind these models is so limited, the output becomes less a scientific prediction and more a matter of hope. It is almost certain that legal teams have carefully worded disclaimers to avoid claims of certainty or guarantees, yet it may be inevitable that, at some point, disappointed parents decide the probability of a lawsuit is worth pursuing.
Could the IVF clinics perhaps be curtailed from selling designer babies under false advertising or predatory practices? Nothing is decided, and I don’t have the answer. But I do think we should be asking these questions.
I look forward to watching and learning more as our future unfolds.
Footnotes
- Psychiatry at the Margins ↩︎
- Approximate participant counts in known genome projects that become part of the data lake: UK Biobank (500k), 1000 Genomes Project (2,504), gnomAD v4 (800k). The estimated share of participants of European descent was 95%, 24%, and 83%, respectively. Estimate of European population: 750 million. ↩︎
- National Library of Medicine | National Center for Biotechnology Information | Sizing up human height variation, published in May 2008, and Common SNPs explain a large proportion of heritability for human height. These two articles go into great depth and detail about combining SNPs, finding that even for the most recognized trait (height), common SNPs considered together account for only about 45% of the variance. They also highlight that SNPs, at our current understanding, account for only a small fraction of genetic variation. ↩︎
- US Leadership in AI, p. 8, states that trust requires accuracy, reliability, explainability, objectivity, and more; the use of AI and probabilistic models for PGT-P is missing the mark. ↩︎
- In the UK, the Human Fertilisation and Embryology Authority (HFEA) prohibits the use of PGT-P for non-medical purposes. Similar laws can be found in most EU member states as well. ↩︎
- Orchidhealth.com website page The Science Behind our GRS, which mentions simulations, modeling patterns, statistical computing, and recombination. ↩︎
- Nature | Analysis of polygenic risk score usage and performance in diverse human populations. Most common data sets for PRS and SNPs are of white European descent (67%), East Asian (19%), and others in smaller quantities. ↩︎
- KPMG’s Trusted AI governance approach lists fairness, transparency, explainability, accountability, data integrity, reliability, security, safety, privacy, and sustainability as the principles within its pillars of Trusted AI. Measured against that same set of pillars, PGT-P falls short on at least five of the ten; we haven’t reviewed the other five in this paper. ↩︎
References
Adrien Badre, L. Z. (2023, July 24). arXiv | Quantitative Biology > Quantitative Methods | Deep neural network improves the estimation of polygenic risk scores for breast cancer. https://arxiv.org/abs/2307.13010
Aftab, A. (2024, February 17). Psychiatry at the Margins | Polygenic Embryo Screening and Schizophrenia. https://www.psychiatrymargins.com/p/polygenic-embryo-screening-and-schizophrenia
UK Biobank. (2025). UK Biobank | Whole genome sequencing. https://www.ukbiobank.ac.uk/enable-your-research/about-our-data/genetic-data
Elgart, M., Lyons , G., Romero-Brufau, S., Kurniansyah, N., Brody, J. A., Guo, X., . . . Sofer, T. (2022, August 22). Communications Biology | Non-linear machine learning models incorporating SNPs and PRS improve polygenic prediction in diverse human populations. Retrieved from Communications biology: https://www.nature.com/articles/s42003-022-03812-z
Gabriel Lázaro-Muñoz, P. J. (2023, November 9). ELSI Hub | Screening Embryos for Psychiatric Conditions: Public Perspectives, Ethical and Social Issues. https://elsihub.org/sites/default/files/2025-05/Screening%20Embryos%20for%20Psychiatric%20Conditions_Nov%202023%20version.pdf
Genet, N. (2011, December 6). National Library of Medicine | Common SNPs explain a large proportion of heritability for human height. https://pmc.ncbi.nlm.nih.gov/articles/PMC3232052/
Global Resilience Federation. (2023). The Leadership Guide to Securing AI. https://static1.squarespace.com/static/60ccb2c6d4292542967cece7/t/64de2fcdedf2a93df1177eea/1692282832064/AI+Balancing+Act_DASDesign+FINAL_digital+Secured.pdf
Global Resilience Federation. (2025). Global Resilience Federation | AI Security. https://www.grf.org/ai-security
HHS. (Revised 2024, March 27). Federal Policy for the Protection of Human Subjects (‘Common Rule’). https://www.hhs.gov/ohrp/regulations-and-policy/regulations/common-rule/index.html?utm_source=chatgpt.com
Human Fertilisation & Embryology Authority. (2025, April 15). Embryo testing and treatments for disease. https://www.hfea.gov.uk/treatments/embryo-testing-and-treatments-for-disease
IGSR: International Genome Sample Resource. (2025). How many individuals have been sequenced in IGSR projects. https://www.internationalgenome.org/faq/how-many-individuals-have-been-sequenced-in-igsr-projects-and-how-were-they-selected/
Jan Henric Klau, C. M. (2023, June 26). Frontiers | AI – based multi-PRS models outperform classical single-PRS models. https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2023.1217860/full
Katherine Chao, g. P. (n.d.). gnomAD v4.0. https://gnomad.broadinstitute.org/news/2023-11-gnomad-v4-0/
KPMG. (2023, December). KPMG Trusted AI Approach. https://assets.kpmg.com/content/dam/kpmgsites/xx/pdf/2023/12/kpmg-trusted-ai-approach.pdf?v=latest
L. Duncan, H. S. (2019, July 25). Nature Communications | Analysis of polygenic risk score usage and performance in diverse human populations. https://www.nature.com/articles/s41467-019-11112-0
Logan, J. (2022, September 2). Mad in America | Genetic Embryo Screening for Psychiatric Risk Not Supported by Evidence, Ethically Questionable. Mad in America | Science, Psychiatry and Social Justice: https://www.madinamerica.com/2022/09/genetic-screening-ethically-questionable/
Merriam Webster Dictionary. (2025, May 4). Merriam Webster | Dictionary | Morals. https://www.merriam-webster.com/dictionary/morals
NIST. (2019, August 9). NIST | U.S. Leadership in AI: A Plan for Federal Engagement in Developing Technical Standards and Related Tools. https://www.nist.gov/system/files/documents/2019/08/10/ai_standards_fedengagement_plan_9aug2019.pdf
OECD.AI and GPAI. (2025). OECD | Policies, data and analysis for trustworthy artificial intelligence. https://oecd.ai/en/
Sussman, A. L. (2025, April 01). Should Human Life Be Optimized. https://www.nytimes.com/interactive/2025/04/01/opinion/ivf-gene-selection-fertility.html
The White House. (2023, November 01). Federal Register | Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence | EO 14110. https://www.federalregister.gov/documents/2023/11/01/2023-24283/safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence
The White House. (2025, 01 31). Federal Register | Removing Barriers to American Leadership in Artificial Intelligence | EO 14179. https://www.federalregister.gov/documents/2025/01/31/2025-02172/removing-barriers-to-american-leadership-in-artificial-intelligence
Tom L. Beauchamp, J. F. (2009, 2013). Principles of Biomedical Ethics. Retrieved from Internet Archive | https://archive.org/details/principlesofbiom0000beau_k8c1/page/n5/mode/2up