Guide
Security Operations Center (SOC) Guide
A security operations center (SOC) is the centralized function responsible for monitoring, detecting, and responding to cybersecurity threats across your environment. This guide covers what a SOC does, the operating models available to mid-market companies, the people and technology required, how to decide between building and buying, and the metrics that separate a functioning SOC from an expensive dashboard.
TL;DR: A security operations center is the detection and response engine of your cybersecurity program. It combines people, processes, and technology to monitor your environment around the clock, separate real threats from noise, and coordinate containment before an incident becomes a breach. Most mid-market companies cannot justify the $1.5M+ annual cost of building an in-house SOC, which is why outsourced models like SOC as a service and managed detection and response exist. Regardless of the model you choose, the SOC needs strategic oversight from a CISO or fractional CISO who defines what the SOC watches for, how it responds, and how its findings inform business decisions.
What a security operations center does
A security operations center is the centralized function responsible for continuous threat monitoring, detection, investigation, and response across an organization’s IT environment. It is not a room full of screens (though it can be). It is a combination of skilled analysts, detection technology, defined processes, and governance structures that work together to find and stop threats before they cause damage.
The SOC operates as the front line of your cybersecurity program. It ingests data from every corner of the environment, including endpoints, servers, network devices, cloud workloads, identity providers, email gateways, and SaaS applications. That data flows into a centralized platform where automated rules and human analysts work in tandem to identify suspicious activity, investigate it, and determine whether it represents a genuine threat.
When a real threat is confirmed, the SOC executes the response. Depending on the SOC’s authority level, this might mean isolating a compromised endpoint, disabling a hijacked user account, blocking a malicious IP at the firewall, or escalating to the incident management program for coordinated cross-functional response.
The core functions of a SOC include:
- Continuous monitoring across all telemetry sources, 24 hours a day, 365 days a year
- Alert triage to separate true positives from false positives and reduce noise reaching internal teams
- Threat detection using behavioral analytics, correlation rules, threat intelligence, and custom detection logic
- Investigation to determine the scope, severity, root cause, and business impact of confirmed threats
- Response coordination including containment actions, evidence preservation, and handoff to incident response teams
- Proactive threat hunting to find adversaries who have evaded automated detection
- Reporting and metrics that quantify detection performance and inform security investment decisions
What separates a functioning SOC from an expensive monitoring dashboard is the depth of investigation and the speed of response. A SOC that generates alerts but leaves investigation and response to already-stretched IT staff is not a SOC. It is a notification service.
Why the SOC matters for mid-market companies
Mid-market companies face the same threat actors that target enterprises, but with smaller security teams and tighter budgets. Ransomware operators, business email compromise actors, and state-affiliated groups do not filter targets by employee count. They filter by vulnerability.
The challenge for mid-market security leaders is that threats operate continuously. Attackers do not confine their operations to business hours. A phishing email that lands at 2 AM, a credential stuffing attack that runs over a holiday weekend, or a supply chain compromise that triggers at 3 AM on a Saturday will all go undetected without continuous monitoring.
Before SOC capabilities existed as outsourced services, mid-market companies had two options: accept the gap and hope for the best, or spend millions building an internal operation they could barely staff. Today, the market offers models that deliver genuine 24/7 detection and response at a fraction of the in-house cost. The question is no longer whether you can afford a SOC. The question is which SOC model fits your risk profile, your budget, and your existing security maturity.
SOC operating models
Choosing a SOC model is one of the most consequential decisions in a mid-market security program. Each model involves different tradeoffs around cost, control, talent requirements, and operational depth.
In-house SOC
An in-house SOC is staffed entirely by the organization’s own employees. The team operates the SIEM, writes detection rules, triages alerts, investigates incidents, and coordinates response. This model provides the greatest control and the deepest institutional knowledge of the environment.
The cost is substantial. A functional 24/7 in-house SOC requires eight to twelve analysts at minimum to cover three shifts, weekends, holidays, and inevitable turnover. Add a SOC manager, detection engineers, and a threat intelligence analyst, and the personnel cost alone reaches $1 million to $2 million annually. Layer on SIEM licensing, endpoint detection tools, SOAR platforms, threat intelligence feeds, and training, and the total runs $1.5 million to $3.5 million per year.
In-house SOCs make sense for large enterprises with unique threat profiles, regulatory constraints on outsourcing, or environments so complex that external providers cannot achieve adequate visibility. For most mid-market companies, the cost-to-coverage ratio makes this model impractical.
SOC as a service (SOCaaS)
SOC as a service delivers the full SOC function through a third-party provider. The provider operates the SIEM, staffs the analysts, writes and tunes detection rules, triages alerts, investigates threats, and escalates according to a predefined runbook. Your organization retains ownership of security strategy and escalation decisions, while the provider handles operational execution.
SOCaaS pricing typically ranges from $5,000 to $25,000 per month for mid-market companies, depending on log source count, data ingestion volume, and whether the service includes active response or stops at monitoring and escalation. This is a fraction of the in-house cost, with the tradeoff being less direct control over analyst priorities and detection engineering.
SOCaaS works well when you need broad operational coverage, including log management, compliance reporting, and full incident lifecycle coordination, and when you have a CISO or security leader who can manage the provider relationship and set strategic direction.
Managed detection and response (MDR)
MDR is a more focused model. Where SOCaaS covers the full breadth of SOC operations, MDR concentrates on threat detection and active response. MDR providers deploy endpoint, network, and cloud sensors, pair them with human analysts and threat hunters, and deliver confirmed findings with containment actions already taken.
MDR is deeper on detection and faster on response than broad SOCaaS, but narrower in scope. MDR typically does not include log management, SIEM operations, or compliance reporting. For organizations whose primary concern is “find threats and stop them,” MDR is often the right fit. For organizations that also need operational breadth, SOCaaS or a hybrid model is more appropriate.
Hybrid SOC
The hybrid model combines internal staff with an outsourced provider. A common pattern is to maintain a small internal security team (two to four people) who handle strategy, escalation, vulnerability management, and vendor oversight, while an outsourced SOCaaS or MDR provider handles 24/7 monitoring and first-line triage.
This model gives mid-market companies the benefits of continuous monitoring without the full headcount burden. The internal team provides institutional knowledge and strategic direction. The outsourced provider delivers operational scale and after-hours coverage. The hybrid model is often the best fit for companies with 500 to 5,000 employees that have some internal security capability but cannot staff a round-the-clock operation.
The three pillars: people, process, technology
Every SOC, regardless of model, depends on three pillars. When a SOC underperforms, the root cause almost always traces back to a weakness in one of them.
People
A SOC is only as effective as its analysts. Technology generates alerts. People investigate them, determine what they mean, and decide what to do. Without skilled analysts, a SOC produces noise rather than signal.
SOC team roles and analyst tiers:
Tier 1 analyst (monitoring and triage): The first line. T1 analysts monitor the SIEM dashboard, review incoming alerts, perform initial triage using established playbooks, and escalate alerts that require deeper investigation. They handle the highest volume of work. The risk at this tier is alert fatigue: when T1 analysts see thousands of alerts per shift, critical findings can be dismissed as false positives.
Tier 2 analyst (investigation): T2 analysts take escalated alerts from T1 and conduct deeper investigation. They correlate data across multiple sources, analyze endpoint and network forensics, determine the scope and severity of confirmed threats, and make containment recommendations. T2 analysts need stronger technical skills than T1, including experience with packet analysis, log correlation, and malware behavior analysis.
Tier 3 analyst (advanced threats and hunting): The most experienced analysts. T3 handles complex incidents, performs proactive threat hunting, develops custom detection rules, and conducts root-cause analysis after major incidents. In many SOCs, T3 analysts also serve as the escalation point for incidents that cross organizational boundaries or require executive notification.
SOC manager: Owns the operational performance of the SOC. Responsible for staffing, shift scheduling, analyst development, process improvement, metrics reporting, and serving as the liaison between the SOC and security leadership. The SOC manager translates strategic priorities from the CISO into operational priorities for the analyst team.
Detection engineer: Writes, tunes, and maintains the detection rules that drive automated alerting. Detection engineering is the discipline that determines whether the SIEM catches real threats or drowns in false positives. This role requires deep knowledge of adversary techniques (MITRE ATT&CK), the organization’s environment, and the SIEM platform’s query language.
Threat intelligence analyst: Gathers, evaluates, and operationalizes threat intelligence. This means monitoring intelligence feeds, identifying threats relevant to the organization’s industry and geography, and converting intelligence into detection rules and hunting hypotheses that the SOC can act on.
Staffing is the most persistent challenge in SOC operations. Cybersecurity unemployment is near zero, and SOC analyst burnout is high. Organizations that underpay, overwork, or fail to provide career development paths lose analysts to competitors. Outsourced SOC models exist in large part because most organizations cannot win the talent competition.
Process
Technology and people are necessary but insufficient without process. SOC processes define how work flows from alert to resolution.
Alert triage workflow: Every alert follows a structured path. The alert fires, a T1 analyst performs initial assessment using a triage playbook, and the alert is classified as a true positive, false positive, or requires escalation. True positives enter the investigation workflow. False positives are documented and fed back to the detection engineering team for rule tuning. This cycle of alert, triage, feedback, and tuning is what keeps the SOC’s signal-to-noise ratio manageable.
Incident escalation: When a confirmed threat exceeds the SOC’s authority or requires cross-functional coordination, the escalation process defines who gets notified, through what channel, within what timeframe, and with what information. Escalation procedures should be written, tested, and include after-hours contact paths. The severity classification system (P1 through P4) determines escalation speed and stakeholder involvement.
Threat hunting cadence: Proactive hunting should not be ad hoc. Define a hunting schedule, assign hunting hypotheses aligned with your threat landscape, and allocate dedicated analyst time. Hunting that only happens “when there’s time” never happens.
Reporting and review: SOC operations produce data that informs security investment decisions. Weekly operational summaries, monthly metrics reports, and quarterly trend analysis should flow from the SOC to security leadership. Post-incident reviews after every significant event complete the feedback loop.
Technology
The SOC technology stack is the detection and response infrastructure that analysts use to do their work.
SIEM (Security Information and Event Management): The central nervous system of the SOC. The SIEM ingests logs from across the environment, normalizes and correlates the data, applies detection rules, and generates alerts. Modern SIEMs also provide investigation workbenches for analysts and long-term data retention for forensic analysis. SIEM licensing is typically priced by data ingestion volume, which makes log source selection a cost-sensitive decision.
SOAR (Security Orchestration, Automation, and Response): SOAR platforms automate repetitive SOC workflows. When an alert fires, the SOAR can automatically enrich it with threat intelligence, query the asset database for context, check whether the affected user is a VIP, and present the T1 analyst with a pre-populated triage summary. For confirmed threats, SOAR can execute containment actions (isolate an endpoint, block an IP) in seconds rather than the minutes or hours manual processes require.
EDR (Endpoint Detection and Response): EDR agents run on endpoints (laptops, servers, workstations) and collect detailed telemetry about process execution, file changes, network connections, and registry modifications. EDR provides the visibility that SIEM alone cannot deliver. When the SOC needs to investigate what happened on a specific machine, EDR telemetry is the primary data source.
Threat intelligence platform (TIP): Aggregates intelligence from commercial feeds, open-source repositories, ISACs (Information Sharing and Analysis Centers), and government advisories. The TIP normalizes indicators of compromise (IoCs) and maps them to adversary campaigns. This intelligence feeds into SIEM detection rules and threat hunting hypotheses.
Network detection and response (NDR): Monitors network traffic for suspicious patterns including lateral movement, command-and-control communication, and data exfiltration. NDR fills visibility gaps where endpoint agents are not deployed, such as IoT devices, legacy systems, and network infrastructure.
A common mistake is over-investing in technology and under-investing in the people and processes needed to operate it. A $500K SIEM generating 10,000 alerts per day is worthless if there are only two analysts to investigate them and no triage process to prioritize which alerts matter.
SOC processes in detail
The SOC’s operational value comes from its processes. These are the workflows that turn raw telemetry into security outcomes.
Alert triage
Alert triage is the highest-volume activity in any SOC. The goal is to rapidly determine whether an alert represents a real threat, a false positive, or a benign anomaly that requires tuning. Effective triage follows a structured decision tree:
- Contextualize the alert. Who is the affected user? What asset is involved? What is the asset’s criticality? Is the user on a travel schedule, or is the login from an unexpected geography?
- Check for known patterns. Has this alert type been previously classified? Is there a documented false-positive pattern for this detection rule?
- Enrich with intelligence. Does the IP, domain, or file hash appear in threat intelligence feeds? Is it associated with a known campaign?
- Classify and route. True positive: escalate to investigation. False positive: document and submit a tuning request. Needs more analysis: escalate to T2.
The quality of triage depends on the quality of the playbooks analysts follow. Well-written playbooks reduce triage time, improve consistency, and give junior analysts a framework for making decisions under pressure.
Incident escalation
Not every confirmed threat requires the same response intensity. The escalation framework maps threat severity to response actions:
P1 (Critical): Active data exfiltration, ransomware deployment, compromised administrative credentials, confirmed APT activity. Response: immediate containment, executive notification within 30 minutes, all-hands incident response.
P2 (High): Confirmed malware on business-critical systems, successful phishing with credential compromise, unauthorized access to sensitive data. Response: containment within one hour, management notification within two hours, dedicated investigation team.
P3 (Medium): Suspicious but unconfirmed activity, policy violations, vulnerability exploitation attempts that were blocked. Response: investigation within four hours, documented findings, remediation recommendation.
P4 (Low): Informational alerts, minor policy violations, reconnaissance activity. Response: logged, reviewed during next business day, trend-tracked for pattern analysis.
Threat hunting
Automated detection catches known patterns. Threat hunting finds what detection missed. Hunting is the proactive, hypothesis-driven search for adversary activity within the environment.
A structured hunting program operates on a regular cadence. Each hunt starts with a hypothesis based on threat intelligence, industry trends, or gaps identified in detection coverage. The hunter queries historical telemetry, analyzes patterns, and either confirms a threat or refines the detection logic to catch similar activity automatically in the future.
Hunting requires T3-level expertise, access to rich historical data (90 to 180 days minimum), and dedicated time. If your analysts are buried in alert triage, they are not hunting. This is one reason many mid-market companies access hunting through MDR providers or managed cybersecurity services that include hunting as part of the engagement.
Reporting
SOC reporting translates operational data into decision-support intelligence for security leadership and the board. Effective SOC reports include:
- Volume and classification of alerts processed during the reporting period
- Confirmed incidents with severity classification, response timeline, and resolution status
- Mean time to detect and mean time to respond trended over time
- False positive rate by detection rule category, showing tuning progress
- Threat landscape summary highlighting new threats relevant to the organization
- Recommendations for security investment, process improvement, or policy changes
Reports that dump raw numbers without context are useless to executives. The SOC manager should interpret the data: what do these numbers mean for business risk, and what action should leadership take?
Building vs buying a SOC
This is the strategic decision that defines your SOC program. The right answer depends on your organization’s size, budget, threat profile, regulatory environment, and existing security maturity.
When building makes sense
Building an in-house SOC is justified when your organization has unique requirements that external providers cannot meet. This includes environments with classified data or national security implications, highly complex infrastructure where external analysts would lack the institutional knowledge to be effective, regulatory requirements that restrict outsourcing of security functions, or an existing security team that is large enough to staff a 24/7 operation with room for specialization.
If you have fewer than 3,000 employees and no regulatory prohibition on outsourcing, building an in-house SOC is rarely the right call from a cost-effectiveness standpoint.
When buying makes sense
Outsourcing SOC operations, whether through SOCaaS, MDR, or a hybrid model, is the better choice for most mid-market companies. The math is straightforward: an in-house 24/7 SOC costs $1.5M to $3.5M per year. An outsourced equivalent costs $60K to $300K per year. The outsourced model also eliminates the hiring and retention burden in a market where SOC analysts turn over at 30 percent annually.
Outsourcing does not mean abdicating responsibility. The outsourced SOC executes monitoring, detection, and response according to priorities and policies you define. Someone on your side still needs to own the relationship, set strategic direction, review performance, and integrate SOC findings into your broader security and risk management program. That person is your CISO.
The cost comparison for mid-market
| Cost category | In-house SOC (annual) | Outsourced SOC (annual) |
|---|---|---|
| Analyst salaries (8-12 FTEs) | $800K - $1.8M | Included |
| SOC manager | $150K - $200K | Included |
| SIEM licensing | $100K - $500K | Included |
| EDR/SOAR/TIP tooling | $100K - $300K | Varies (may be included) |
| Threat intelligence feeds | $30K - $100K | Included |
| Training and certifications | $25K - $50K | Provider’s responsibility |
| Facility and infrastructure | $50K - $150K | N/A |
| Total | $1.5M - $3.5M | $60K - $300K |
The outsourced column carries the caveat that you still need internal security leadership (CISO or fractional CISO) and likely one to three internal security staff for vulnerability management, vendor oversight, and escalation handling. Even with those additions, the total cost of the outsourced model is typically 20 to 40 percent of the in-house alternative.
SOC maturity model
SOC maturity is not binary. Organizations progress through stages, and understanding where you are helps prioritize investment.
Level 1: Reactive
The organization has basic security tools (antivirus, firewall) but no continuous monitoring. Security events are discovered through user complaints, IT support tickets, or external notification. There is no centralized log collection, no SIEM, and no dedicated security analyst. Response is ad hoc.
Level 2: Minimally operational
A SIEM or log aggregation tool is deployed. At least one person reviews alerts, but coverage is limited to business hours. Detection relies on vendor-default rules with minimal tuning. False positive rates are high. Response processes exist on paper but are rarely tested. This is where many mid-market companies sit before engaging an outsourced SOC provider.
Level 3: Functional
24/7 monitoring is in place, either through internal staff or an outsourced provider. Detection rules are tuned to the environment, and false positive rates are actively managed. Alert triage follows documented playbooks. Incident escalation paths are defined and tested. Regular reporting provides visibility into SOC performance. Most mid-market companies should target this level as their operational baseline.
Level 4: Advanced
The SOC conducts proactive threat hunting on a regular cadence. Detection engineering is a dedicated function that continuously develops new rules based on threat intelligence, hunt findings, and incident learnings. SOAR automation handles routine triage and enrichment. Metrics are tracked, trended, and used to drive continuous improvement. The SOC contributes to security strategy, not just operational execution.
Level 5: Optimized
The SOC operates as a strategic capability. Detection coverage is mapped to the MITRE ATT&CK framework with known gaps documented and prioritized. Threat intelligence is operationalized into detection and hunting workflows. Automation handles the majority of T1 triage. Analysts focus on investigation, hunting, and detection engineering. SOC metrics directly inform board-level risk reporting. Few organizations below Fortune 500 scale operate at this level.
Common SOC mistakes
Understanding where SOCs fail is as important as understanding how they should work.
Alert fatigue
Alert fatigue is the most common SOC failure mode. When analysts face thousands of alerts per shift and the majority are false positives, they begin to dismiss alerts without thorough investigation. Critical findings get buried in the noise. The root cause is usually a combination of poorly tuned detection rules, insufficient automation, and inadequate staffing.
The fix is disciplined detection engineering. Every false positive should trigger a tuning request. Detection rules that consistently produce low-value alerts should be refined or retired. SOAR automation should handle routine enrichment and classification, freeing analysts to focus on alerts that actually require human judgment.
Understaffing
Running a SOC with fewer analysts than the workload demands produces predictable results: long response times, incomplete investigations, accumulated alert backlog, and analyst burnout followed by turnover. If you cannot staff the SOC adequately, outsource the function rather than operating a SOC that looks functional on an org chart but is not in practice.
Tool sprawl
Adding a new security tool for every emerging threat category creates complexity without proportional improvement. Each tool generates its own alerts, requires its own expertise, and adds integration overhead. A SOC with 15 tools and two analysts is less effective than one with five well-integrated tools and a team that knows them deeply.
Ignoring process
Technology-first SOC builds skip the process work that makes the technology useful. Without documented triage playbooks, escalation procedures, detection engineering workflows, and post-incident review cadences, the SOC operates on tribal knowledge that leaves when analysts leave. Process is what makes a SOC repeatable and improvable.
No strategic oversight
A SOC that operates without direction from a CISO or equivalent security leader monitors everything but prioritizes nothing. Strategic oversight defines what the SOC should watch most closely, how it should respond to different threat scenarios, what risk tolerance the organization has, and how SOC findings should be communicated to the board. Without this direction, the SOC defaults to generic detection coverage that may not align with actual business risk.
SOC metrics and KPIs
Metrics transform the SOC from a cost center that is difficult to evaluate into a capability with measurable performance.
Mean Time to Detect (MTTD)
The elapsed time between a threat occurring and the SOC identifying it. MTTD measures the effectiveness of detection rules, telemetry coverage, and analyst monitoring. Lower is better. Industry benchmarks vary, but a mature SOC should target MTTD under four hours for endpoint threats and under one hour for identity-based attacks.
Mean Time to Respond (MTTR)
The elapsed time between detection and initial containment. MTTR measures the SOC’s ability to act on confirmed threats. For P1 incidents, target MTTR under 30 minutes. For P2 incidents, under two hours. SOAR automation can dramatically reduce MTTR by executing pre-authorized containment actions within seconds of threat confirmation.
False positive rate
The percentage of alerts classified as false positives after triage. A high false positive rate (above 80 percent) indicates poor detection tuning. Track this metric by detection rule category to identify which rules need attention. The goal is continuous improvement: each month, the false positive rate should trend downward as detection engineering refines the rule set.
Alert-to-incident ratio
The ratio of total alerts generated to confirmed incidents. This metric reveals the overall fidelity of the detection stack. A ratio of 500:1 (500 alerts for every confirmed incident) suggests significant tuning opportunities. A ratio of 50:1 is closer to a well-tuned environment.
Escalation accuracy
How often the SOC’s initial severity classification matches the actual severity determined after full investigation. Poor escalation accuracy means either P1 incidents are being classified as P3 (dangerous) or P4 noise is being escalated as P2 (wasteful). Both patterns indicate process or training gaps.
Analyst utilization
The percentage of analyst time spent on investigation and response versus administrative tasks, meetings, and idle time. SOCs that automate enrichment and triage through SOAR typically show higher effective utilization because analysts spend less time gathering context and more time analyzing threats.
When a fractional CISO helps
A SOC, regardless of model, needs someone to set its direction. The SOC executes. The CISO defines what the SOC should be executing against.
For mid-market companies that outsource SOC operations, the gap is strategic. The SOCaaS or MDR provider handles detection and response, but several critical functions sit outside the provider’s scope: defining risk priorities that drive detection focus, establishing response policies and escalation thresholds, evaluating SOC provider performance against business objectives, integrating SOC findings into board-level risk reporting, and aligning the SOC program with regulatory requirements and compliance frameworks.
A fractional CISO fills this gap. At a fraction of the cost of a full-time hire, a fractional CISO provides the strategic oversight that ensures your SOC investment produces business outcomes rather than just operational activity. The fractional model works particularly well for companies in the 200 to 2,000 employee range that have outsourced their SOC operations and need someone to own the security strategy that governs them.
The combination of an outsourced SOC for operational execution and a fractional CISO for strategic oversight is the most cost-effective security operations model available to mid-market companies today. It provides continuous detection and response capability, strategic direction, board-level reporting, and vendor accountability, all without the $2M+ annual investment of building everything internally.
Questions & answers
What is a security operations center (SOC)?
How much does it cost to build an in-house SOC?
What is the difference between a SOC and a NOC?
What are the main SOC models for mid-market companies?
What metrics should a SOC track?
Can a mid-market company operate a SOC without a full-time CISO?
Ready to turn this into a working plan?
Nick's team helps growth-stage companies, PE/VC sponsors, and cybersecurity product teams translate security questions into board-ready decisions. First call is strategy, not vendor pitch.