sfgov/covid19-testing-by-geography-and-date-qhc5-mubk

  • covid
  • covid-19
  • covid19 testing
  • geography
  • lab testing
  • + 3

Covid-19 Testing by Geography and Date

Covid-19 Testing by Geography and Date

<i><b>Note: As of April 16, 2021, this dataset will update daily with a five-day data lag.</i></b>

<strong>A. SUMMARY</strong>

This dataset includes COVID-19 tests by resident neighborhood and specimen collection date (the day the test was collected). Specifically, this dataset includes tests of San Francisco residents who listed a San Francisco home address at the time of testing. These resident addresses were then geo-located and mapped to neighborhoods. The resident address associated with each test is hand-entered and susceptible to errors, therefore neighborhood data should be interpreted as an approximation, not a precise nor comprehensive total.

In recent months, about 5% of tests are missing addresses and therefore cannot be included in any neighborhood totals. In earlier months, more tests were missing address data. Because of this high percentage of tests missing resident address data, this neighborhood testing data for March, April, and May should be interpreted with caution (see below)

Percentage of tests missing address information, by month in 2020

Mar - 33.6%

Apr - 25.9%

May - 11.1%

Jun - 7.2%

Jul - 5.8%

Aug - 5.4%

Sep - 5.1%

Oct (Oct 1-12) - 5.1%

To protect the privacy of residents, the City does not disclose the number of tests in neighborhoods with resident populations of fewer than 1,000 people. These neighborhoods are omitted from the data (they include Golden Gate Park, John McLaren Park, and Lands End).

Tests for residents that listed a Skilled Nursing Facility as their home address are not included in this neighborhood-level testing data. Skilled Nursing Facilities have required and repeated testing of residents, which would change neighborhood trends and not reflect the broader neighborhood's testing data.

This data was de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected).

<strong>The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco. </strong>During this investigation, some test results are found to be for persons living outside of San Francisco and some people in San Francisco may be tested multiple times (which is common). To see the number of new confirmed cases by neighborhood, reference this map: https://data.sfgov.org/stories/s/Map-of-Cumulative-Cases/adm5-wq8i#new-cases-map

<strong>B. HOW THE DATASET IS CREATED</strong>

COVID-19 laboratory test data is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information. All testing data is then geo-coded by resident address. Then data is aggregated by <a href="https://data.sfgov.org/Geographic-Locations-and-Boundaries/Analysis-Neighborhoods/p5b7-5n3h

">analysis neighborhood</a> and specimen collection date.

Data are prepared by close of business Monday through Saturday for public display.

<strong>C. UPDATE PROCESS</strong>

Updates automatically at 05:00 Pacific Time each day. Redundant runs are scheduled at 07:00 and 09:00 in case of pipeline failure.

<strong>D. HOW TO USE THIS DATASET</strong>

Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments.

In order to track trends over time, a data user can analyze this data by "specimen_collection_date".

Calculating Percent Positivity: The positivity rate is the percentage of tests that return a positive result for COVID-19 (positive tests divided by the sum of positive and negative tests). Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of percent positive. Percent positivity indicates how widesprea

Columns

NameSocrata field nameColumn name in sgr mountData typeDescription
data_loaded_atdata_loaded_atdata_loaded_atCalendar date
Cumulative Indeterminate Testscumulative_indeterminate_testscumulative_indeterminate_testsNumberCumulative indeterminate tests collected as of the specified date for residents living in the area
Cumulative Negative Testscumulative_negative_testscumulative_negative_testsNumberCumulative negative tests collected as of the specified date for residents living in the area
New Negative Testsnew_negative_testsnew_negative_testsNumberNegative tests collected on the specified date for residents living in the area
Cumulative Positive Testscumulative_positive_testscumulative_positive_testsNumberCumulative positive tests collected as of the specified date for residents living in the area
idididTextThe identifier for the area type
area_typearea_typearea_typeTextType of geographic area
Specimen Collection Datespecimen_collection_datespecimen_collection_dateCalendar dateDate tests were collected
Cumulative Testscumulative_testscumulative_testsNumberCumulative tests collected as of the specified date for residents living in the area
acs_populationacs_populationacs_populationNumberThe population from the latest 5-year estimates from the American Community Survey (2015-2019)
New Testsnew_testsnew_testsNumberTotal tests collected on the specified date for residents living in the area
Cumulative Testing Ratecumulative_testing_ratecumulative_testing_rateNumberThe cumulate testing in the area, calculated as (cumulative tests /acs_population) * 10000 which is a rate per 10,000 residents
New Positive Testsnew_positive_testsnew_positive_testsNumberPositive tests collected on the specified date for residents living in the area
last_updated_atlast_updated_atlast_updated_atCalendar date
New Indeterminate Testsnew_indeterminate_testsnew_indeterminate_testsNumberIndeterminate tests collected on the specified date for residents living in the area
multipolygonmultipolygonmultipolygonMultiPolygon

Upstream Metadata