Overview

Dataset statistics

Number of variables8
Number of observations244
Missing cells6
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.1 KiB
Average record size in memory67.5 B

Variable types

DateTime1
Categorical3
Text1
Numeric3

Dataset

Description경기도 코로나19 신천지 방역현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=74LAO29DU55OP50KBAMK30087813&infSeq=1

Alerts

데이터기준일시 has constant value ""Constant
조치현황 has constant value ""Constant
소재지우편번호 is highly overall correlated with 위도 and 1 other fieldsHigh correlation
위도 is highly overall correlated with 소재지우편번호 and 1 other fieldsHigh correlation
경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
소재지주소 has unique valuesUnique

Reproduction

Analysis started2023-12-10 21:54:02.906735
Analysis finished2023-12-10 21:54:04.736978
Duration1.83 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

데이터기준일시
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2020-07-23 10:00:00
Maximum2020-07-23 10:00:00
2023-12-11T06:54:04.777425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:04.884699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

시군명
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
고양시
25 
안산시
22 
부천시
21 
수원시
17 
가평군
15 
Other values (19)
144 

Length

Max length4
Median length3
Mean length3.0819672
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
고양시 25
 
10.2%
안산시 22
 
9.0%
부천시 21
 
8.6%
수원시 17
 
7.0%
가평군 15
 
6.1%
남양주시 14
 
5.7%
성남시 13
 
5.3%
용인시 13
 
5.3%
화성시 13
 
5.3%
김포시 13
 
5.3%
Other values (14) 78
32.0%

Length

2023-12-11T06:54:05.010337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고양시 25
 
10.2%
안산시 22
 
9.0%
부천시 21
 
8.6%
수원시 17
 
7.0%
가평군 15
 
6.1%
남양주시 14
 
5.7%
성남시 13
 
5.3%
용인시 13
 
5.3%
화성시 13
 
5.3%
김포시 13
 
5.3%
Other values (14) 78
32.0%

소재지주소
Text

UNIQUE 

Distinct244
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T06:54:05.315474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length47
Mean length35.77459
Min length19

Characters and Unicode

Total characters8729
Distinct characters281
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique244 ?
Unique (%)100.0%

Sample

1st row경기도 가평군 청평면 청평리 301-4 7층 (경춘로 871) 네오 오피스텔
2nd row경기도 가평군 청평면 골안길 7-28 청평경남아너스빌 105동 1102호 (청평리 306)
3rd row경기도 가평군 청평면 청평리 301-4 6층 602, 608호 (경춘로 871 )
4th row경기도 가평군 청평면 청평리 301-4 5층 508호 (경춘로 871 )
5th row경기도 가평군 청평면 청평리 301-4 3층 304호, 4층 407,408호 (경춘로 871)
ValueCountFrequency (%)
경기도 244
 
12.6%
2층 28
 
1.4%
3층 28
 
1.4%
9 27
 
1.4%
고양시 25
 
1.3%
안산시 22
 
1.1%
단원구 22
 
1.1%
덕양구 22
 
1.1%
4층 21
 
1.1%
부천시 21
 
1.1%
Other values (696) 1482
76.3%
2023-12-11T06:54:05.792879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1702
 
19.5%
1 389
 
4.5%
2 280
 
3.2%
276
 
3.2%
259
 
3.0%
0 257
 
2.9%
250
 
2.9%
242
 
2.8%
225
 
2.6%
4 221
 
2.5%
Other values (271) 4628
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4569
52.3%
Decimal Number 1989
22.8%
Space Separator 1702
 
19.5%
Dash Punctuation 147
 
1.7%
Open Punctuation 113
 
1.3%
Close Punctuation 113
 
1.3%
Other Punctuation 73
 
0.8%
Math Symbol 12
 
0.1%
Uppercase Letter 11
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
276
 
6.0%
259
 
5.7%
250
 
5.5%
242
 
5.3%
225
 
4.9%
159
 
3.5%
151
 
3.3%
123
 
2.7%
123
 
2.7%
109
 
2.4%
Other values (242) 2652
58.0%
Decimal Number
ValueCountFrequency (%)
1 389
19.6%
2 280
14.1%
0 257
12.9%
4 221
11.1%
3 208
10.5%
6 146
 
7.3%
5 146
 
7.3%
7 132
 
6.6%
8 109
 
5.5%
9 101
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
B 3
27.3%
Y 1
 
9.1%
J 1
 
9.1%
A 1
 
9.1%
C 1
 
9.1%
I 1
 
9.1%
W 1
 
9.1%
G 1
 
9.1%
P 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
, 69
94.5%
. 3
 
4.1%
& 1
 
1.4%
Math Symbol
ValueCountFrequency (%)
~ 10
83.3%
> 1
 
8.3%
< 1
 
8.3%
Space Separator
ValueCountFrequency (%)
1702
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 147
100.0%
Open Punctuation
ValueCountFrequency (%)
( 113
100.0%
Close Punctuation
ValueCountFrequency (%)
) 113
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4569
52.3%
Common 4149
47.5%
Latin 11
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
276
 
6.0%
259
 
5.7%
250
 
5.5%
242
 
5.3%
225
 
4.9%
159
 
3.5%
151
 
3.3%
123
 
2.7%
123
 
2.7%
109
 
2.4%
Other values (242) 2652
58.0%
Common
ValueCountFrequency (%)
1702
41.0%
1 389
 
9.4%
2 280
 
6.7%
0 257
 
6.2%
4 221
 
5.3%
3 208
 
5.0%
- 147
 
3.5%
6 146
 
3.5%
5 146
 
3.5%
7 132
 
3.2%
Other values (10) 521
 
12.6%
Latin
ValueCountFrequency (%)
B 3
27.3%
Y 1
 
9.1%
J 1
 
9.1%
A 1
 
9.1%
C 1
 
9.1%
I 1
 
9.1%
W 1
 
9.1%
G 1
 
9.1%
P 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4569
52.3%
ASCII 4160
47.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1702
40.9%
1 389
 
9.4%
2 280
 
6.7%
0 257
 
6.2%
4 221
 
5.3%
3 208
 
5.0%
- 147
 
3.5%
6 146
 
3.5%
5 146
 
3.5%
7 132
 
3.2%
Other values (19) 532
 
12.8%
Hangul
ValueCountFrequency (%)
276
 
6.0%
259
 
5.7%
250
 
5.5%
242
 
5.3%
225
 
4.9%
159
 
3.5%
151
 
3.3%
123
 
2.7%
123
 
2.7%
109
 
2.4%
Other values (242) 2652
58.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct120
Distinct (%)49.6%
Missing2
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean13834.285
Minimum10059
Maximum18606
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T06:54:05.941895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10059
5-th percentile10130.5
Q112108.25
median13362.5
Q315806
95-th percentile18294.8
Maximum18606
Range8547
Interquartile range (IQR)3697.75

Descriptive statistics

Standard deviation2512.9489
Coefficient of variation (CV)0.18164646
Kurtosis-1.0658418
Mean13834.285
Median Absolute Deviation (MAD)1998.5
Skewness0.24316308
Sum3347897
Variance6314912.2
MonotonicityNot monotonic
2023-12-11T06:54:06.067299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15361 20
 
8.2%
10497 15
 
6.1%
12451 12
 
4.9%
14548 12
 
4.9%
16205 9
 
3.7%
16977 8
 
3.3%
18401 8
 
3.3%
11161 6
 
2.5%
10109 6
 
2.5%
12561 6
 
2.5%
Other values (110) 140
57.4%
ValueCountFrequency (%)
10059 1
 
0.4%
10098 3
1.2%
10104 1
 
0.4%
10108 1
 
0.4%
10109 6
2.5%
10117 1
 
0.4%
10387 1
 
0.4%
10414 2
 
0.8%
10461 1
 
0.4%
10473 1
 
0.4%
ValueCountFrequency (%)
18606 1
 
0.4%
18591 2
 
0.8%
18453 1
 
0.4%
18401 8
3.3%
18303 1
 
0.4%
18139 1
 
0.4%
18137 1
 
0.4%
17909 2
 
0.8%
17907 1
 
0.4%
17906 3
 
1.2%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct148
Distinct (%)61.2%
Missing2
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean37.466593
Minimum36.989745
Maximum37.860335
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T06:54:06.195052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.989745
5-th percentile37.150883
Q137.309924
median37.48788
Q337.635809
95-th percentile37.746768
Maximum37.860335
Range0.87058997
Interquartile range (IQR)0.32588475

Descriptive statistics

Standard deviation0.20039003
Coefficient of variation (CV)0.005348499
Kurtosis-0.56179397
Mean37.466593
Median Absolute Deviation (MAD)0.1701864
Skewness-0.14171767
Sum9066.9154
Variance0.040156166
MonotonicityNot monotonic
2023-12-11T06:54:06.317841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.3173704199 20
 
8.2%
37.6358091769 15
 
6.1%
37.5020205668 9
 
3.7%
37.3099244291 9
 
3.7%
37.741461021 8
 
3.3%
37.2713127744 8
 
3.3%
37.2109324755 6
 
2.5%
37.4894240096 5
 
2.0%
37.6166497612 5
 
2.0%
37.2989359747 4
 
1.6%
Other values (138) 153
62.7%
ValueCountFrequency (%)
36.9897450905 1
0.4%
36.9903126471 2
0.8%
36.9915887354 1
0.4%
36.9921498129 2
0.8%
36.9950225937 1
0.4%
36.9991270836 1
0.4%
37.116327629 1
0.4%
37.1322535082 2
0.8%
37.1465910554 1
0.4%
37.1479827575 1
0.4%
ValueCountFrequency (%)
37.860335062 1
0.4%
37.8573956743 1
0.4%
37.8511821313 2
0.8%
37.8508774589 1
0.4%
37.8507579687 2
0.8%
37.8492356539 1
0.4%
37.8329123909 1
0.4%
37.7591653054 1
0.4%
37.7590960784 1
0.4%
37.7519869505 1
0.4%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct148
Distinct (%)61.2%
Missing2
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean127.04953
Minimum126.62352
Maximum127.63821
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T06:54:06.434802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.62352
5-th percentile126.72459
Q1126.83488
median127.03231
Q3127.15536
95-th percentile127.50154
Maximum127.63821
Range1.0146906
Interquartile range (IQR)0.32048773

Descriptive statistics

Standard deviation0.2451303
Coefficient of variation (CV)0.0019294075
Kurtosis-0.39619682
Mean127.04953
Median Absolute Deviation (MAD)0.1899988
Skewness0.64343154
Sum30745.985
Variance0.060088866
MonotonicityNot monotonic
2023-12-11T06:54:06.561404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.8423088804 20
 
8.2%
126.8325305558 15
 
6.1%
126.771080153 9
 
3.7%
126.9944024119 9
 
3.7%
127.4228531567 8
 
3.3%
127.1265471151 8
 
3.3%
127.038340772 6
 
2.5%
127.5015384481 5
 
2.0%
126.7157782734 5
 
2.0%
127.6295661667 4
 
1.6%
Other values (138) 153
62.7%
ValueCountFrequency (%)
126.6235211378 1
 
0.4%
126.6990247253 1
 
0.4%
126.7098349282 2
 
0.8%
126.7109119001 1
 
0.4%
126.7157782734 5
2.0%
126.7161646313 1
 
0.4%
126.7176293139 1
 
0.4%
126.7231910662 1
 
0.4%
126.7510816918 1
 
0.4%
126.7597745001 1
 
0.4%
ValueCountFrequency (%)
127.6382117572 1
 
0.4%
127.6377251315 1
 
0.4%
127.6363716292 1
 
0.4%
127.6353351273 1
 
0.4%
127.6295661667 4
1.6%
127.5065909497 1
 
0.4%
127.5015384481 5
2.0%
127.5008823622 1
 
0.4%
127.4965496673 1
 
0.4%
127.4905763707 1
 
0.4%

시설구분명
Categorical

Distinct18
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
부속기관
140 
숙소
41 
교회
23 
<NA>
 
10
문화센터
 
6
Other values (13)
24 

Length

Max length5
Median length4
Mean length3.3852459
Min length2

Unique

Unique6 ?
Unique (%)2.5%

Sample

1st row교회
2nd row숙소
3rd row부속기관
4th row부속기관
5th row부속기관

Common Values

ValueCountFrequency (%)
부속기관 140
57.4%
숙소 41
 
16.8%
교회 23
 
9.4%
<NA> 10
 
4.1%
문화센터 6
 
2.5%
미확인 5
 
2.0%
확인중 3
 
1.2%
선교교회 2
 
0.8%
사택 2
 
0.8%
창고 2
 
0.8%
Other values (8) 10
 
4.1%

Length

2023-12-11T06:54:06.680729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부속기관 140
57.4%
숙소 41
 
16.8%
교회 23
 
9.4%
na 10
 
4.1%
문화센터 6
 
2.5%
미확인 5
 
2.0%
확인중 3
 
1.2%
위장교회 2
 
0.8%
복음방 2
 
0.8%
창고 2
 
0.8%
Other values (8) 10
 
4.1%

조치현황
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
방역처리완료
244 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row방역처리완료
2nd row방역처리완료
3rd row방역처리완료
4th row방역처리완료
5th row방역처리완료

Common Values

ValueCountFrequency (%)
방역처리완료 244
100.0%

Length

2023-12-11T06:54:06.791189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:54:06.869122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
방역처리완료 244
100.0%

Interactions

2023-12-11T06:54:03.843211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.275248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.560214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.932236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.355178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.646310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:04.036402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.461336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:54:03.741131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:54:06.920180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명소재지우편번호위도경도시설구분명
시군명1.0000.9950.9840.9790.635
소재지우편번호0.9951.0000.9530.9390.507
위도0.9840.9531.0000.8790.578
경도0.9790.9390.8791.0000.432
시설구분명0.6350.5070.5780.4321.000
2023-12-11T06:54:07.006046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명시설구분명
시군명1.0000.233
시설구분명0.2331.000
2023-12-11T06:54:07.078023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지우편번호위도경도시군명시설구분명
소재지우편번호1.000-0.8830.1040.9460.221
위도-0.8831.000-0.0440.8800.263
경도0.104-0.0441.0000.8540.180
시군명0.9460.8800.8541.0000.233
시설구분명0.2210.2630.1800.2331.000

Missing values

2023-12-11T06:54:04.418203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:54:04.570632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:54:04.681923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

데이터기준일시시군명소재지주소소재지우편번호위도경도시설구분명조치현황
02020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 7층 (경춘로 871) 네오 오피스텔1245137.741461127.422853교회방역처리완료
12020-07-23 10가평군경기도 가평군 청평면 골안길 7-28 청평경남아너스빌 105동 1102호 (청평리 306)1245137.741893127.422058숙소방역처리완료
22020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 6층 602, 608호 (경춘로 871 )1245137.741461127.422853부속기관방역처리완료
32020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 5층 508호 (경춘로 871 )1245137.741461127.422853부속기관방역처리완료
42020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 3층 304호, 4층 407,408호 (경춘로 871)1245137.741461127.422853부속기관방역처리완료
52020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 2층 203호 (경춘로 871)1245137.741461127.422853부속기관방역처리완료
62020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 2층 201호,202호 (경춘로 871)1245137.741461127.422853부속기관방역처리완료
72020-07-23 10가평군경기도 가평군 청평면 청평리 301-4 1층 104호 (경춘로 871 )1245137.741461127.422853부속기관방역처리완료
82020-07-23 10가평군경기도 가평군 청평면 잠곡로 69-16 (청평리 355-1)1245237.739057127.427112부속기관방역처리완료
92020-07-23 10가평군경기도 가평군 청평면 골안길 7-28 청평경남아너스빌 105동 1203호 (청평리 306)1245137.741893127.422058숙소방역처리완료
데이터기준일시시군명소재지주소소재지우편번호위도경도시설구분명조치현황
2342020-07-23 10화성시경기도 화성시 병점로 37-6 메트로프라자 1층 108호1840137.210932127.038341부속기관방역처리완료
2352020-07-23 10화성시경기도 화성시 병점로 37-6 메트로프라자 1층 107호1840137.210932127.038341부속기관방역처리완료
2362020-07-23 10화성시경기도 화성시 병점로 23-1 국민연립 B(나)동 101호 (진안동 524-9)1840137.209918127.037164숙소방역처리완료
2372020-07-23 10화성시경기도 화성시 경기대로 1038 YJ금성빌딩 A동 1층 103호1840137.20845127.035076부속기관방역처리완료
2382020-07-23 10화성시경기도 화성시 향남읍 향남로 418-10 3031859137.132254126.922284미확인방역처리완료
2392020-07-23 10화성시경기도 화성시 향남읍 상신하길로 328번길 26, 키움프라자 401호1860637.116328126.913524부속기관방역처리완료
2402020-07-23 10화성시경기도 화성시 병점로 37-6 메트로프라자 501호, 502호1840137.210932127.038341부속기관방역처리완료
2412020-07-23 10화성시경기도 화성시 병점로 37-6 메트로프라자 1층 111호1840137.210932127.038341부속기관방역처리완료
2422020-07-23 10화성시경기도 화성시 동탄중심상가2길 5 리더스프라자 5층 501, 502호1845337.205987127.072709부속기관방역처리완료
2432020-07-23 10화성시경기도 화성시 병점로 37-6 메트로프라자 101, 102호1840137.210932127.038341부속기관방역처리완료