Overview

Dataset statistics

Number of variables: 7
Number of observations: 23
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 64.7 B

Variable types

Numeric: 3
Categorical: 2
Text: 2

Dataset

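Description: Evacuation sites where residents can take emergency shelter and be protected from hazards such as human exposure in the event of a chemical spill or leak. The dataset provides fields such as facility name, address, and evacuation location.
URL: https://www.data.go.kr/data/15102164/fileData.do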

Alerts

연번 is highly overall correlated with 시설구분 (High correlation)
수용면적(제곱미터) is highly overall correlated with 수용인원(명) (High correlation)
수용인원(명) is highly overall correlated with 수용면적(제곱미터) (High correlation)
시설구분 is highly overall correlated with 연번 and 1 other field (High correlation)
대피장소 is highly overall correlated with 시설구분 (High correlation)
연번 has unique values (Unique)
시설명 has unique values (Unique)
주소 has unique values (Unique)
수용면적(제곱미터) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:18:25.697458
Analysis finished: 2023-12-12 16:18:27.103221
Duration: 1.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
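
The reported duration can be checked from the two timestamps. A minimal sketch, assuming both timestamps are in the same time zone (the report does not state one):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 16:18:25.697458")
finished = datetime.fromisoformat("2023-12-12 16:18:27.103221")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

Rounded to two decimals, this matches the 1.41 seconds shown in the report.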

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12
Minimum: 1
Maximum: 23
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.1
Q1: 6.5
Median: 12
Q3: 17.5
95-th percentile: 21.9
Maximum: 23
Range: 22
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 6.78233
Coefficient of variation (CV): 0.56519417
Kurtosis: -1.2
Mean: 12
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 276
Variance: 46
Monotonicity: Strictly increasing
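
The quantile and descriptive statistics for 연번 follow directly from the values 1 through 23. A minimal sketch using only the Python standard library, assuming the column holds exactly the integers 1 to 23 (consistent with the reported minimum, maximum, and distinct count) and that quantiles are linearly interpolated between order statistics:

```python
import math
import statistics

# Assumption: 연번 is the serial number 1..23, one per observation.
serial = list(range(1, 24))

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation

# method="inclusive" interpolates linearly between the sorted values.
q1, median, q3 = statistics.quantiles(serial, n=4, method="inclusive")
p5 = statistics.quantiles(serial, n=20, method="inclusive")[0]
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(x - median) for x in serial)

print(total, mean, variance, round(std, 5), q1, q3, iqr, mad)
```

Because the values are evenly spaced, the distribution is symmetric, which is why the skewness is reported as 0.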
Histogram with fixed size bins (bins=23)
Common values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
23 | 1 | 4.3%
22 | 1 | 4.3%
21 | 1 | 4.3%
20 | 1 | 4.3%
19 | 1 | 4.3%
18 | 1 | 4.3%
17 | 1 | 4.3%
16 | 1 | 4.3%
Other values (13) | 13 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
3 | 1 | 4.3%
4 | 1 | 4.3%
5 | 1 | 4.3%
6 | 1 | 4.3%
7 | 1 | 4.3%
8 | 1 | 4.3%
9 | 1 | 4.3%
10 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
23 | 1 | 4.3%
22 | 1 | 4.3%
21 | 1 | 4.3%
20 | 1 | 4.3%
19 | 1 | 4.3%
18 | 1 | 4.3%
17 | 1 | 4.3%
16 | 1 | 4.3%
15 | 1 | 4.3%
14 | 1 | 4.3%

시설구분 (facility type)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 8.7%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B
교육기관: 17
공공시설: 6
Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교육기관
2nd row: 교육기관
3rd row: 교육기관
4th row: 교육기관
5th row: 교육기관

Common Values

Value | Count | Frequency (%)
교육기관 | 17 | 73.9%
공공시설 | 6 | 26.1%
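
The frequency percentages are simply each category's count divided by the 23 observations. A minimal sketch that reproduces them from the counts listed above (the variable names are illustrative only):

```python
from collections import Counter

# Category counts copied from the Common Values table for 시설구분.
counts = Counter({"교육기관": 17, "공공시설": 6})
total = sum(counts.values())

# Share of each category, rounded to one decimal as in the report.
shares = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Distinct (%) is the number of distinct categories over the observations.
distinct_pct = round(100 * len(counts) / total, 1)

print(shares, distinct_pct)
```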

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
교육기관 | 17 | 73.9%
공공시설 | 6 | 26.1%

시설명 (facility name)
Text

UNIQUE 

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 9
Median length: 8
Mean length: 6.826087
Min length: 5

Characters and Unicode

Total characters: 157
Distinct characters: 57
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 100.0%

Sample

1st row: 군산소룡초등학교
2nd row: 월명중학교
3rd row: 전북외국어고등학교
4th row: 해성초등학교
5th row: 문창초등학교

Value | Count | Frequency (%)
군산소룡초등학교 | 1 | 4.3%
군산신풍초등학교 | 1 | 4.3%
군산배드민턴장 | 1 | 4.3%
군산설림도서관 | 1 | 4.3%
군산청소년수련관 | 1 | 4.3%
군산장애인체육관 | 1 | 4.3%
군산월명체육관 | 1 | 4.3%
군산대야초등학교 | 1 | 4.3%
군산동초등학교 | 1 | 4.3%
군산초등학교 | 1 | 4.3%
Other values (13) | 13 | 56.5%

Most occurring characters

ValueCountFrequency (%)
17
 
10.8%
17
 
10.8%
15
 
9.6%
15
 
9.6%
14
 
8.9%
13
 
8.3%
5
 
3.2%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (47) 55
35.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 157 | 100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
10.8%
17
 
10.8%
15
 
9.6%
15
 
9.6%
14
 
8.9%
13
 
8.3%
5
 
3.2%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (47) 55
35.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 157 | 100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
10.8%
17
 
10.8%
15
 
9.6%
15
 
9.6%
14
 
8.9%
13
 
8.3%
5
 
3.2%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (47) 55
35.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 157 | 100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
10.8%
17
 
10.8%
15
 
9.6%
15
 
9.6%
14
 
8.9%
13
 
8.3%
5
 
3.2%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (47) 55
35.0%

주소 (address)
Text

UNIQUE 

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 25
Median length: 20
Mean length: 16.826087
Min length: 15

Characters and Unicode

Total characters: 387
Distinct characters: 68
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 100.0%

Sample

1st row: 전라북도 군산시 설림길 55
2nd row: 전라북도 군산시 설림5길 90
3rd row: 전라북도 군산시 해망로 525
4th row: 전라북도 군산시 옥성남길 21
5th row: 전라북도 군산시 공항로 394

Value | Count | Frequency (%)
전라북도 | 23 | 24.0%
군산시 | 23 | 24.0%
설림길 | 2 | 2.1%
75 | 2 | 2.1%
번영로 | 2 | 2.1%
29 | 2 | 2.1%
대야관통로 | 1 | 1.0%
신지길 | 1 | 1.0%
26 | 1 | 1.0%
자곡로 | 1 | 1.0%
Other values (38) | 38 | 39.6%

Most occurring characters

ValueCountFrequency (%)
73
18.9%
25
 
6.5%
24
 
6.2%
24
 
6.2%
23
 
5.9%
23
 
5.9%
23
 
5.9%
23
 
5.9%
5 13
 
3.4%
12
 
3.1%
Other values (58) 124
32.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 253 | 65.4%
Space Separator | 73 | 18.9%
Decimal Number | 60 | 15.5%
Dash Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
9.9%
24
 
9.5%
24
 
9.5%
23
 
9.1%
23
 
9.1%
23
 
9.1%
23
 
9.1%
12
 
4.7%
10
 
4.0%
4
 
1.6%
Other values (46) 62
24.5%
Decimal Number
Value | Count | Frequency (%)
5 | 13 | 21.7%
2 | 11 | 18.3%
1 | 8 | 13.3%
3 | 6 | 10.0%
9 | 6 | 10.0%
8 | 5 | 8.3%
7 | 4 | 6.7%
4 | 3 | 5.0%
6 | 3 | 5.0%
0 | 1 | 1.7%
Space Separator
Value | Count | Frequency (%)
(space) | 73 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 253 | 65.4%
Common | 134 | 34.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
9.9%
24
 
9.5%
24
 
9.5%
23
 
9.1%
23
 
9.1%
23
 
9.1%
23
 
9.1%
12
 
4.7%
10
 
4.0%
4
 
1.6%
Other values (46) 62
24.5%
Common
ValueCountFrequency (%)
73
54.5%
5 13
 
9.7%
2 11
 
8.2%
1 8
 
6.0%
3 6
 
4.5%
9 6
 
4.5%
8 5
 
3.7%
7 4
 
3.0%
4 3
 
2.2%
6 3
 
2.2%
Other values (2) 2
 
1.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 253 | 65.4%
ASCII | 134 | 34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
73
54.5%
5 13
 
9.7%
2 11
 
8.2%
1 8
 
6.0%
3 6
 
4.5%
9 6
 
4.5%
8 5
 
3.7%
7 4
 
3.0%
4 3
 
2.2%
6 3
 
2.2%
Other values (2) 2
 
1.5%
Hangul
ValueCountFrequency (%)
25
9.9%
24
 
9.5%
24
 
9.5%
23
 
9.1%
23
 
9.1%
23
 
9.1%
23
 
9.1%
12
 
4.7%
10
 
4.0%
4
 
1.6%
Other values (46) 62
24.5%

대피장소 (evacuation location)
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 26.1%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B
1층 강당
2층 강당
1층 체육관
3층강당
2층 체육관

Length

Max length: 6
Median length: 5
Mean length: 5.2608696
Min length: 4

Unique

Unique: 3
Unique (%): 13.0%

Sample

1st row: 3층강당
2nd row: 2층 강당
3rd row: 2층 체육관
4th row: 1층 강당
5th row: 1층 강당

Common Values

Value | Count | Frequency (%)
1층 강당 | 8 | 34.8%
2층 강당 | 7 | 30.4%
1층 체육관 | 5 | 21.7%
3층강당 | 1 | 4.3%
2층 체육관 | 1 | 4.3%
1층 도서관 | 1 | 4.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
강당 | 15 | 33.3%
1층 | 14 | 31.1%
2층 | 8 | 17.8%
체육관 | 6 | 13.3%
3층강당 | 1 | 2.2%
도서관 | 1 | 2.2%
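
The word-level table counts whitespace-separated tokens, which is why 3층강당 (written without a space) appears as its own token instead of contributing to 3층 and 강당. A minimal sketch that reproduces the token counts from the category counts and shows one possible normalization; the `normalize` helper and its regular expression are illustrative assumptions, not part of the report:

```python
import re
from collections import Counter

# Category counts copied from the Common Values table for 대피장소.
category_counts = {
    "1층 강당": 8, "2층 강당": 7, "1층 체육관": 5,
    "3층강당": 1, "2층 체육관": 1, "1층 도서관": 1,
}

def token_counts(counts):
    """Count whitespace-separated tokens, weighted by category count."""
    tokens = Counter()
    for value, n in counts.items():
        for token in value.split():
            tokens[token] += n
    return tokens

def normalize(value):
    """Hypothetical cleanup: insert a space after the floor marker if missing."""
    return re.sub(r"(\d+층)(?=\S)", r"\1 ", value)

raw_tokens = token_counts(category_counts)

cleaned_counts = Counter()
for value, n in category_counts.items():
    cleaned_counts[normalize(value)] += n
clean_tokens = token_counts(cleaned_counts)

print(raw_tokens.most_common())
print(clean_tokens.most_common())
```

With the spacing normalized, the single 3층강당 entry splits into 3층 and 강당, so 강당 rises from 15 to 16 occurrences.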

수용면적(제곱미터) (capacity area, square meters)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1219.913
Minimum: 72
Maximum: 10821
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 72
5-th percentile: 147.4
Q1: 238.5
Median: 728
Q3: 941
95-th percentile: 3992.7
Maximum: 10821
Range: 10749
Interquartile range (IQR): 702.5

Descriptive statistics

Standard deviation: 2269.3271
Coefficient of variation (CV): 1.8602368
Kurtosis: 15.888732
Mean: 1219.913
Median Absolute Deviation (MAD): 475
Skewness: 3.8399894
Sum: 28058
Variance: 5149845.4
Monotonicity: Not monotonic
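
The large skewness and the gap between mean (1219.913) and median (728) point to a few very large sites. A minimal sketch that applies the common 1.5 × IQR rule to the reported quartiles; this rule is an illustrative choice and is not something the profiling report itself applies:

```python
# Quartiles, mean, and standard deviation copied from the statistics above.
q1, q3 = 238.5, 941.0
mean, std = 1219.913, 2269.3271

iqr = q3 - q1
upper_fence = q3 + 1.5 * iqr   # values above this are flagged as high outliers
lower_fence = q1 - 1.5 * iqr   # negative here, so no low outliers are possible

# Ten largest values of 수용면적(제곱미터) as listed in the report.
largest = [10821, 4209, 2046, 1188, 1071, 984, 898, 842, 831, 823]
outliers = [v for v in largest if v > upper_fence]

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

print(iqr, upper_fence, outliers, round(cv, 4))
```

Under this rule three sites (10821, 4209, and 2046 square meters) lie above the upper fence, which is consistent with the long right tail.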
Histogram with fixed size bins (bins=23)
Common values

Value | Count | Frequency (%)
728 | 1 | 4.3%
1071 | 1 | 4.3%
72 | 1 | 4.3%
898 | 1 | 4.3%
143 | 1 | 4.3%
842 | 1 | 4.3%
823 | 1 | 4.3%
10821 | 1 | 4.3%
234 | 1 | 4.3%
253 | 1 | 4.3%
Other values (13) | 13 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
72 | 1 | 4.3%
143 | 1 | 4.3%
187 | 1 | 4.3%
204 | 1 | 4.3%
234 | 1 | 4.3%
238 | 1 | 4.3%
239 | 1 | 4.3%
240 | 1 | 4.3%
253 | 1 | 4.3%
339 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
10821 | 1 | 4.3%
4209 | 1 | 4.3%
2046 | 1 | 4.3%
1188 | 1 | 4.3%
1071 | 1 | 4.3%
984 | 1 | 4.3%
898 | 1 | 4.3%
842 | 1 | 4.3%
831 | 1 | 4.3%
823 | 1 | 4.3%

수용인원(명) (capacity, persons)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 21
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1467.3913
Minimum: 80
Maximum: 13000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 80
5-th percentile: 175
Q1: 280
Median: 880
Q3: 1135
95-th percentile: 4838
Maximum: 13000
Range: 12920
Interquartile range (IQR): 855

Descriptive statistics

Standard deviation: 2729.7073
Coefficient of variation (CV): 1.860245
Kurtosis: 15.789081
Mean: 1467.3913
Median Absolute Deviation (MAD): 580
Skewness: 3.8274262
Sum: 33750
Variance: 7451302
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Common values

Value | Count | Frequency (%)
280 | 3 | 13.0%
1290 | 1 | 4.3%
80 | 1 | 4.3%
1080 | 1 | 4.3%
170 | 1 | 4.3%
1020 | 1 | 4.3%
990 | 1 | 4.3%
13000 | 1 | 4.3%
300 | 1 | 4.3%
220 | 1 | 4.3%
Other values (11) | 11 | 47.8%

Minimum 10 values

Value | Count | Frequency (%)
80 | 1 | 4.3%
170 | 1 | 4.3%
220 | 1 | 4.3%
240 | 1 | 4.3%
280 | 3 | 13.0%
290 | 1 | 4.3%
300 | 1 | 4.3%
410 | 1 | 4.3%
830 | 1 | 4.3%
880 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
13000 | 1 | 4.3%
5100 | 1 | 4.3%
2480 | 1 | 4.3%
1400 | 1 | 4.3%
1290 | 1 | 4.3%
1190 | 1 | 4.3%
1080 | 1 | 4.3%
1020 | 1 | 4.3%
1000 | 1 | 4.3%
990 | 1 | 4.3%

Interactions

[Figures: nine pairwise interaction scatter plots]

Correlations

 | 연번 | 시설구분 | 시설명 | 주소 | 대피장소 | 수용면적(제곱미터) | 수용인원(명)
연번 | 1.000 | 0.988 | 1.000 | 1.000 | 0.511 | 0.491 | 0.491
시설구분 | 0.988 | 1.000 | 1.000 | 1.000 | 0.900 | 0.317 | 0.317
시설명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대피장소 | 0.511 | 0.900 | 1.000 | 1.000 | 1.000 | 0.433 | 0.433
수용면적(제곱미터) | 0.491 | 0.317 | 1.000 | 1.000 | 0.433 | 1.000 | 1.000
수용인원(명) | 0.491 | 0.317 | 1.000 | 1.000 | 0.433 | 1.000 | 1.000

 | 대피장소 | 시설구분
대피장소 | 1.000 | 0.643
시설구분 | 0.643 | 1.000

 | 연번 | 수용면적(제곱미터) | 수용인원(명) | 시설구분 | 대피장소
연번 | 1.000 | -0.350 | -0.347 | 0.708 | 0.036
수용면적(제곱미터) | -0.350 | 1.000 | 0.999 | 0.186 | 0.257
수용인원(명) | -0.347 | 0.999 | 1.000 | 0.186 | 0.257
시설구분 | 0.708 | 0.186 | 0.186 | 1.000 | 0.643
대피장소 | 0.036 | 0.257 | 0.257 | 0.643 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

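First rows

Index | 연번 | 시설구분 | 시설명 | 주소 | 대피장소 | 수용면적(제곱미터) | 수용인원(명)
0 | 1 | 교육기관 | 군산소룡초등학교 | 전라북도 군산시 설림길 55 | 3층강당 | 728 | 880
1 | 2 | 교육기관 | 월명중학교 | 전라북도 군산시 설림5길 90 | 2층 강당 | 1071 | 1290
2 | 3 | 교육기관 | 전북외국어고등학교 | 전라북도 군산시 해망로 525 | 2층 체육관 | 2046 | 2480
3 | 4 | 교육기관 | 해성초등학교 | 전라북도 군산시 옥성남길 21 | 1층 강당 | 240 | 290
4 | 5 | 교육기관 | 문창초등학교 | 전라북도 군산시 공항로 394 | 1층 강당 | 831 | 1000
5 | 6 | 교육기관 | 미성초등학교 | 전라북도 군산시 칠성2길 27 | 2층 강당 | 687 | 830
6 | 7 | 교육기관 | 옥봉초등학교 | 전라북도 군산시 옥봉초교길 29 | 2층 강당 | 781 | 940
7 | 8 | 교육기관 | 군산대학교 | 전라북도 군산시 대학로 558 | 1층 체육관 | 4209 | 5100
8 | 9 | 교육기관 | 군산중학교 | 전라북도 군산시 군중길 18 | 1층 강당 | 1188 | 1400
9 | 10 | 교육기관 | 경포초등학교 | 전라북도 군산시 백릉로 75 | 1층 강당 | 984 | 1190

Last rows

Index | 연번 | 시설구분 | 시설명 | 주소 | 대피장소 | 수용면적(제곱미터) | 수용인원(명)
13 | 14 | 교육기관 | 군산지곡초등학교 | 전라북도 군산시 신지길 26 | 2층 강당 | 239 | 280
14 | 15 | 교육기관 | 군산초등학교 | 전라북도 군산시 자곡로 68 | 1층 강당 | 187 | 220
15 | 16 | 교육기관 | 군산동초등학교 | 전라북도 군산시 번영로 181 | 2층 강당 | 253 | 300
16 | 17 | 교육기관 | 군산대야초등학교 | 전라북도 군산시 대야면 대야관통로 141-13 | 1층 강당 | 234 | 280
17 | 18 | 공공시설 | 군산월명체육관 | 전라북도 군산시 번영로 281 | 1층 체육관 | 10821 | 13000
18 | 19 | 공공시설 | 군산장애인체육관 | 전라북도 군산시 성산면 강변로 459 | 1층 체육관 | 823 | 990
19 | 20 | 공공시설 | 군산청소년수련관 | 전라북도 군산시 청소년회관로 75 | 1층 체육관 | 842 | 1020
20 | 21 | 공공시설 | 군산설림도서관 | 전라북도 군산시 설림길 25 | 1층 도서관 | 143 | 170
21 | 22 | 공공시설 | 군산배드민턴장 | 전라북도 군산시 남수송5길 39 | 1층 체육관 | 898 | 1080
22 | 23 | 공공시설 | 군사농업인회관 | 전라북도 군산시 개정면 운회길 32 | 1층 강당 | 72 | 80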
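
The near-perfect correlation between 수용면적(제곱미터) and 수용인원(명) can be checked against the sample rows. A minimal sketch using only the 20 rows shown above (rows with 연번 11 to 13 are not listed in the sample, so the result approximates rather than reproduces the report's figure); the `pearson` helper is written out here for illustration:

```python
import math

# Area (square meters) and capacity (persons) from the 20 sample rows above.
area = [728, 1071, 2046, 240, 831, 687, 781, 4209, 1188, 984,
        239, 187, 253, 234, 10821, 823, 842, 143, 898, 72]
persons = [880, 1290, 2480, 290, 1000, 830, 940, 5100, 1400, 1190,
           280, 220, 300, 280, 13000, 990, 1020, 170, 1080, 80]

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

r = pearson(area, persons)

# Persons per square meter for each listed site.
ratios = [p / a for a, p in zip(area, persons)]

print(round(r, 4), round(min(ratios), 3), round(max(ratios), 3))
```

In these rows the capacity stays within a narrow band of roughly 1.1 to 1.2 persons per square meter, which suggests that one column was derived from the other and explains the high-correlation alert.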