Overview

Dataset statistics

Number of variables: 4
Number of observations: 89
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.0 KiB
Average record size in memory: 34.5 B

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

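Description: Data on facilities subject to indoor air quality management (multi-use facilities and similar) under the Indoor Air Quality Control Act (실내공기질 관리법), listing facility category, facility name, and location.
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: https://www.data.go.kr/data/15037443/fileData.do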

Alerts

연번 is highly overall correlated with 시설군 (High correlation)
시설군 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-04-29 22:27:45.289690
Analysis finished: 2024-04-29 22:27:47.177731
Duration: 1.89 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
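
The reported duration can be checked against the two timestamps. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started/finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-29 22:27:45.289690", FMT)
finished = datetime.strptime("2024-04-29 22:27:47.177731", FMT)

# Elapsed wall-clock time in seconds, rounded to two decimals for display.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} s")
```

Rounded to two decimals, the difference agrees with the duration shown in the report.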

Variables

연번
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 89
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 45
Minimum: 1
Maximum: 89
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 933.0 B

Quantile statistics

Minimum: 1
5th percentile: 5.4
Q1: 23
Median: 45
Q3: 67
95th percentile: 84.6
Maximum: 89
Range: 88
Interquartile range (IQR): 44
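
Because 연번 is simply the sequence 1 to 89, the quantile figures can be reproduced by hand. The sketch below assumes linearly interpolated percentiles (the `percentile` helper is hypothetical, written for this illustration); that assumption is consistent with the fractional values 5.4 and 84.6 reported here.

```python
def percentile(sorted_values, p):
    """Linearly interpolated percentile of a sorted list, with p in [0, 100]."""
    position = (len(sorted_values) - 1) * p / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction

serial_numbers = list(range(1, 90))  # 연번 runs from 1 to 89 without gaps

quantiles = {p: percentile(serial_numbers, p) for p in (5, 25, 50, 75, 95)}
iqr = quantiles[75] - quantiles[25]          # interquartile range
value_range = serial_numbers[-1] - serial_numbers[0]
```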

Descriptive statistics

Standard deviation: 25.836021
Coefficient of variation (CV): 0.57413381
Kurtosis: -1.2
Mean: 45
Median Absolute Deviation (MAD): 22
Skewness: 0
Sum: 4005
Variance: 667.5
Monotonicity: Strictly increasing
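
Most of these figures follow directly from the fact that 연번 is the sequence 1 to 89. The sketch below recomputes them with the Python standard library, assuming the sample (n - 1) definition of variance, which matches the reported 667.5; skewness and kurtosis are left out.

```python
import statistics

serial_numbers = list(range(1, 90))  # 연번 runs from 1 to 89 without gaps

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)  # sample variance, divisor n - 1
std_dev = statistics.stdev(serial_numbers)      # square root of the sample variance
cv = std_dev / mean                             # coefficient of variation

# Median absolute deviation: median of the distances from the median.
median = statistics.median(serial_numbers)
mad = statistics.median([abs(x - median) for x in serial_numbers])
```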
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.1%
68 | 1 | 1.1%
66 | 1 | 1.1%
65 | 1 | 1.1%
64 | 1 | 1.1%
63 | 1 | 1.1%
62 | 1 | 1.1%
61 | 1 | 1.1%
60 | 1 | 1.1%
59 | 1 | 1.1%
Other values (79) | 79 | 88.8%

Value | Count | Frequency (%)
1 | 1 | 1.1%
2 | 1 | 1.1%
3 | 1 | 1.1%
4 | 1 | 1.1%
5 | 1 | 1.1%
6 | 1 | 1.1%
7 | 1 | 1.1%
8 | 1 | 1.1%
9 | 1 | 1.1%
10 | 1 | 1.1%

Value | Count | Frequency (%)
89 | 1 | 1.1%
88 | 1 | 1.1%
87 | 1 | 1.1%
86 | 1 | 1.1%
85 | 1 | 1.1%
84 | 1 | 1.1%
83 | 1 | 1.1%
82 | 1 | 1.1%
81 | 1 | 1.1%
80 | 1 | 1.1%

시설군
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 14.6%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

의료기관: 29
어린이집: 19
실내주차장: 16
지하역사: 8
목욕장: 3
Other values (8): 14

Length

Max length: 9
Median length: 4
Mean length: 4.3932584
Min length: 2

Unique

Unique: 4
Unique (%): 4.5%

Sample

1st row: 학원
2nd row: 지하역사
3rd row: 지하역사
4th row: 지하역사
5th row: 지하역사

Common Values

Value | Count | Frequency (%)
의료기관 | 29 | 32.6%
어린이집 | 19 | 21.3%
실내주차장 | 16 | 18.0%
지하역사 | 8 | 9.0%
목욕장 | 3 | 3.4%
대규모점포 | 3 | 3.4%
PC영업시설 | 3 | 3.4%
실내어린이놀이시설 | 2 | 2.2%
산후조리원 | 2 | 2.2%
학원 | 1 | 1.1%
Other values (3) | 3 | 3.4%
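
The percentages in the Common Values table are each count divided by the 89 observations. The sketch below recomputes them and also recovers the Distinct and Unique figures. The three categories folded into "Other values (3)" are not named in that table, so they appear here under placeholder keys (`other_1` to `other_3`), each with a count of 1, which is the only split consistent with three categories totalling three rows.

```python
from collections import Counter

# Counts transcribed from the Common Values table. The last three keys are
# placeholders for the categories grouped under "Other values (3)".
counts = Counter({
    "의료기관": 29, "어린이집": 19, "실내주차장": 16, "지하역사": 8,
    "목욕장": 3, "대규모점포": 3, "PC영업시설": 3,
    "실내어린이놀이시설": 2, "산후조리원": 2, "학원": 1,
    "other_1": 1, "other_2": 1, "other_3": 1,
})

total = sum(counts.values())
# Share of each category in percent, rounded to one decimal as in the table.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

distinct = len(counts)                                # number of categories
unique = sum(1 for n in counts.values() if n == 1)    # categories seen once
```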

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
의료기관 | 29 | 32.6%
어린이집 | 19 | 21.3%
실내주차장 | 16 | 18.0%
지하역사 | 8 | 9.0%
목욕장 | 3 | 3.4%
대규모점포 | 3 | 3.4%
pc영업시설 | 3 | 3.4%
실내어린이놀이시설 | 2 | 2.2%
산후조리원 | 2 | 2.2%
학원 | 1 | 1.1%
Other values (3) | 3 | 3.4%

시설명
Text

Distinct: 85
Distinct (%): 95.5%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 29
Median length: 21
Mean length: 9.7640449
Min length: 4

Characters and Unicode

Total characters: 869
Distinct characters: 196
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
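
As an illustration of such character properties, the sketch below counts the Unicode general category of each character in one facility name, using Python's standard `unicodedata` module. It is not the profiler's own implementation; the `category_counts` helper and the `CATEGORY_NAMES` mapping are hypothetical, and the mapping only covers the categories that occur in this example.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}

sample_name = "지하철연산동역(1호선)"  # one of the sample values of this variable
result = category_counts(sample_name)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the category tables for both text variables.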

Unique

Unique: 81
Unique (%): 91.0%

Sample

1st row: 대성학원
2nd row: 지하철교대역
3rd row: 지하철연산동역(1호선)
4th row: 지하철시청역
5th row: 지하철연산동역(3호선)

Value | Count | Frequency (%)
의료법인 | 4 | 3.3%
이마트 | 3 | 2.5%
트레이더스 | 3 | 2.5%
연산점 | 3 | 2.5%
성은의료재단 | 2 | 1.7%
챔피언 | 2 | 1.7%
블랙벨트 | 2 | 1.7%
(not captured) | 2 | 1.7%
이마트연제점 | 2 | 1.7%
홈플러스아시아드점 | 2 | 1.7%
Other values (94) | 95 | 79.2%

Most occurring characters

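Value | Count | Frequency (%)
(not captured) | 36 | 4.1%
(not captured) | 33 | 3.8%
(not captured) | 31 | 3.6%
(not captured) | 31 | 3.6%
(not captured) | 30 | 3.5%
(not captured) | 25 | 2.9%
(not captured) | 22 | 2.5%
(not captured) | 21 | 2.4%
(not captured) | 19 | 2.2%
(not captured) | 19 | 2.2%
Other values (186) | 602 | 69.3%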

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 795 | 91.5%
Space Separator | 31 | 3.6%
Uppercase Letter | 13 | 1.5%
Open Punctuation | 8 | 0.9%
Close Punctuation | 8 | 0.9%
Decimal Number | 8 | 0.9%
Other Punctuation | 3 | 0.3%
Other Symbol | 3 | 0.3%

Most frequent character per category

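Other Letter
Value | Count | Frequency (%)
(not captured) | 36 | 4.5%
(not captured) | 33 | 4.2%
(not captured) | 31 | 3.9%
(not captured) | 30 | 3.8%
(not captured) | 25 | 3.1%
(not captured) | 22 | 2.8%
(not captured) | 21 | 2.6%
(not captured) | 19 | 2.4%
(not captured) | 19 | 2.4%
(not captured) | 19 | 2.4%
Other values (169) | 540 | 67.9%

Uppercase Letter
Value | Count | Frequency (%)
C | 4 | 30.8%
P | 3 | 23.1%
K | 1 | 7.7%
S | 1 | 7.7%
X | 1 | 7.7%
O | 1 | 7.7%
G | 1 | 7.7%
V | 1 | 7.7%

Decimal Number
Value | Count | Frequency (%)
2 | 3 | 37.5%
3 | 2 | 25.0%
1 | 2 | 25.0%
4 | 1 | 12.5%

Space Separator
Value | Count | Frequency (%)
(space) | 31 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 8 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 8 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 3 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not captured) | 3 | 100.0%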

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 798 | 91.8%
Common | 58 | 6.7%
Latin | 13 | 1.5%

Most frequent character per script

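Hangul
Value | Count | Frequency (%)
(not captured) | 36 | 4.5%
(not captured) | 33 | 4.1%
(not captured) | 31 | 3.9%
(not captured) | 30 | 3.8%
(not captured) | 25 | 3.1%
(not captured) | 22 | 2.8%
(not captured) | 21 | 2.6%
(not captured) | 19 | 2.4%
(not captured) | 19 | 2.4%
(not captured) | 19 | 2.4%
Other values (170) | 543 | 68.0%

Common
Value | Count | Frequency (%)
(space) | 31 | 53.4%
( | 8 | 13.8%
) | 8 | 13.8%
. | 3 | 5.2%
2 | 3 | 5.2%
3 | 2 | 3.4%
1 | 2 | 3.4%
4 | 1 | 1.7%

Latin
Value | Count | Frequency (%)
C | 4 | 30.8%
P | 3 | 23.1%
K | 1 | 7.7%
S | 1 | 7.7%
X | 1 | 7.7%
O | 1 | 7.7%
G | 1 | 7.7%
V | 1 | 7.7%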

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 795 | 91.5%
ASCII | 71 | 8.2%
None | 3 | 0.3%

Most frequent character per block

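Hangul
Value | Count | Frequency (%)
(not captured) | 36 | 4.5%
(not captured) | 33 | 4.2%
(not captured) | 31 | 3.9%
(not captured) | 30 | 3.8%
(not captured) | 25 | 3.1%
(not captured) | 22 | 2.8%
(not captured) | 21 | 2.6%
(not captured) | 19 | 2.4%
(not captured) | 19 | 2.4%
(not captured) | 19 | 2.4%
Other values (169) | 540 | 67.9%

ASCII
Value | Count | Frequency (%)
(space) | 31 | 43.7%
( | 8 | 11.3%
) | 8 | 11.3%
C | 4 | 5.6%
. | 3 | 4.2%
P | 3 | 4.2%
2 | 3 | 4.2%
3 | 2 | 2.8%
1 | 2 | 2.8%
K | 1 | 1.4%
Other values (6) | 6 | 8.5%

None
Value | Count | Frequency (%)
(not captured) | 3 | 100.0%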

소재지
Text

Distinct: 80
Distinct (%): 89.9%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 54
Median length: 41
Mean length: 27.786517
Min length: 21

Characters and Unicode

Total characters: 2473
Distinct characters: 104
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 73
Unique (%): 82.0%

Sample

1st row: 부산광역시 연제구 거제대로252번길 20 (거제동) [1층 일부,2~4층,6~7층]
2nd row: 부산광역시 연제구 중앙대로 1217 (거제동) [지하]
3rd row: 부산광역시 연제구 중앙대로 1101 (연산동) [지하]
4th row: 부산광역시 연제구 중앙대로 1017 (연산동) [지하]
5th row: 부산광역시 연제구 중앙대로 1101 (연산동) [지하]

Value | Count | Frequency (%)
연제구 | 89 | 18.6%
부산광역시 | 88 | 18.4%
연산동 | 51 | 10.6%
거제동 | 25 | 5.2%
중앙대로 | 17 | 3.5%
월드컵대로 | 11 | 2.3%
지하 | 8 | 1.7%
과정로 | 6 | 1.3%
39 | 5 | 1.0%
아시아드대로 | 5 | 1.0%
Other values (129) | 174 | 36.3%

Most occurring characters

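Value | Count | Frequency (%)
(space) | 390 | 15.8%
(not captured) | 155 | 6.3%
(not captured) | 151 | 6.1%
(not captured) | 128 | 5.2%
(not captured) | 96 | 3.9%
(not captured) | 96 | 3.9%
(not captured) | 94 | 3.8%
( | 90 | 3.6%
(not captured) | 89 | 3.6%
(not captured) | 89 | 3.6%
Other values (94) | 1095 | 44.3%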

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1495 | 60.5%
Space Separator | 390 | 15.8%
Decimal Number | 329 | 13.3%
Open Punctuation | 105 | 4.2%
Close Punctuation | 104 | 4.2%
Other Punctuation | 35 | 1.4%
Math Symbol | 10 | 0.4%
Uppercase Letter | 3 | 0.1%
Dash Punctuation | 2 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 155 | 10.4%
(not captured) | 151 | 10.1%
(not captured) | 128 | 8.6%
(not captured) | 96 | 6.4%
(not captured) | 96 | 6.4%
(not captured) | 94 | 6.3%
(not captured) | 89 | 6.0%
(not captured) | 89 | 6.0%
(not captured) | 89 | 6.0%
(not captured) | 88 | 5.9%
Other values (73) | 420 | 28.1%

Decimal Number
Value | Count | Frequency (%)
1 | 85 | 25.8%
2 | 51 | 15.5%
5 | 34 | 10.3%
0 | 32 | 9.7%
3 | 31 | 9.4%
4 | 23 | 7.0%
8 | 22 | 6.7%
9 | 22 | 6.7%
7 | 18 | 5.5%
6 | 11 | 3.3%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 33.3%
F | 1 | 33.3%
C | 1 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 90 | 85.7%
[ | 15 | 14.3%

Close Punctuation
Value | Count | Frequency (%)
) | 89 | 85.6%
] | 15 | 14.4%

Space Separator
Value | Count | Frequency (%)
(space) | 390 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 35 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 10 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1495 | 60.5%
Common | 975 | 39.4%
Latin | 3 | 0.1%

Most frequent character per script

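Hangul
Value | Count | Frequency (%)
(not captured) | 155 | 10.4%
(not captured) | 151 | 10.1%
(not captured) | 128 | 8.6%
(not captured) | 96 | 6.4%
(not captured) | 96 | 6.4%
(not captured) | 94 | 6.3%
(not captured) | 89 | 6.0%
(not captured) | 89 | 6.0%
(not captured) | 89 | 6.0%
(not captured) | 88 | 5.9%
Other values (73) | 420 | 28.1%

Common
Value | Count | Frequency (%)
(space) | 390 | 40.0%
( | 90 | 9.2%
) | 89 | 9.1%
1 | 85 | 8.7%
2 | 51 | 5.2%
, | 35 | 3.6%
5 | 34 | 3.5%
0 | 32 | 3.3%
3 | 31 | 3.2%
4 | 23 | 2.4%
Other values (8) | 115 | 11.8%

Latin
Value | Count | Frequency (%)
B | 1 | 33.3%
F | 1 | 33.3%
C | 1 | 33.3%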

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1495 | 60.5%
ASCII | 978 | 39.5%

Most frequent character per block

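ASCII
Value | Count | Frequency (%)
(space) | 390 | 39.9%
( | 90 | 9.2%
) | 89 | 9.1%
1 | 85 | 8.7%
2 | 51 | 5.2%
, | 35 | 3.6%
5 | 34 | 3.5%
0 | 32 | 3.3%
3 | 31 | 3.2%
4 | 23 | 2.4%
Other values (11) | 118 | 12.1%

Hangul
Value | Count | Frequency (%)
(not captured) | 155 | 10.4%
(not captured) | 151 | 10.1%
(not captured) | 128 | 8.6%
(not captured) | 96 | 6.4%
(not captured) | 96 | 6.4%
(not captured) | 94 | 6.3%
(not captured) | 89 | 6.0%
(not captured) | 89 | 6.0%
(not captured) | 89 | 6.0%
(not captured) | 88 | 5.9%
Other values (73) | 420 | 28.1%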

Interactions

[Figure: interactions plot; image not captured]

Correlations

(variable) | 연번 | 시설군 | 시설명 | 소재지
연번 | 1.000 | 0.869 | 0.743 | 0.912
시설군 | 0.869 | 1.000 | 0.000 | 0.000
시설명 | 0.743 | 0.000 | 1.000 | 1.000
소재지 | 0.912 | 0.000 | 1.000 | 1.000

(variable) | 연번 | 시설군
연번 | 1.000 | 0.590
시설군 | 0.590 | 1.000
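
The report does not state which association measure produced these matrices. As background on why a row number such as 연번 can correlate with a category such as 시설군, the sketch below computes Cramér's V, one common measure of association between two categorical variables, on two hypothetical toy contingency tables. It is an assumption-laden illustration, not a reconstruction of the values above: when rows are listed group by group, each bin of row numbers contains a single category and the association is maximal.

```python
import math

def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows of counts.

    Assumes every row and every column contains at least one observation.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical toy tables: rows are bins of a row number, columns are categories.
grouped = [[5, 0], [0, 5]]    # each bin of row numbers holds one category
shuffled = [[3, 2], [2, 3]]   # categories spread across both bins

v_grouped = cramers_v(grouped)
v_shuffled = cramers_v(shuffled)
```

In this toy setting the grouped table reaches the maximum of 1, while the shuffled table gives a much weaker association.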

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 시설군 | 시설명 | 소재지
0 | 1 | 학원 | 대성학원 | 부산광역시 연제구 거제대로252번길 20 (거제동) [1층 일부,2~4층,6~7층]
1 | 2 | 지하역사 | 지하철교대역 | 부산광역시 연제구 중앙대로 1217 (거제동) [지하]
2 | 3 | 지하역사 | 지하철연산동역(1호선) | 부산광역시 연제구 중앙대로 1101 (연산동) [지하]
3 | 4 | 지하역사 | 지하철시청역 | 부산광역시 연제구 중앙대로 1017 (연산동) [지하]
4 | 5 | 지하역사 | 지하철연산동역(3호선) | 부산광역시 연제구 중앙대로 1101 (연산동) [지하]
5 | 6 | 지하역사 | 지하철종합운동장역 | 부산광역시 연제구 아시아드대로 73 (거제동) [지하]
6 | 7 | 지하역사 | 지하철거제역 | 부산광역시 연제구 월드컵대로 209 (거제동) [지하]
7 | 8 | 지하역사 | 지하철물만골역 | 부산광영시 연제구 월드컵대로 23 (연산동) [지하]
8 | 9 | 지하역사 | 지하철배산역 | 부산광역시 연제구 연수로 229 (연산동) [지하]
9 | 10 | 장례식장 | 부산의료원장례식장 | 부산광역시 연제구 월드컵대로 359 (거제동)

(index) | 연번 | 시설군 | 시설명 | 소재지
79 | 80 | 목욕장 | 발리24시대중사우나 | 부산광역시 연제구 월드컵대로 152 (연산동)
80 | 81 | 목욕장 | (주)해수피아 | 부산광역시 연제구 거제천로 258 (연산동)
81 | 82 | 목욕장 | 럭키랜드 | 부산광역시 연제구 토곡로 39 (연산동)
82 | 83 | 대규모점포 | 이마트연제점 | 부산광역시 연제구 연수로 89 (연산동)
83 | 84 | 대규모점포 | 홈플러스아시아드점 | 부산광역시 연제구 종합운동장로 7 (거제동)
84 | 85 | 대규모점포 | 이마트 트레이더스 연산점 | 부산광역시 연제구 좌수영로 241 (연산동)
85 | 86 | 노인요양시설 | 호산노인건강센터 | 부산광역시 연제구 화지로 103(거제동)
86 | 87 | PC영업시설 | 자드PC방 | 부산광역시 연제구 거제천로94, 4층 (연산동,인재빌딩)
87 | 88 | PC영업시설 | 메타PC방 부산본점 | 부산광역시 연제구 고분로13번길 25, 304호(연산동)
88 | 89 | PC영업시설 | OX PC 연미로점 | 부산광역시 연제구 연미로 3, 지하1층, 지상1층