Overview

Dataset statistics

Number of variables4
Number of observations91
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory34.5 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description실내공기질 관리법에 의한 다중이용시설 등 실내공기질 관리대상에 대한 데이터로 시설구분, 시설명, 소재지에 대한 자료
URLhttps://www.data.go.kr/data/15037443/fileData.do

Alerts

연번 is highly overall correlated with 시설군High correlation
시설군 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:42:45.378045
Analysis finished2023-12-11 23:42:46.066102
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct91
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46
Minimum1
Maximum91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size951.0 B
2023-12-12T08:42:46.148914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.5
Q123.5
median46
Q368.5
95-th percentile86.5
Maximum91
Range90
Interquartile range (IQR)45

Descriptive statistics

Standard deviation26.41338
Coefficient of variation (CV)0.57420392
Kurtosis-1.2
Mean46
Median Absolute Deviation (MAD)23
Skewness0
Sum4186
Variance697.66667
MonotonicityStrictly increasing
2023-12-12T08:42:46.342021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
59 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
Other values (81) 81
89.0%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%

시설군
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
의료기관
30 
실내주차장
16 
어린이집
16 
지하역사
대규모점포
Other values (8)
16 

Length

Max length9
Median length4
Mean length4.4395604
Min length2

Unique

Unique4 ?
Unique (%)4.4%

Sample

1st row지하역사
2nd row지하역사
3rd row지하역사
4th row지하역사
5th row지하역사

Common Values

ValueCountFrequency (%)
의료기관 30
33.0%
실내주차장 16
17.6%
어린이집 16
17.6%
지하역사 8
 
8.8%
대규모점포 5
 
5.5%
PC영업시설 4
 
4.4%
목욕장 3
 
3.3%
산후조리원 3
 
3.3%
실내어린이놀이시설 2
 
2.2%
장례식장 1
 
1.1%
Other values (3) 3
 
3.3%

Length

2023-12-12T08:42:46.505903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
의료기관 30
33.0%
실내주차장 16
17.6%
어린이집 16
17.6%
지하역사 8
 
8.8%
대규모점포 5
 
5.5%
pc영업시설 4
 
4.4%
목욕장 3
 
3.3%
산후조리원 3
 
3.3%
실내어린이놀이시설 2
 
2.2%
장례식장 1
 
1.1%
Other values (3) 3
 
3.3%
Distinct86
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-12T08:42:46.795389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17
Mean length7.7582418
Min length4

Characters and Unicode

Total characters706
Distinct characters191
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)89.0%

Sample

1st row지하철교대역
2nd row지하철연산역(1호선)
3rd row지하철시청역
4th row지하철연산역(3호선)
5th row지하철종합운동장역
ValueCountFrequency (%)
트레이더스 3
 
2.8%
연산점 3
 
2.8%
이마트연제점 2
 
1.9%
블랙벨트 2
 
1.9%
홈플러스연산점 2
 
1.9%
홈플러스아시아드점 2
 
1.9%
주)해수피아 2
 
1.9%
챔피언 2
 
1.9%
2
 
1.9%
연산당당한방병원 1
 
0.9%
Other values (85) 85
80.2%
2023-12-12T08:42:47.306091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
5.4%
34
 
4.8%
29
 
4.1%
25
 
3.5%
22
 
3.1%
16
 
2.3%
16
 
2.3%
16
 
2.3%
16
 
2.3%
15
 
2.1%
Other values (181) 479
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 652
92.4%
Uppercase Letter 17
 
2.4%
Space Separator 15
 
2.1%
Open Punctuation 5
 
0.7%
Close Punctuation 5
 
0.7%
Decimal Number 5
 
0.7%
Lowercase Letter 4
 
0.6%
Other Symbol 2
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
5.8%
34
 
5.2%
29
 
4.4%
25
 
3.8%
22
 
3.4%
16
 
2.5%
16
 
2.5%
16
 
2.5%
16
 
2.5%
13
 
2.0%
Other values (160) 427
65.5%
Uppercase Letter
ValueCountFrequency (%)
C 5
29.4%
P 4
23.5%
K 2
 
11.8%
S 2
 
11.8%
L 1
 
5.9%
Y 1
 
5.9%
V 1
 
5.9%
G 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
4 1
20.0%
3 1
20.0%
1 1
20.0%
Lowercase Letter
ValueCountFrequency (%)
y 1
25.0%
k 1
25.0%
c 1
25.0%
u 1
25.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 654
92.6%
Common 31
 
4.4%
Latin 21
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
5.8%
34
 
5.2%
29
 
4.4%
25
 
3.8%
22
 
3.4%
16
 
2.4%
16
 
2.4%
16
 
2.4%
16
 
2.4%
13
 
2.0%
Other values (161) 429
65.6%
Latin
ValueCountFrequency (%)
C 5
23.8%
P 4
19.0%
K 2
 
9.5%
S 2
 
9.5%
y 1
 
4.8%
k 1
 
4.8%
c 1
 
4.8%
u 1
 
4.8%
L 1
 
4.8%
Y 1
 
4.8%
Other values (2) 2
 
9.5%
Common
ValueCountFrequency (%)
15
48.4%
( 5
 
16.1%
) 5
 
16.1%
2 2
 
6.5%
. 1
 
3.2%
4 1
 
3.2%
3 1
 
3.2%
1 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 652
92.4%
ASCII 52
 
7.4%
None 2
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
38
 
5.8%
34
 
5.2%
29
 
4.4%
25
 
3.8%
22
 
3.4%
16
 
2.5%
16
 
2.5%
16
 
2.5%
16
 
2.5%
13
 
2.0%
Other values (160) 427
65.5%
ASCII
ValueCountFrequency (%)
15
28.8%
C 5
 
9.6%
( 5
 
9.6%
) 5
 
9.6%
P 4
 
7.7%
K 2
 
3.8%
2 2
 
3.8%
S 2
 
3.8%
. 1
 
1.9%
y 1
 
1.9%
Other values (10) 10
19.2%
None
ValueCountFrequency (%)
2
100.0%
Distinct81
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-12T08:42:47.716048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length39
Mean length27.263736
Min length21

Characters and Unicode

Total characters2481
Distinct characters104
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)80.2%

Sample

1st row부산광역시 연제구 중앙대로 1217 (거제동) [지하]
2nd row부산광역시 연제구 중앙대로 1101 (연산동) [지하]
3rd row부산광역시 연제구 중앙대로 1017 (연산동) [지하]
4th row부산광역시 연제구 중앙대로 1101 (연산동) [지하]
5th row부산광역시 연제구 아시아드대로 73 (거제동) [지하]
ValueCountFrequency (%)
부산광역시 91
18.7%
연제구 91
18.7%
연산동 55
 
11.3%
거제동 23
 
4.7%
중앙대로 18
 
3.7%
월드컵대로 12
 
2.5%
지하 8
 
1.6%
과정로 7
 
1.4%
반송로 6
 
1.2%
종합운동장로 4
 
0.8%
Other values (129) 171
35.2%
2023-12-12T08:42:48.302151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
395
 
15.9%
160
 
6.4%
157
 
6.3%
128
 
5.2%
97
 
3.9%
96
 
3.9%
95
 
3.8%
) 92
 
3.7%
( 92
 
3.7%
91
 
3.7%
Other values (94) 1078
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1487
59.9%
Space Separator 395
 
15.9%
Decimal Number 336
 
13.5%
Close Punctuation 108
 
4.4%
Open Punctuation 108
 
4.4%
Other Punctuation 32
 
1.3%
Math Symbol 9
 
0.4%
Dash Punctuation 3
 
0.1%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
160
10.8%
157
10.6%
128
 
8.6%
97
 
6.5%
96
 
6.5%
95
 
6.4%
91
 
6.1%
91
 
6.1%
91
 
6.1%
91
 
6.1%
Other values (73) 390
26.2%
Decimal Number
ValueCountFrequency (%)
1 86
25.6%
2 53
15.8%
5 34
 
10.1%
0 32
 
9.5%
3 28
 
8.3%
8 26
 
7.7%
4 25
 
7.4%
9 21
 
6.2%
7 18
 
5.4%
6 13
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
C 1
33.3%
F 1
33.3%
B 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 92
85.2%
] 16
 
14.8%
Open Punctuation
ValueCountFrequency (%)
( 92
85.2%
[ 16
 
14.8%
Space Separator
ValueCountFrequency (%)
395
100.0%
Other Punctuation
ValueCountFrequency (%)
, 32
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1487
59.9%
Common 991
39.9%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
160
10.8%
157
10.6%
128
 
8.6%
97
 
6.5%
96
 
6.5%
95
 
6.4%
91
 
6.1%
91
 
6.1%
91
 
6.1%
91
 
6.1%
Other values (73) 390
26.2%
Common
ValueCountFrequency (%)
395
39.9%
) 92
 
9.3%
( 92
 
9.3%
1 86
 
8.7%
2 53
 
5.3%
5 34
 
3.4%
0 32
 
3.2%
, 32
 
3.2%
3 28
 
2.8%
8 26
 
2.6%
Other values (8) 121
 
12.2%
Latin
ValueCountFrequency (%)
C 1
33.3%
F 1
33.3%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1487
59.9%
ASCII 994
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
395
39.7%
) 92
 
9.3%
( 92
 
9.3%
1 86
 
8.7%
2 53
 
5.3%
5 34
 
3.4%
0 32
 
3.2%
, 32
 
3.2%
3 28
 
2.8%
8 26
 
2.6%
Other values (11) 124
 
12.5%
Hangul
ValueCountFrequency (%)
160
10.8%
157
10.6%
128
 
8.6%
97
 
6.5%
96
 
6.5%
95
 
6.4%
91
 
6.1%
91
 
6.1%
91
 
6.1%
91
 
6.1%
Other values (73) 390
26.2%

Interactions

2023-12-12T08:42:45.815198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:42:48.423843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군시설명소재지
연번1.0000.8860.6870.798
시설군0.8861.0000.7180.000
시설명0.6870.7181.0001.000
소재지0.7980.0001.0001.000
2023-12-12T08:42:48.514750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군
연번1.0000.601
시설군0.6011.000

Missing values

2023-12-12T08:42:45.943115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:42:46.033742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설군시설명소재지
01지하역사지하철교대역부산광역시 연제구 중앙대로 1217 (거제동) [지하]
12지하역사지하철연산역(1호선)부산광역시 연제구 중앙대로 1101 (연산동) [지하]
23지하역사지하철시청역부산광역시 연제구 중앙대로 1017 (연산동) [지하]
34지하역사지하철연산역(3호선)부산광역시 연제구 중앙대로 1101 (연산동) [지하]
45지하역사지하철종합운동장역부산광역시 연제구 아시아드대로 73 (거제동) [지하]
56지하역사지하철거제역부산광역시 연제구 월드컵대로 209 (거제동) [지하]
67지하역사지하철물만골역부산광역시 연제구 월드컵대로 23 (연산동) [지하]
78지하역사지하철배산역부산광역시 연제구 연수로 229 (연산동) [지하]
89장례식장부산의료원장례식장부산광역시 연제구 월드컵대로 359 (거제동)
910목욕장발리24시대중사우나부산광역시 연제구 월드컵대로 152 (연산동)
연번시설군시설명소재지
8182산후조리원마미사랑산후조리원부산광역시 연제구 반송로 28-1, 6~9층
8283의료기관아시아드요양병원부산광역시 연제구 월드컵대로 161, 2층~8층, 11층(연산동)
8384어린이집토끼와당근어린이집부산광역시 연제구 고분로235번길 26-1 (연산동)
8485어린이집더샵파크시티어린이집부산광역시 연제구 안연로 33, 106동 (연산동, 더샵파크시티아파트)
8586노인요양시설호산노인건강센터부산광역시 연제구 화지로 103(거제동)
8687어린이집하나금융어린이집부산광역시 연제구 반송로 10 (연산동)
8788어린이집예지어린이집부산광역시 연제구 해맞이로 97 (거제동)
8889의료기관이진용맘병원부산광역시 연제구 거제대로 295,8층~11층 (거제동)
8990의료기관한빛병원부산광역시 연제구 과정로 140, 부산은행 4층~10층 (연산동)
9091의료기관이안과의원부산광역시 연제구 중앙대로 1129 (연산동)