Overview

Dataset statistics

Number of variables7
Number of observations90
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory59.5 B

Variable types

Numeric2
Categorical3
Text2

Dataset

Description실내공기질관리법 제3조(적용대상) 및 같은 법 시행령 제2조(적용대상)에 따른 연제구 다중이용시설 실내공기질 관리현황 자료
Author부산광역시 연제구
URLhttps://www.data.go.kr/data/15025083/fileData.do

Alerts

광역 has constant value ""Constant
기초 has constant value ""Constant
순번 is highly overall correlated with 시설군High correlation
시설군 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:16:12.114013
Analysis finished2023-12-12 18:16:13.246795
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct90
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.5
Minimum1
Maximum90
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size942.0 B
2023-12-13T03:16:13.345925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.45
Q123.25
median45.5
Q367.75
95-th percentile85.55
Maximum90
Range89
Interquartile range (IQR)44.5

Descriptive statistics

Standard deviation26.124701
Coefficient of variation (CV)0.57416925
Kurtosis-1.2
Mean45.5
Median Absolute Deviation (MAD)22.5
Skewness0
Sum4095
Variance682.5
MonotonicityStrictly increasing
2023-12-13T03:16:13.538492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
69 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
Other values (80) 80
88.9%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%

광역
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size852.0 B
부산광역시
90 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 90
100.0%

Length

2023-12-13T03:16:13.672675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:13.783330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 90
100.0%

기초
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size852.0 B
연제구
90 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연제구
2nd row연제구
3rd row연제구
4th row연제구
5th row연제구

Common Values

ValueCountFrequency (%)
연제구 90
100.0%

Length

2023-12-13T03:16:13.886434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:14.001351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연제구 90
100.0%
Distinct85
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size852.0 B
2023-12-13T03:16:14.264595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length9.0222222
Min length4

Characters and Unicode

Total characters812
Distinct characters196
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)88.9%

Sample

1st row대성학원
2nd row지하철교대역
3rd row지하철연산동역(1호선)
4th row지하철시청역
5th row지하철연산동역(3호선)
ValueCountFrequency (%)
트레이더스 3
 
2.5%
연산점 3
 
2.5%
의료법인 3
 
2.5%
주)해수피아 2
 
1.7%
홈플러스연산점 2
 
1.7%
홈플러스아시아드점 2
 
1.7%
이마트연제점 2
 
1.7%
성은의료재단 2
 
1.7%
챔피언 2
 
1.7%
2
 
1.7%
Other values (95) 96
80.7%
2023-12-13T03:16:14.776531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
4.2%
32
 
3.9%
29
 
3.6%
27
 
3.3%
24
 
3.0%
21
 
2.6%
21
 
2.6%
20
 
2.5%
16
 
2.0%
16
 
2.0%
Other values (186) 572
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 744
91.6%
Space Separator 29
 
3.6%
Uppercase Letter 17
 
2.1%
Open Punctuation 5
 
0.6%
Close Punctuation 5
 
0.6%
Decimal Number 5
 
0.6%
Lowercase Letter 4
 
0.5%
Other Symbol 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
4.6%
32
 
4.3%
27
 
3.6%
24
 
3.2%
21
 
2.8%
21
 
2.8%
20
 
2.7%
16
 
2.2%
16
 
2.2%
16
 
2.2%
Other values (166) 517
69.5%
Uppercase Letter
ValueCountFrequency (%)
C 5
29.4%
P 4
23.5%
K 2
 
11.8%
S 2
 
11.8%
Y 1
 
5.9%
L 1
 
5.9%
V 1
 
5.9%
G 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
4 1
20.0%
1 1
20.0%
3 1
20.0%
Lowercase Letter
ValueCountFrequency (%)
u 1
25.0%
c 1
25.0%
k 1
25.0%
y 1
25.0%
Space Separator
ValueCountFrequency (%)
29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 747
92.0%
Common 44
 
5.4%
Latin 21
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
4.6%
32
 
4.3%
27
 
3.6%
24
 
3.2%
21
 
2.8%
21
 
2.8%
20
 
2.7%
16
 
2.1%
16
 
2.1%
16
 
2.1%
Other values (167) 520
69.6%
Latin
ValueCountFrequency (%)
C 5
23.8%
P 4
19.0%
K 2
 
9.5%
S 2
 
9.5%
u 1
 
4.8%
c 1
 
4.8%
k 1
 
4.8%
Y 1
 
4.8%
L 1
 
4.8%
V 1
 
4.8%
Other values (2) 2
 
9.5%
Common
ValueCountFrequency (%)
29
65.9%
( 5
 
11.4%
) 5
 
11.4%
2 2
 
4.5%
4 1
 
2.3%
1 1
 
2.3%
3 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 744
91.6%
ASCII 65
 
8.0%
None 3
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
4.6%
32
 
4.3%
27
 
3.6%
24
 
3.2%
21
 
2.8%
21
 
2.8%
20
 
2.7%
16
 
2.2%
16
 
2.2%
16
 
2.2%
Other values (166) 517
69.5%
ASCII
ValueCountFrequency (%)
29
44.6%
C 5
 
7.7%
( 5
 
7.7%
) 5
 
7.7%
P 4
 
6.2%
K 2
 
3.1%
S 2
 
3.1%
2 2
 
3.1%
u 1
 
1.5%
c 1
 
1.5%
Other values (9) 9
 
13.8%
None
ValueCountFrequency (%)
3
100.0%

주소
Text

Distinct80
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size852.0 B
2023-12-13T03:16:15.074194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length37
Mean length23.511111
Min length13

Characters and Unicode

Total characters2116
Distinct characters103
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)80.0%

Sample

1st row부산광역시 거제대로252번길 20 (거제동) [1층 일부,2~4층,6~7층]
2nd row부산광역시 중앙대로 1217 (거제동) [지하]
3rd row부산광역시 중앙대로 1101 (연산동) [지하]
4th row부산광역시 중앙대로 1017 (연산동) [지하]
5th row부산광역시 중앙대로 1101 (연산동) [지하]
ValueCountFrequency (%)
부산광역시 90
22.6%
연산동 56
 
14.1%
거제동 23
 
5.8%
중앙대로 17
 
4.3%
월드컵대로 12
 
3.0%
지하 8
 
2.0%
과정로 7
 
1.8%
반송로 6
 
1.5%
7 4
 
1.0%
종합운동장로 4
 
1.0%
Other values (128) 171
43.0%
2023-12-13T03:16:15.552370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
310
 
14.7%
157
 
7.4%
98
 
4.6%
96
 
4.5%
96
 
4.5%
92
 
4.3%
90
 
4.3%
) 90
 
4.3%
( 90
 
4.3%
89
 
4.2%
Other values (93) 908
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1218
57.6%
Decimal Number 328
 
15.5%
Space Separator 310
 
14.7%
Close Punctuation 107
 
5.1%
Open Punctuation 107
 
5.1%
Other Punctuation 29
 
1.4%
Math Symbol 10
 
0.5%
Uppercase Letter 4
 
0.2%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
157
12.9%
98
 
8.0%
96
 
7.9%
96
 
7.9%
92
 
7.6%
90
 
7.4%
89
 
7.3%
71
 
5.8%
37
 
3.0%
37
 
3.0%
Other values (71) 355
29.1%
Decimal Number
ValueCountFrequency (%)
1 84
25.6%
2 49
14.9%
5 35
10.7%
0 30
 
9.1%
3 29
 
8.8%
8 26
 
7.9%
4 23
 
7.0%
9 20
 
6.1%
7 19
 
5.8%
6 13
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
25.0%
B 1
25.0%
F 1
25.0%
C 1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 90
84.1%
] 17
 
15.9%
Open Punctuation
ValueCountFrequency (%)
( 90
84.1%
[ 17
 
15.9%
Space Separator
ValueCountFrequency (%)
310
100.0%
Other Punctuation
ValueCountFrequency (%)
, 29
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1218
57.6%
Common 894
42.2%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
157
12.9%
98
 
8.0%
96
 
7.9%
96
 
7.9%
92
 
7.6%
90
 
7.4%
89
 
7.3%
71
 
5.8%
37
 
3.0%
37
 
3.0%
Other values (71) 355
29.1%
Common
ValueCountFrequency (%)
310
34.7%
) 90
 
10.1%
( 90
 
10.1%
1 84
 
9.4%
2 49
 
5.5%
5 35
 
3.9%
0 30
 
3.4%
, 29
 
3.2%
3 29
 
3.2%
8 26
 
2.9%
Other values (8) 122
 
13.6%
Latin
ValueCountFrequency (%)
A 1
25.0%
B 1
25.0%
F 1
25.0%
C 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1218
57.6%
ASCII 898
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
310
34.5%
) 90
 
10.0%
( 90
 
10.0%
1 84
 
9.4%
2 49
 
5.5%
5 35
 
3.9%
0 30
 
3.3%
, 29
 
3.2%
3 29
 
3.2%
8 26
 
2.9%
Other values (12) 126
14.0%
Hangul
ValueCountFrequency (%)
157
12.9%
98
 
8.0%
96
 
7.9%
96
 
7.9%
92
 
7.6%
90
 
7.4%
89
 
7.3%
71
 
5.8%
37
 
3.0%
37
 
3.0%
Other values (71) 355
29.1%

연면적
Real number (ℝ)

Distinct89
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7680.992
Minimum306.472
Maximum62816.73
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size942.0 B
2023-12-13T03:16:15.748790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum306.472
5-th percentile433.8635
Q11092.08
median3037.555
Q38706.625
95-th percentile34464.505
Maximum62816.73
Range62510.258
Interquartile range (IQR)7614.545

Descriptive statistics

Standard deviation12111.881
Coefficient of variation (CV)1.5768642
Kurtosis9.3340685
Mean7680.992
Median Absolute Deviation (MAD)2451.305
Skewness2.9198978
Sum691289.28
Variance1.4669766 × 108
MonotonicityNot monotonic
2023-12-13T03:16:15.972005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9983.74 2
 
2.2%
2445.0 1
 
1.1%
22215.5 1
 
1.1%
13928.86 1
 
1.1%
4210.98 1
 
1.1%
3617.25 1
 
1.1%
10518.95 1
 
1.1%
2961.46 1
 
1.1%
3899.52 1
 
1.1%
24670.0 1
 
1.1%
Other values (79) 79
87.8%
ValueCountFrequency (%)
306.472 1
1.1%
338.44 1
1.1%
357.0 1
1.1%
361.01 1
1.1%
425.57 1
1.1%
444.0 1
1.1%
447.6 1
1.1%
461.07 1
1.1%
506.0 1
1.1%
517.0 1
1.1%
ValueCountFrequency (%)
62816.73 1
1.1%
62816.0 1
1.1%
42140.0 1
1.1%
38985.0 1
1.1%
36628.65 1
1.1%
31819.44 1
1.1%
25253.0 1
1.1%
24670.0 1
1.1%
22717.57 1
1.1%
22215.5 1
1.1%

시설군
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)14.4%
Missing0
Missing (%)0.0%
Memory size852.0 B
의료기관
28 
어린이집
16 
실내주차장
16 
지하역사
대규모점포
Other values (8)
17 

Length

Max length9
Median length4
Mean length4.4666667
Min length2

Unique

Unique4 ?
Unique (%)4.4%

Sample

1st row학원
2nd row지하역사
3rd row지하역사
4th row지하역사
5th row지하역사

Common Values

ValueCountFrequency (%)
의료기관 28
31.1%
어린이집 16
17.8%
실내주차장 16
17.8%
지하역사 8
 
8.9%
대규모점포 5
 
5.6%
PC영업시설 5
 
5.6%
산후조리원 3
 
3.3%
목욕장 3
 
3.3%
실내어린이놀이시설 2
 
2.2%
학원 1
 
1.1%
Other values (3) 3
 
3.3%

Length

2023-12-13T03:16:16.129723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
의료기관 28
31.1%
어린이집 16
17.8%
실내주차장 16
17.8%
지하역사 8
 
8.9%
대규모점포 5
 
5.6%
pc영업시설 5
 
5.6%
산후조리원 3
 
3.3%
목욕장 3
 
3.3%
실내어린이놀이시설 2
 
2.2%
학원 1
 
1.1%
Other values (3) 3
 
3.3%

Interactions

2023-12-13T03:16:12.786755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:16:12.589306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:16:12.872238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:16:12.687859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:16:16.270355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번시설명주소연면적시설군
순번1.0000.6480.8960.3930.894
시설명0.6481.0001.0000.0000.739
주소0.8961.0001.0000.0000.000
연면적0.3930.0000.0001.0000.471
시설군0.8940.7390.0000.4711.000
2023-12-13T03:16:16.369992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연면적시설군
순번1.000-0.1560.645
연면적-0.1561.0000.233
시설군0.6450.2331.000

Missing values

2023-12-13T03:16:13.051730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:16:13.194741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번광역기초시설명주소연면적시설군
01부산광역시연제구대성학원부산광역시 거제대로252번길 20 (거제동) [1층 일부,2~4층,6~7층]2445.0학원
12부산광역시연제구지하철교대역부산광역시 중앙대로 1217 (거제동) [지하]8174.0지하역사
23부산광역시연제구지하철연산동역(1호선)부산광역시 중앙대로 1101 (연산동) [지하]10964.0지하역사
34부산광역시연제구지하철시청역부산광역시 중앙대로 1017 (연산동) [지하]7067.0지하역사
45부산광역시연제구지하철연산동역(3호선)부산광역시 중앙대로 1101 (연산동) [지하]14687.0지하역사
56부산광역시연제구지하철종합운동장역부산광역시 부산광영시 연제구 아시아드대로 73 (거제동) [지하]9108.0지하역사
67부산광역시연제구지하철거제역부산광역시 월드컵대로 209 (거제동) [지하]9744.0지하역사
78부산광역시연제구지하철물만골역부산광역시 부산광영시 연제구 월드컵대로 23 (연산동) [지하]9524.0지하역사
89부산광역시연제구지하철배산역부산광역시 연수로 229 (연산동) [지하]13057.0지하역사
910부산광역시연제구부산의료원장례식장부산광역시 월드컵대로 359 (거제동)3103.0장례식장
순번광역기초시설명주소연면적시설군
8081부산광역시연제구이마트연제점부산광역시 연수로 89 (연산동)25253.0대규모점포
8182부산광역시연제구홈플러스아시아드점부산광역시 종합운동장로 7 (거제동)31819.44대규모점포
8283부산광역시연제구홈플러스연산점부산광역시 반송로 88 (연산동)22717.57대규모점포
8384부산광역시연제구트레이더스 연산점부산광역시 좌수영로 241 (연산동)62816.0대규모점포
8485부산광역시연제구호산노인건강센터부산광역시 화지로 103(거제동)2996.0노인요양시설
8586부산광역시연제구자드PC방부산광역시 거제천로94, 4층 (연산동,인재빌딩)357.0PC영업시설
8687부산광역시연제구더락PC클럽부산광역시 고분로13번길 25 (연산동) [3층]306.472PC영업시설
8788부산광역시연제구SKY PC방부산광역시 과정로 144 (연산동) [8층]361.01PC영업시설
8889부산광역시연제구오엑스 피시방 토곡점부산광역시 과정로 314, 대일빌딩A동 3층 (연산동)425.57PC영업시설
8990부산광역시연제구Lucky PC부산광역시 월드컵대로 76, 2층 (연산동, 모티더베스트빌)338.44PC영업시설