Overview

Dataset statistics

Number of variables: 4
Number of observations: 254
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.3 KiB
Average record size in memory: 33.5 B
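
The derived figures in this table follow from the raw counts. The sketch below recomputes them in Python, assuming that "KiB" means 1024 bytes and using the rounded values as displayed, so agreement is only expected at the displayed precision. The variable names are illustrative.

```python
# Values copied from the Dataset statistics table (rounded as displayed).
n_variables = 4
n_observations = 254
missing_cells = 0
total_size_kib = 8.3  # assumption: 1 KiB = 1024 bytes

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size is the total memory divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```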

Variable types

Numeric: 1
Text: 1
DateTime: 1
Categorical: 1

Dataset

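Description: 부산광역시_강서구_어린이놀이시설안전검사결과_20230221 (Busan Metropolitan City, Gangseo-gu, children's play facility safety inspection results, 20230221)
Author: 부산광역시 강서구 (Gangseo-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15026036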

Alerts

Imbalance: 검사종류 (inspection type) is highly imbalanced (54.9%)
Unique: 시설번호 (facility number) has unique values
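
The 54.9% imbalance figure can be reproduced from the two class counts reported for 검사종류 (230 and 24). The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalized by the log of the number of classes. That definition is an assumption here, and `imbalance_score` is an illustrative name, though the result matches the reported value.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Class counts for 검사종류 as reported in this profile.
score = imbalance_score([230, 24])
print(f"{score:.1%}")
```

Under this definition a 50/50 split scores 0, so a score above 50% with only two classes reflects the roughly 9:1 split between the two inspection types.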

Reproduction

Analysis started: 2023-12-10 16:44:05.221796
Analysis finished: 2023-12-10 16:44:05.998138
Duration: 0.78 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설번호 (facility number)
Real number (ℝ)

UNIQUE

Distinct: 254
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 474055.44
Minimum: 13971
Maximum: 586071
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.4 KiB

Quantile statistics

Minimum: 13971
5-th percentile: 26340.85
Q1: 533840.25
Median: 551722.5
Q3: 565412.5
95-th percentile: 577257.4
Maximum: 586071
Range: 572100
Interquartile range (IQR): 31572.25
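
Range and IQR are simple differences of the quantiles listed here. A minimal Python check, using the values as displayed (variable names are illustrative):

```python
# Quantiles of 시설번호 copied from the Quantile statistics table.
minimum = 13971
q1 = 533840.25
q3 = 565412.5
maximum = 586071

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(f"Range: {value_range}")
print(f"Interquartile range (IQR): {iqr}")
```

The IQR is small relative to the range because a handful of low facility numbers sit far below the bulk of the values.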

Descriptive statistics

Standard deviation: 191106.95
Coefficient of variation (CV): 0.40313206
Kurtosis: 1.7091266
Mean: 474055.44
Median Absolute Deviation (MAD): 14717.5
Skewness: -1.9108943
Sum: 1.2041008 × 10^8
Variance: 3.6521866 × 10^10
Monotonicity: Strictly increasing

Histogram with fixed size bins (bins=50)
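
Several of the descriptive statistics are related to each other: CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The Python sketch below checks these relations with the rounded values as displayed, so it matches only to the displayed precision.

```python
# Values of 시설번호 copied from the report (rounded as displayed).
mean = 474055.44
std = 191106.95
n_observations = 254

cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
total = mean * n_observations  # sum recovered from the mean

print(f"CV: {cv:.4f}")
print(f"Variance: {variance:.4e}")
print(f"Sum: {total:.4e}")
```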

Common values
Value | Count | Frequency (%)
13971 | 1 | 0.4%
561452 | 1 | 0.4%
557438 | 1 | 0.4%
557451 | 1 | 0.4%
557779 | 1 | 0.4%
557791 | 1 | 0.4%
557792 | 1 | 0.4%
557875 | 1 | 0.4%
558795 | 1 | 0.4%
559095 | 1 | 0.4%
Other values (244) | 244 | 96.1%

Minimum 10 values
Value | Count | Frequency (%)
13971 | 1 | 0.4%
13972 | 1 | 0.4%
13973 | 1 | 0.4%
13974 | 1 | 0.4%
13975 | 1 | 0.4%
13981 | 1 | 0.4%
13983 | 1 | 0.4%
24259 | 1 | 0.4%
26110 | 1 | 0.4%
26111 | 1 | 0.4%

Maximum 10 values
Value | Count | Frequency (%)
586071 | 1 | 0.4%
582863 | 1 | 0.4%
582791 | 1 | 0.4%
582343 | 1 | 0.4%
582342 | 1 | 0.4%
581326 | 1 | 0.4%
580784 | 1 | 0.4%
580084 | 1 | 0.4%
579854 | 1 | 0.4%
579637 | 1 | 0.4%

놀이시설명 (play facility name)
Text

Distinct: 253
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 39
Median length: 27
Mean length: 19.389764
Min length: 4

Characters and Unicode

Total characters: 4925
Distinct characters: 277
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
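
As an illustration of these character properties, the sketch below counts the Unicode general categories in one of the sample facility names using Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Ps` is Open Punctuation, `Pe` is Close Punctuation, and `Zs` is Space Separator. The helper name `category_counts` is illustrative, and this is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# 5th sample row of this variable.
name = "용두공원(대저2공원)"
counts = category_counts(name)

for category, count in counts.most_common():
    print(category, count)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category table for this variable.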

Unique

Unique: 252
Unique (%): 99.2%
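
"Distinct" and "Unique" measure different things. The sketch below assumes that Distinct counts the different values present and Unique counts the values that occur exactly once. This reading is consistent with the figures here: 254 rows, 253 distinct names, and 252 unique names imply that exactly one name appears twice. The input list is hypothetical and not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values occurring once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical values: "b" occurs twice, so it is distinct but not unique.
values = ["a", "b", "b", "c"]
print(distinct_and_unique(values))
```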

Sample

1st row: 신호 월더하임아파트 어린이놀이터-1 (101동 앞)
2nd row: 신호 월더하임아파트 어린이놀이터-2 (104동 앞)
3rd row: 신호 월더하임아파트 어린이놀이터-3 (108동 앞)
4th row: 범방어린이공원
5th row: 용두공원(대저2공원)

Value | Count | Frequency (%)
명지 | 96 | 12.3%
(value not preserved) | 31 | 4.0%
지사 | 19 | 2.4%
놀이터 | 18 | 2.3%
어린이놀이터1 | 16 | 2.1%
어린이놀이터2 | 15 | 1.9%
부산신호사랑으로부영 | 12 | 1.5%
어린이놀이터-3 | 12 | 1.5%
어린이놀이터-2 | 12 | 1.5%
명지퍼스트월드 | 11 | 1.4%
Other values (282) | 536 | 68.9%

Most occurring characters

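Value | Count | Frequency (%)
(space) | 524 | 10.6%
(Hangul character not preserved) | 404 | 8.2%
(Hangul character not preserved) | 206 | 4.2%
(Hangul character not preserved) | 194 | 3.9%
(Hangul character not preserved) | 188 | 3.8%
(Hangul character not preserved) | 178 | 3.6%
(Hangul character not preserved) | 178 | 3.6%
1 | 159 | 3.2%
(Hangul character not preserved) | 136 | 2.8%
) | 133 | 2.7%
Other values (267) | 2625 | 53.3%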

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3568 | 72.4%
Space Separator | 524 | 10.6%
Decimal Number | 480 | 9.7%
Close Punctuation | 133 | 2.7%
Open Punctuation | 133 | 2.7%
Dash Punctuation | 44 | 0.9%
Uppercase Letter | 37 | 0.8%
Other Punctuation | 4 | 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

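Other Letter
Value | Count | Frequency (%)
(character not preserved) | 404 | 11.3%
(character not preserved) | 206 | 5.8%
(character not preserved) | 194 | 5.4%
(character not preserved) | 188 | 5.3%
(character not preserved) | 178 | 5.0%
(character not preserved) | 178 | 5.0%
(character not preserved) | 136 | 3.8%
(character not preserved) | 103 | 2.9%
(character not preserved) | 76 | 2.1%
(character not preserved) | 72 | 2.0%
Other values (244) | 1833 | 51.4%

Decimal Number
Value | Count | Frequency (%)
1 | 159 | 33.1%
2 | 94 | 19.6%
0 | 75 | 15.6%
3 | 64 | 13.3%
4 | 25 | 5.2%
5 | 18 | 3.8%
6 | 13 | 2.7%
7 | 12 | 2.5%
8 | 11 | 2.3%
9 | 9 | 1.9%

Uppercase Letter
Value | Count | Frequency (%)
S | 16 | 43.2%
C | 11 | 29.7%
D | 6 | 16.2%
A | 1 | 2.7%
K | 1 | 2.7%
R | 1 | 2.7%
B | 1 | 2.7%

Space Separator
Value | Count | Frequency (%)
(space) | 524 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 133 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 133 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 44 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 4 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%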

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3568 | 72.4%
Common | 1320 | 26.8%
Latin | 37 | 0.8%

Most frequent character per script

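Hangul
Value | Count | Frequency (%)
(character not preserved) | 404 | 11.3%
(character not preserved) | 206 | 5.8%
(character not preserved) | 194 | 5.4%
(character not preserved) | 188 | 5.3%
(character not preserved) | 178 | 5.0%
(character not preserved) | 178 | 5.0%
(character not preserved) | 136 | 3.8%
(character not preserved) | 103 | 2.9%
(character not preserved) | 76 | 2.1%
(character not preserved) | 72 | 2.0%
Other values (244) | 1833 | 51.4%

Common
Value | Count | Frequency (%)
(space) | 524 | 39.7%
1 | 159 | 12.0%
) | 133 | 10.1%
( | 133 | 10.1%
2 | 94 | 7.1%
0 | 75 | 5.7%
3 | 64 | 4.8%
- | 44 | 3.3%
4 | 25 | 1.9%
5 | 18 | 1.4%
Other values (6) | 51 | 3.9%

Latin
Value | Count | Frequency (%)
S | 16 | 43.2%
C | 11 | 29.7%
D | 6 | 16.2%
A | 1 | 2.7%
K | 1 | 2.7%
R | 1 | 2.7%
B | 1 | 2.7%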

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3568 | 72.4%
ASCII | 1357 | 27.6%

Most frequent character per block

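ASCII
Value | Count | Frequency (%)
(space) | 524 | 38.6%
1 | 159 | 11.7%
) | 133 | 9.8%
( | 133 | 9.8%
2 | 94 | 6.9%
0 | 75 | 5.5%
3 | 64 | 4.7%
- | 44 | 3.2%
4 | 25 | 1.8%
5 | 18 | 1.3%
Other values (13) | 88 | 6.5%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 404 | 11.3%
(character not preserved) | 206 | 5.8%
(character not preserved) | 194 | 5.4%
(character not preserved) | 188 | 5.3%
(character not preserved) | 178 | 5.0%
(character not preserved) | 178 | 5.0%
(character not preserved) | 136 | 3.8%
(character not preserved) | 103 | 2.9%
(character not preserved) | 76 | 2.1%
(character not preserved) | 72 | 2.0%
Other values (244) | 1833 | 51.4%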

검사일자 (inspection date)
DateTime

Distinct: 106
Distinct (%): 41.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
Minimum: 2020-11-25 00:00:00
Maximum: 2023-02-13 00:00:00

Histogram with fixed size bins (bins=50)
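
The minimum and maximum dates bound the period the inspections cover. A small Python sketch of the span between them, using the standard `datetime` module (variable names are illustrative):

```python
from datetime import date

# Earliest and latest inspection dates reported for this variable.
earliest = date(2020, 11, 25)
latest = date(2023, 2, 13)

# Subtracting two dates gives a timedelta; .days is the span in whole days.
span = latest - earliest
print(f"Span: {span.days} days")
```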

검사종류 (inspection type)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

정기시설검사 (periodic facility inspection): 230
설치검사 (installation inspection): 24

Length

Max length: 6
Median length: 6
Mean length: 5.8110236
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 정기시설검사
2nd row: 정기시설검사
3rd row: 정기시설검사
4th row: 정기시설검사
5th row: 정기시설검사

Common Values

Value | Count | Frequency (%)
정기시설검사 (periodic facility inspection) | 230 | 90.6%
설치검사 (installation inspection) | 24 | 9.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
정기시설검사 | 230 | 90.6%
설치검사 | 24 | 9.4%

Interactions


Correlations

 | 시설번호 | 검사종류
시설번호 | 1.000 | 0.070
검사종류 | 0.070 | 1.000

 | 시설번호 | 검사종류
시설번호 | 1.000 | 0.112
검사종류 | 0.112 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

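Columns: 시설번호 (facility number), 놀이시설명 (play facility name), 검사일자 (inspection date), 검사종류 (inspection type). Values of 검사종류: 정기시설검사 (periodic facility inspection), 설치검사 (installation inspection).

First rows
Index | 시설번호 | 놀이시설명 | 검사일자 | 검사종류
0 | 13971 | 신호 월더하임아파트 어린이놀이터-1 (101동 앞) | 2022-08-29 | 정기시설검사
1 | 13972 | 신호 월더하임아파트 어린이놀이터-2 (104동 앞) | 2022-08-29 | 정기시설검사
2 | 13973 | 신호 월더하임아파트 어린이놀이터-3 (108동 앞) | 2022-08-29 | 정기시설검사
3 | 13974 | 범방어린이공원 | 2021-05-11 | 정기시설검사
4 | 13975 | 용두공원(대저2공원) | 2022-11-28 | 정기시설검사
5 | 13981 | 오봉산체육공원 어린이놀이터 | 2023-01-04 | 정기시설검사
6 | 13983 | 등구놀이터 | 2021-12-02 | 정기시설검사
7 | 24259 | 성산이주단지 놀이터 | 2023-01-04 | 정기시설검사
8 | 26110 | 명지 극동스타클래스 어린이놀이터1 | 2021-03-15 | 정기시설검사
9 | 26111 | 명지 극동스타클래스 어린이놀이터2 | 2021-03-15 | 설치검사

Last rows
Index | 시설번호 | 놀이시설명 | 검사일자 | 검사종류
244 | 579637 | 부산명지행복주택아파트 어린이놀이터 | 2021-09-15 | 설치검사
245 | 579854 | 스타필드시티명지점 어린이놀이시설 | 2021-09-28 | 설치검사
246 | 580084 | 해성어린이집 놀이터 | 2021-10-20 | 설치검사
247 | 580784 | 강서구육아종합지원센터 실내놀이시설 | 2021-11-05 | 설치검사
248 | 581326 | 제4호수변공원 어린이놀이공원 | 2022-01-19 | 설치검사
249 | 582342 | 제9호 어린이공원 | 2022-05-23 | 설치검사
250 | 582343 | 제10호 어린이공원 | 2022-05-23 | 설치검사
251 | 582791 | 명지 호반베르디움 2차 실내놀이터 | 2022-05-13 | 설치검사
252 | 582863 | 더샵 명지퍼스트월드 3단지 키즈풀(304동과 306동 사이) | 2022-07-05 | 설치검사
253 | 586071 | 일공공샤브 | 2023-01-18 | 설치검사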