Overview

Dataset statistics

Number of variables: 4
Number of observations: 35
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 36.8 B

Variable types

Numeric: 1
Text: 1
DateTime: 1
Categorical: 1

Dataset

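Description: 부산광역시남구_정보통신공사사용전검사현황_20210908 (Nam-gu, Busan Metropolitan City: status of pre-use inspections of information and communications construction work, as of 2021-09-08)
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3080535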

Alerts

공사의종류 is highly imbalanced (52.4%) [Imbalance]
순번 has unique values [Unique]
현장주소 has unique values [Unique]
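
The 52.4% in the imbalance alert is consistent with a normalized-entropy score: one minus the Shannon entropy of the class shares divided by the maximum entropy for that number of classes. The sketch below assumes that definition (the function name `imbalance_score` is hypothetical, not taken from the report) and uses the four class counts listed for 공사의종류 (28, 5, 1, 1).

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy (base 2)
    of the class shares and H_max = log2(number of classes).

    0.0 means perfectly balanced classes; 1.0 means a single class
    holds every observation.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in counts if c > 0
    )
    return 1.0 - entropy / math.log2(n_classes)


# Class counts for 공사의종류 as listed in this report.
counts = [28, 5, 1, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")  # prints 52.4%
```

Under this definition a perfectly even split such as `[5, 5, 5, 5]` scores 0.0, so 52.4% reflects the single category that holds 80% of the rows.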

Reproduction

Analysis started: 2023-12-10 17:06:30.171546
Analysis finished: 2023-12-10 17:06:30.903449
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (sequence number)
Real number (ℝ)

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18
Minimum: 1
Maximum: 35
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.7
Q1: 9.5
Median: 18
Q3: 26.5
95th percentile: 33.3
Maximum: 35
Range: 34
Interquartile range (IQR): 17
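
These quantiles match linear interpolation between order statistics for the values 1 through 35, which is what the 순번 column contains (35 distinct, strictly increasing integers from 1 to 35). The sketch below assumes that interpolation rule; the helper name `percentile` is hypothetical.

```python
def percentile(sorted_values, p):
    """Linear-interpolation percentile for 0 <= p <= 1.

    The fractional position (n - 1) * p is split into an integer index
    and a remainder; the result lies that far between the two
    neighbouring sorted values.
    """
    pos = (len(sorted_values) - 1) * p
    lower = int(pos)
    frac = pos - lower
    if lower + 1 >= len(sorted_values):
        return float(sorted_values[-1])
    step = sorted_values[lower + 1] - sorted_values[lower]
    return sorted_values[lower] + frac * step


values = list(range(1, 36))  # 순번 runs from 1 to 35

q05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
q95 = percentile(values, 0.95)
iqr = q3 - q1
value_range = values[-1] - values[0]

print(round(q05, 1), q1, median, q3, round(q95, 1), iqr, value_range)
```

For example, Q1 sits at position 34 × 0.25 = 8.5, halfway between the 9th and 10th values (9 and 10), which gives 9.5.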

Descriptive statistics

Standard deviation: 10.246951
Coefficient of variation (CV): 0.56927504
Kurtosis: -1.2
Mean: 18
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 630
Variance: 105
Monotonicity: Strictly increasing
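
Most of these figures can be checked by hand, because 순번 is simply the integers 1 to 35. The sketch below uses only Python's standard `statistics` module and assumes the report uses the sample (n - 1) variance, which is the convention that reproduces 105.

```python
import statistics

values = list(range(1, 36))  # 순번 runs from 1 to 35

total = sum(values)                      # 630
mean = statistics.mean(values)           # 18
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)       # 18
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

Kurtosis and skewness are left out here because the standard library has no functions for them.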
Histogram with fixed size bins (bins=35)

Common values

Value | Count | Frequency (%)
1 | 1 | 2.9%
2 | 1 | 2.9%
21 | 1 | 2.9%
22 | 1 | 2.9%
23 | 1 | 2.9%
24 | 1 | 2.9%
25 | 1 | 2.9%
26 | 1 | 2.9%
27 | 1 | 2.9%
28 | 1 | 2.9%
Other values (25) | 25 | 71.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.9%
2 | 1 | 2.9%
3 | 1 | 2.9%
4 | 1 | 2.9%
5 | 1 | 2.9%
6 | 1 | 2.9%
7 | 1 | 2.9%
8 | 1 | 2.9%
9 | 1 | 2.9%
10 | 1 | 2.9%

Maximum 10 values

Value | Count | Frequency (%)
35 | 1 | 2.9%
34 | 1 | 2.9%
33 | 1 | 2.9%
32 | 1 | 2.9%
31 | 1 | 2.9%
30 | 1 | 2.9%
29 | 1 | 2.9%
28 | 1 | 2.9%
27 | 1 | 2.9%
26 | 1 | 2.9%

현장주소 (site address)
Text

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 25
Median length: 24
Mean length: 20.8
Min length: 18

Characters and Unicode

Total characters: 728
Distinct characters: 33
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
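
As a small illustration of these character properties, the sketch below classifies each character of one address from this dataset by its Unicode general category, using Python's standard `unicodedata` module. The short codes map to the long names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator and `Pd` is Dash Punctuation. The standard library does not expose the script or block properties, so the Hangul check is an assumption based on the Hangul Syllables code point range (U+AC00 to U+D7A3).

```python
import unicodedata
from collections import Counter

# One address taken from the sample rows of this report.
address = "부산광역시 남구 대연동 1479-13"

# General category of every character, e.g. 'Lo', 'Nd', 'Zs', 'Pd'.
category_counts = Counter(unicodedata.category(ch) for ch in address)

# Rough block check: precomposed Hangul syllables occupy U+AC00..U+D7A3.
hangul_count = sum(1 for ch in address if 0xAC00 <= ord(ch) <= 0xD7A3)

for category, count in category_counts.most_common():
    print(category, count)
print("Hangul syllables:", hangul_count)
```

Running the same count over all 35 addresses would be one way to check the category totals listed in this section.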

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 부산광역시 남구 감만동 418번지 외 4필지
2nd row: 부산광역시 남구 대연동 1479-13
3rd row: 부산광역시 남구 용호동 41-25 외 1필지
4th row: 부산광역시 남구 대연동 1203-95
5th row: 부산광역시 남구 용호동 373-76번지

Value | Count | Frequency (%)
남구 | 35 | 22.9%
부산광역시 | 34 | 22.2%
대연동 | 16 | 10.5%
용호동 | 10 | 6.5%
[illegible] | 6 | 3.9%
문현동 | 6 | 3.9%
1필지 | 5 | 3.3%
감만동 | 2 | 1.3%
891-23번지 | 1 | 0.7%
141-51번지 | 1 | 0.7%
Other values (37) | 37 | 24.2%

Most occurring characters

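Value | Count | Frequency (%)
(space) | 118 | 16.2%
1 | 37 | 5.1%
[illegible] | 35 | 4.8%
[illegible] | 35 | 4.8%
[illegible] | 35 | 4.8%
[illegible] | 35 | 4.8%
[illegible] | 35 | 4.8%
[illegible] | 34 | 4.7%
[illegible] | 34 | 4.7%
[illegible] | 34 | 4.7%
Other values (23) | 296 | 40.7%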

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 408 | 56.0%
Decimal Number | 170 | 23.4%
Space Separator | 118 | 16.2%
Dash Punctuation | 32 | 4.4%

Most frequent character per category

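Other Letter
Value | Count | Frequency (%)
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 34 | 8.3%
[illegible] | 34 | 8.3%
[illegible] | 34 | 8.3%
[illegible] | 27 | 6.6%
[illegible] | 20 | 4.9%
Other values (11) | 84 | 20.6%

Decimal Number
Value | Count | Frequency (%)
1 | 37 | 21.8%
2 | 18 | 10.6%
8 | 17 | 10.0%
5 | 17 | 10.0%
9 | 16 | 9.4%
4 | 14 | 8.2%
6 | 14 | 8.2%
3 | 14 | 8.2%
0 | 14 | 8.2%
7 | 9 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 118 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 32 | 100.0%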

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 408 | 56.0%
Common | 320 | 44.0%

Most frequent character per script

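Hangul
Value | Count | Frequency (%)
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 34 | 8.3%
[illegible] | 34 | 8.3%
[illegible] | 34 | 8.3%
[illegible] | 27 | 6.6%
[illegible] | 20 | 4.9%
Other values (11) | 84 | 20.6%

Common
Value | Count | Frequency (%)
(space) | 118 | 36.9%
1 | 37 | 11.6%
- | 32 | 10.0%
2 | 18 | 5.6%
8 | 17 | 5.3%
5 | 17 | 5.3%
9 | 16 | 5.0%
4 | 14 | 4.4%
6 | 14 | 4.4%
3 | 14 | 4.4%
Other values (2) | 23 | 7.2%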

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 408 | 56.0%
ASCII | 320 | 44.0%

Most frequent character per block

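ASCII
Value | Count | Frequency (%)
(space) | 118 | 36.9%
1 | 37 | 11.6%
- | 32 | 10.0%
2 | 18 | 5.6%
8 | 17 | 5.3%
5 | 17 | 5.3%
9 | 16 | 5.0%
4 | 14 | 4.4%
6 | 14 | 4.4%
3 | 14 | 4.4%
Other values (2) | 23 | 7.2%

Hangul
Value | Count | Frequency (%)
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 35 | 8.6%
[illegible] | 34 | 8.3%
[illegible] | 34 | 8.3%
[illegible] | 34 | 8.3%
[illegible] | 27 | 6.6%
[illegible] | 20 | 4.9%
Other values (11) | 84 | 20.6%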

교부연월일 (issue date)
DateTime

Distinct: 28
Distinct (%): 80.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B
Minimum: 2021-01-13 00:00:00
Maximum: 2021-08-30 00:00:00
Histogram with fixed size bins (bins=28)

공사의종류 (type of construction work)
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 11.4%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 55
Median length: 25
Mean length: 28.885714
Min length: 25

Unique

Unique: 2
Unique (%): 5.7%

Sample

1st row: 구내통신선로설비,방송공동수신설비(종합유선방송)
2nd row: 구내통신선로설비,방송공동수신설비(종합유선방송)
3rd row: 구내통신선로설비,방송공동수신설비(종합유선방송)
4th row: 구내통신선로설비,방송공동수신설비(종합유선방송)
5th row: 구내통신선로설비,방송공동수신설비(종합유선방송)

Common Values

Value | Count | Frequency (%)
구내통신선로설비,방송공동수신설비(종합유선방송) | 28 | 80.0%
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송) | 5 | 14.3%
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송),이동통신구내선로설비 | 1 | 2.9%
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,종합유선방송) | 1 | 2.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
구내통신선로설비,방송공동수신설비(종합유선방송 | 28 | 80.0%
구내통신선로설비,방송공동수신설비(지상파tv,위성방송,fm라디오방송,종합유선방송 | 5 | 14.3%
구내통신선로설비,방송공동수신설비(지상파tv,위성방송,fm라디오방송,종합유선방송),이동통신구내선로설비 | 1 | 2.9%
구내통신선로설비,방송공동수신설비(지상파tv,위성방송,종합유선방송 | 1 | 2.9%

Interactions


Correlations

Variable | 순번 | 현장주소 | 교부연월일 | 공사의종류
순번 | 1.000 | 1.000 | 0.989 | 0.176
현장주소 | 1.000 | 1.000 | 1.000 | 1.000
교부연월일 | 0.989 | 1.000 | 1.000 | 1.000
공사의종류 | 0.176 | 1.000 | 1.000 | 1.000

Variable | 순번 | 공사의종류
순번 | 1.000 | 0.000
공사의종류 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 순번 | 현장주소 | 교부연월일 | 공사의종류
0 | 1 | 부산광역시 남구 감만동 418번지 외 4필지 | 2021-01-14 | 구내통신선로설비,방송공동수신설비(종합유선방송)
1 | 2 | 부산광역시 남구 대연동 1479-13 | 2021-01-13 | 구내통신선로설비,방송공동수신설비(종합유선방송)
2 | 3 | 부산광역시 남구 용호동 41-25 외 1필지 | 2021-01-14 | 구내통신선로설비,방송공동수신설비(종합유선방송)
3 | 4 | 부산광역시 남구 대연동 1203-95 | 2021-01-18 | 구내통신선로설비,방송공동수신설비(종합유선방송)
4 | 5 | 부산광역시 남구 용호동 373-76번지 | 2021-01-25 | 구내통신선로설비,방송공동수신설비(종합유선방송)
5 | 6 | 부산광역시 남구 용당동 217-16 | 2021-02-02 | 구내통신선로설비,방송공동수신설비(종합유선방송)
6 | 7 | 부산광역시 남구 대연동 888-16 | 2021-02-04 | 구내통신선로설비,방송공동수신설비(종합유선방송)
7 | 8 | 부산광역시 남구 문현동 202-30번지 | 2021-02-04 | 구내통신선로설비,방송공동수신설비(종합유선방송)
8 | 9 | 부산광역시 남구 대연동 1504-25 | 2021-02-16 | 구내통신선로설비,방송공동수신설비(종합유선방송)
9 | 10 | 부산 남구 대연동 281-30 외 1필지 | 2021-02-17 | 구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)

Last rows

Index | 순번 | 현장주소 | 교부연월일 | 공사의종류
25 | 26 | 부산광역시 남구 용호동 378-4번지 | 2021-07-27 | 구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
26 | 27 | 부산광역시 남구 문현동 266-6번지 | 2021-07-29 | 구내통신선로설비,방송공동수신설비(종합유선방송)
27 | 28 | 부산광역시 남구 용호동 87-30번지 | 2021-07-29 | 구내통신선로설비,방송공동수신설비(종합유선방송)
28 | 29 | 부산광역시 남구 문현동 183-49 | 2021-08-02 | 구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
29 | 30 | 부산광역시 남구 대연동 30-39번지 | 2021-08-05 | 구내통신선로설비,방송공동수신설비(종합유선방송)
30 | 31 | 부산광역시 남구 용호동 755-4외 1필지 | 2021-08-10 | 구내통신선로설비,방송공동수신설비(종합유선방송)
31 | 32 | 부산광역시 남구 문현동 202-62 외 1필지 | 2021-08-12 | 구내통신선로설비,방송공동수신설비(종합유선방송)
32 | 33 | 부산광역시 남구 용호동 215-15 외 1필지 | 2021-08-23 | 구내통신선로설비,방송공동수신설비(종합유선방송)
33 | 34 | 부산광역시 남구 용호동 559-1번지 | 2021-08-23 | 구내통신선로설비,방송공동수신설비(종합유선방송)
34 | 35 | 부산광역시 남구 문현동 119-87번지 | 2021-08-30 | 구내통신선로설비,방송공동수신설비(지상파TV,위성방송,종합유선방송)