Overview

Dataset statistics

Number of variables4
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory924.0 B
Average record size in memory38.5 B

Variable types

Numeric1
Text1
DateTime1
Categorical1

Dataset

Description부산광역시남구_정보통신공사사용전검사현황_20220607
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3080535

Alerts

순번 has unique valuesUnique
현장주소 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:06:17.282579
Analysis finished2023-12-10 17:06:17.806351
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.5
Minimum1
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-11T02:06:17.915825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.15
Q16.75
median12.5
Q318.25
95-th percentile22.85
Maximum24
Range23
Interquartile range (IQR)11.5

Descriptive statistics

Standard deviation7.0710678
Coefficient of variation (CV)0.56568542
Kurtosis-1.2
Mean12.5
Median Absolute Deviation (MAD)6
Skewness0
Sum300
Variance50
MonotonicityStrictly increasing
2023-12-11T02:06:18.085551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1 1
 
4.2%
14 1
 
4.2%
24 1
 
4.2%
23 1
 
4.2%
22 1
 
4.2%
21 1
 
4.2%
20 1
 
4.2%
19 1
 
4.2%
18 1
 
4.2%
17 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
1 1
4.2%
2 1
4.2%
3 1
4.2%
4 1
4.2%
5 1
4.2%
6 1
4.2%
7 1
4.2%
8 1
4.2%
9 1
4.2%
10 1
4.2%
ValueCountFrequency (%)
24 1
4.2%
23 1
4.2%
22 1
4.2%
21 1
4.2%
20 1
4.2%
19 1
4.2%
18 1
4.2%
17 1
4.2%
16 1
4.2%
15 1
4.2%

현장주소
Text

UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size324.0 B
2023-12-11T02:06:18.364018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20.5
Mean length18.25
Min length10

Characters and Unicode

Total characters438
Distinct characters33
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row남구 용호동 554-21번지
2nd row대연동 894-18번지
3rd row남구 용호동 407-10 외 6필지
4th row남구 감만동 33-8번지
5th row부산광역시 남구 대연동 231-56번지
ValueCountFrequency (%)
남구 22
23.4%
부산광역시 14
14.9%
대연동 12
12.8%
용호동 5
 
5.3%
4
 
4.3%
문현동 4
 
4.3%
1필지 3
 
3.2%
3필지 2
 
2.1%
용당동 2
 
2.1%
243-10 1
 
1.1%
Other values (25) 25
26.6%
2023-12-11T02:06:18.877924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
 
16.0%
1 28
 
6.4%
24
 
5.5%
- 24
 
5.5%
22
 
5.0%
22
 
5.0%
2 17
 
3.9%
15
 
3.4%
14
 
3.2%
14
 
3.2%
Other values (23) 188
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 222
50.7%
Decimal Number 122
27.9%
Space Separator 70
 
16.0%
Dash Punctuation 24
 
5.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
10.8%
22
 
9.9%
22
 
9.9%
15
 
6.8%
14
 
6.3%
14
 
6.3%
14
 
6.3%
14
 
6.3%
14
 
6.3%
12
 
5.4%
Other values (11) 57
25.7%
Decimal Number
ValueCountFrequency (%)
1 28
23.0%
2 17
13.9%
3 14
11.5%
4 13
10.7%
7 12
9.8%
5 11
 
9.0%
8 8
 
6.6%
6 7
 
5.7%
9 6
 
4.9%
0 6
 
4.9%
Space Separator
ValueCountFrequency (%)
70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 222
50.7%
Common 216
49.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
10.8%
22
 
9.9%
22
 
9.9%
15
 
6.8%
14
 
6.3%
14
 
6.3%
14
 
6.3%
14
 
6.3%
14
 
6.3%
12
 
5.4%
Other values (11) 57
25.7%
Common
ValueCountFrequency (%)
70
32.4%
1 28
 
13.0%
- 24
 
11.1%
2 17
 
7.9%
3 14
 
6.5%
4 13
 
6.0%
7 12
 
5.6%
5 11
 
5.1%
8 8
 
3.7%
6 7
 
3.2%
Other values (2) 12
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 222
50.7%
ASCII 216
49.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70
32.4%
1 28
 
13.0%
- 24
 
11.1%
2 17
 
7.9%
3 14
 
6.5%
4 13
 
6.0%
7 12
 
5.6%
5 11
 
5.1%
8 8
 
3.7%
6 7
 
3.2%
Other values (2) 12
 
5.6%
Hangul
ValueCountFrequency (%)
24
10.8%
22
 
9.9%
22
 
9.9%
15
 
6.8%
14
 
6.3%
14
 
6.3%
14
 
6.3%
14
 
6.3%
14
 
6.3%
12
 
5.4%
Other values (11) 57
25.7%
Distinct22
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size324.0 B
Minimum2022-01-10 00:00:00
Maximum2022-05-27 00:00:00
2023-12-11T02:06:19.071657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:06:19.283163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)

공사의종류
Categorical

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
구내통신선로설비,방송공동수신설비(종합유선방송)
16 
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
구내통신선로설비
 
1

Length

Max length44
Median length25
Mean length29.833333
Min length8

Unique

Unique1 ?
Unique (%)4.2%

Sample

1st row구내통신선로설비,방송공동수신설비(종합유선방송)
2nd row구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
3rd row구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
4th row구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
5th row구내통신선로설비,방송공동수신설비(종합유선방송)

Common Values

ValueCountFrequency (%)
구내통신선로설비,방송공동수신설비(종합유선방송) 16
66.7%
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송) 7
29.2%
구내통신선로설비 1
 
4.2%

Length

2023-12-11T02:06:19.533002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:06:19.710930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구내통신선로설비,방송공동수신설비(종합유선방송 16
66.7%
구내통신선로설비,방송공동수신설비(지상파tv,위성방송,fm라디오방송,종합유선방송 7
29.2%
구내통신선로설비 1
 
4.2%

Interactions

2023-12-11T02:06:17.464473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:06:19.851828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번현장주소교부연월일공사의종류
순번1.0001.0000.9450.000
현장주소1.0001.0001.0001.000
교부연월일0.9451.0001.0001.000
공사의종류0.0001.0001.0001.000
2023-12-11T02:06:20.052000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번공사의종류
순번1.0000.000
공사의종류0.0001.000

Missing values

2023-12-11T02:06:17.624957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:06:17.751408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번현장주소교부연월일공사의종류
01남구 용호동 554-21번지2022-01-10구내통신선로설비,방송공동수신설비(종합유선방송)
12대연동 894-18번지2022-01-20구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
23남구 용호동 407-10 외 6필지2022-01-26구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
34남구 감만동 33-8번지2022-01-27구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
45부산광역시 남구 대연동 231-56번지2022-02-10구내통신선로설비,방송공동수신설비(종합유선방송)
56남구 문현동 127-78번지2022-02-10구내통신선로설비,방송공동수신설비(종합유선방송)
67부산광역시 남구 대연동 1736-32022-02-18구내통신선로설비,방송공동수신설비(종합유선방송)
78부산광역시 남구 대연동 252-72022-02-23구내통신선로설비,방송공동수신설비(종합유선방송)
89부산광역시 남구 용당동 217-25번지2022-03-15구내통신선로설비
910부산광역시 남구 대연동 1170-112022-03-17구내통신선로설비,방송공동수신설비(종합유선방송)
순번현장주소교부연월일공사의종류
1415부산광역시 남구 대연동 1511-2외 1필지2022-04-07구내통신선로설비,방송공동수신설비(종합유선방송)
1516부산광역시 남구 대연동 243-102022-04-11구내통신선로설비,방송공동수신설비(종합유선방송)
1617남구 대연동 1258-172022-04-11구내통신선로설비,방송공동수신설비(종합유선방송)
1718부산광역시 남구 용당동 462-3 외 1필지2022-04-25구내통신선로설비,방송공동수신설비(종합유선방송)
1819대연동 1776-62022-05-11구내통신선로설비,방송공동수신설비(종합유선방송)
1920남구 문현동 424-12번지2022-05-12구내통신선로설비,방송공동수신설비(종합유선방송)
2021남구 문현동 119-50 외 1필지2022-05-13구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)
2122남구 문현동 238-73 외 3필지2022-05-17구내통신선로설비,방송공동수신설비(종합유선방송)
2223부산광역시 남구 대연동 999-32022-05-23구내통신선로설비,방송공동수신설비(종합유선방송)
2324부산광역시 남구 용호동 388-22022-05-27구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송)