Overview

Dataset statistics

Number of variables3
Number of observations489
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.1 KiB
Average record size in memory25.3 B

Variable types

Numeric1
Text2

Dataset

Description부산광역시 북구 관내에 존재하는 소독의무대상시설 현황에 대한 정보로 연번, 업소명, 주소 등의 항목을 제공하고 있습니다.
Author부산광역시 북구
URLhttps://www.data.go.kr/data/15005977/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:46:33.815328
Analysis finished2023-12-12 18:46:34.476163
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct489
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean245
Minimum1
Maximum489
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.4 KiB
2023-12-13T03:46:34.585251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile25.4
Q1123
median245
Q3367
95-th percentile464.6
Maximum489
Range488
Interquartile range (IQR)244

Descriptive statistics

Standard deviation141.3064
Coefficient of variation (CV)0.57676084
Kurtosis-1.2
Mean245
Median Absolute Deviation (MAD)122
Skewness0
Sum119805
Variance19967.5
MonotonicityStrictly increasing
2023-12-13T03:46:34.817065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
337 1
 
0.2%
335 1
 
0.2%
334 1
 
0.2%
333 1
 
0.2%
332 1
 
0.2%
331 1
 
0.2%
330 1
 
0.2%
329 1
 
0.2%
328 1
 
0.2%
Other values (479) 479
98.0%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
489 1
0.2%
488 1
0.2%
487 1
0.2%
486 1
0.2%
485 1
0.2%
484 1
0.2%
483 1
0.2%
482 1
0.2%
481 1
0.2%
480 1
0.2%
Distinct477
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-13T03:46:35.229332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length7.1451943
Min length2

Characters and Unicode

Total characters3494
Distinct characters404
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique465 ?
Unique (%)95.1%

Sample

1st row부민병원
2nd row구포 성심병원
3rd row구포 부민병원
4th row미래로병원
5th row베스티안부산병원
ValueCountFrequency (%)
덕천점 4
 
0.7%
구포 3
 
0.5%
스타벅스 3
 
0.5%
부산화명점 3
 
0.5%
부산지식산업센터 2
 
0.4%
부민병원 2
 
0.4%
율리역 2
 
0.4%
부산덕천점 2
 
0.4%
화명점 2
 
0.4%
화명그린24(그린숲속 2
 
0.4%
Other values (510) 530
95.5%
2023-12-13T03:46:35.823897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
83
 
2.4%
79
 
2.3%
75
 
2.1%
71
 
2.0%
70
 
2.0%
70
 
2.0%
68
 
1.9%
68
 
1.9%
64
 
1.8%
58
 
1.7%
Other values (394) 2788
79.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3228
92.4%
Space Separator 70
 
2.0%
Decimal Number 60
 
1.7%
Uppercase Letter 42
 
1.2%
Open Punctuation 37
 
1.1%
Close Punctuation 37
 
1.1%
Other Symbol 7
 
0.2%
Other Punctuation 5
 
0.1%
Dash Punctuation 4
 
0.1%
Lowercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
83
 
2.6%
79
 
2.4%
75
 
2.3%
71
 
2.2%
70
 
2.2%
68
 
2.1%
68
 
2.1%
64
 
2.0%
58
 
1.8%
54
 
1.7%
Other values (356) 2538
78.6%
Uppercase Letter
ValueCountFrequency (%)
T 4
 
9.5%
B 4
 
9.5%
M 3
 
7.1%
D 3
 
7.1%
A 3
 
7.1%
U 3
 
7.1%
H 3
 
7.1%
O 3
 
7.1%
W 3
 
7.1%
L 2
 
4.8%
Other values (7) 11
26.2%
Decimal Number
ValueCountFrequency (%)
2 23
38.3%
1 12
20.0%
0 5
 
8.3%
3 5
 
8.3%
4 4
 
6.7%
5 3
 
5.0%
7 3
 
5.0%
9 2
 
3.3%
8 2
 
3.3%
6 1
 
1.7%
Other Punctuation
ValueCountFrequency (%)
, 2
40.0%
& 1
20.0%
. 1
20.0%
! 1
20.0%
Lowercase Letter
ValueCountFrequency (%)
e 3
75.0%
h 1
 
25.0%
Space Separator
ValueCountFrequency (%)
70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3235
92.6%
Common 213
 
6.1%
Latin 46
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
83
 
2.6%
79
 
2.4%
75
 
2.3%
71
 
2.2%
70
 
2.2%
68
 
2.1%
68
 
2.1%
64
 
2.0%
58
 
1.8%
54
 
1.7%
Other values (357) 2545
78.7%
Latin
ValueCountFrequency (%)
T 4
 
8.7%
B 4
 
8.7%
M 3
 
6.5%
D 3
 
6.5%
A 3
 
6.5%
e 3
 
6.5%
U 3
 
6.5%
H 3
 
6.5%
O 3
 
6.5%
W 3
 
6.5%
Other values (9) 14
30.4%
Common
ValueCountFrequency (%)
70
32.9%
( 37
17.4%
) 37
17.4%
2 23
 
10.8%
1 12
 
5.6%
0 5
 
2.3%
3 5
 
2.3%
- 4
 
1.9%
4 4
 
1.9%
5 3
 
1.4%
Other values (8) 13
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3228
92.4%
ASCII 259
 
7.4%
None 7
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
83
 
2.6%
79
 
2.4%
75
 
2.3%
71
 
2.2%
70
 
2.2%
68
 
2.1%
68
 
2.1%
64
 
2.0%
58
 
1.8%
54
 
1.7%
Other values (356) 2538
78.6%
ASCII
ValueCountFrequency (%)
70
27.0%
( 37
14.3%
) 37
14.3%
2 23
 
8.9%
1 12
 
4.6%
0 5
 
1.9%
3 5
 
1.9%
T 4
 
1.5%
- 4
 
1.5%
4 4
 
1.5%
Other values (27) 58
22.4%
None
ValueCountFrequency (%)
7
100.0%

주소
Text

Distinct479
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-13T03:46:36.218840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length31
Mean length16.599182
Min length5

Characters and Unicode

Total characters8117
Distinct characters203
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique469 ?
Unique (%)95.9%

Sample

1st row부산광역시 북구 만덕대로 59 (덕천동)
2nd row부산광역시 북구 낙동대로 1786 (구포동)
3rd row부산광역시 북구 시랑로 31-1 (구포동)
4th row부산광역시 북구 금곡대로 15 (덕천동)
5th row부산광역시 북구 화명대로 1 (화명동)
ValueCountFrequency (%)
화명동 73
 
4.9%
구포동 68
 
4.5%
금곡대로 67
 
4.5%
덕천동 59
 
3.9%
북구 52
 
3.5%
만덕동 42
 
2.8%
화명신도시로 31
 
2.1%
만덕대로 30
 
2.0%
부산광역시 28
 
1.9%
금곡동 28
 
1.9%
Other values (535) 1023
68.2%
2023-12-13T03:46:37.057804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1490
 
18.4%
444
 
5.5%
1 424
 
5.2%
388
 
4.8%
( 323
 
4.0%
) 323
 
4.0%
2 295
 
3.6%
244
 
3.0%
218
 
2.7%
202
 
2.5%
Other values (193) 3766
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3794
46.7%
Decimal Number 1956
24.1%
Space Separator 1490
 
18.4%
Close Punctuation 324
 
4.0%
Open Punctuation 323
 
4.0%
Other Punctuation 116
 
1.4%
Dash Punctuation 65
 
0.8%
Uppercase Letter 43
 
0.5%
Math Symbol 4
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
444
 
11.7%
388
 
10.2%
244
 
6.4%
218
 
5.7%
202
 
5.3%
188
 
5.0%
174
 
4.6%
173
 
4.6%
139
 
3.7%
139
 
3.7%
Other values (166) 1485
39.1%
Decimal Number
ValueCountFrequency (%)
1 424
21.7%
2 295
15.1%
3 200
10.2%
0 194
9.9%
6 158
 
8.1%
8 157
 
8.0%
4 156
 
8.0%
5 139
 
7.1%
7 136
 
7.0%
9 97
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 90
77.6%
/ 20
 
17.2%
? 4
 
3.4%
@ 1
 
0.9%
. 1
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
B 21
48.8%
T 20
46.5%
L 1
 
2.3%
H 1
 
2.3%
Close Punctuation
ValueCountFrequency (%)
) 323
99.7%
] 1
 
0.3%
Space Separator
ValueCountFrequency (%)
1490
100.0%
Open Punctuation
ValueCountFrequency (%)
( 323
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 65
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4278
52.7%
Hangul 3795
46.8%
Latin 44
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
444
 
11.7%
388
 
10.2%
244
 
6.4%
218
 
5.7%
202
 
5.3%
188
 
5.0%
174
 
4.6%
173
 
4.6%
139
 
3.7%
139
 
3.7%
Other values (167) 1486
39.2%
Common
ValueCountFrequency (%)
1490
34.8%
1 424
 
9.9%
( 323
 
7.6%
) 323
 
7.6%
2 295
 
6.9%
3 200
 
4.7%
0 194
 
4.5%
6 158
 
3.7%
8 157
 
3.7%
4 156
 
3.6%
Other values (11) 558
 
13.0%
Latin
ValueCountFrequency (%)
B 21
47.7%
T 20
45.5%
L 1
 
2.3%
e 1
 
2.3%
H 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4322
53.2%
Hangul 3794
46.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1490
34.5%
1 424
 
9.8%
( 323
 
7.5%
) 323
 
7.5%
2 295
 
6.8%
3 200
 
4.6%
0 194
 
4.5%
6 158
 
3.7%
8 157
 
3.6%
4 156
 
3.6%
Other values (16) 602
13.9%
Hangul
ValueCountFrequency (%)
444
 
11.7%
388
 
10.2%
244
 
6.4%
218
 
5.7%
202
 
5.3%
188
 
5.0%
174
 
4.6%
173
 
4.6%
139
 
3.7%
139
 
3.7%
Other values (166) 1485
39.1%
None
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T03:46:34.175247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T03:46:34.328483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:46:34.436315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설명주소
01부민병원부산광역시 북구 만덕대로 59 (덕천동)
12구포 성심병원부산광역시 북구 낙동대로 1786 (구포동)
23구포 부민병원부산광역시 북구 시랑로 31-1 (구포동)
34미래로병원부산광역시 북구 금곡대로 15 (덕천동)
45베스티안부산병원부산광역시 북구 화명대로 1 (화명동)
56아하브병원부산광역시 북구 만덕고개길 84 (만덕동)
67한사랑내과병원부산광역시 북구 만덕대로 38 (덕천동)
78화명일신기독병원부산광역시 북구 금곡대로 268
89맥켄지화명일신기독병원부산광역시 북구 금곡대로 268
910굿윌치과병원부산광역시 북구 금곡대로 15 (덕천동)
연번시설명주소
479480현대2차북구 화명신도시로 48
480481협성북구 만덕1로 82
481482화명그린24(그린숲속)북구 산성로 88
482483화명대림타운북구 금곡대로 268
483484화명뜨란채북구 화명신도시로 219
484485화명리버빌2차북구 화명신도시로 244
485486화명코오롱북구 양달로 80-11
486487화명한일유앤아이북구 화명신도시로 194
487488e-편한세상 화명힐스북구 화명대로68번길
488489금정산LH뉴웰시티북구 상학골 35