Overview

Dataset statistics

Number of variables3
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.9 KiB
Average record size in memory25.3 B

Variable types

Numeric1
Text2

Dataset

Description부산광역시금정구_전문건설업체현황_20230214
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3055603

Alerts

순번 has unique valuesUnique
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:23:14.565054
Analysis finished2023-12-10 17:23:15.592146
Duration1.03 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200.5
Minimum1
Maximum400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-11T02:23:15.767421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20.95
Q1100.75
median200.5
Q3300.25
95-th percentile380.05
Maximum400
Range399
Interquartile range (IQR)199.5

Descriptive statistics

Standard deviation115.6143
Coefficient of variation (CV)0.57662993
Kurtosis-1.2
Mean200.5
Median Absolute Deviation (MAD)100
Skewness0
Sum80200
Variance13366.667
MonotonicityStrictly increasing
2023-12-11T02:23:16.136362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
265 1
 
0.2%
275 1
 
0.2%
274 1
 
0.2%
273 1
 
0.2%
272 1
 
0.2%
271 1
 
0.2%
270 1
 
0.2%
269 1
 
0.2%
268 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
400 1
0.2%
399 1
0.2%
398 1
0.2%
397 1
0.2%
396 1
0.2%
395 1
0.2%
394 1
0.2%
393 1
0.2%
392 1
0.2%
391 1
0.2%

상호
Text

UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-11T02:23:17.397447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length7.3475
Min length2

Characters and Unicode

Total characters2939
Distinct characters279
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique400 ?
Unique (%)100.0%

Sample

1st row(유)새천년건설
2nd row(주)가람디자인
3rd row(주)가온개발
4th row(주)강윤
5th row(주)건보산업
ValueCountFrequency (%)
유)새천년건설 1
 
0.2%
세윤종합건설(주 1
 
0.2%
에스더블유건설(주 1
 
0.2%
에덴건축 1
 
0.2%
씨에이건설(주 1
 
0.2%
신흥설비 1
 
0.2%
신이영산업개발(주 1
 
0.2%
신우냉,난방 1
 
0.2%
신아건축인테리어 1
 
0.2%
수창산업개발주식회사 1
 
0.2%
Other values (390) 390
97.5%
2023-12-11T02:23:18.483600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
314
 
10.7%
( 236
 
8.0%
) 236
 
8.0%
144
 
4.9%
134
 
4.6%
91
 
3.1%
78
 
2.7%
77
 
2.6%
69
 
2.3%
52
 
1.8%
Other values (269) 1508
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2441
83.1%
Open Punctuation 236
 
8.0%
Close Punctuation 236
 
8.0%
Uppercase Letter 18
 
0.6%
Decimal Number 6
 
0.2%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
314
 
12.9%
144
 
5.9%
134
 
5.5%
91
 
3.7%
78
 
3.2%
77
 
3.2%
69
 
2.8%
52
 
2.1%
45
 
1.8%
42
 
1.7%
Other values (249) 1395
57.1%
Uppercase Letter
ValueCountFrequency (%)
C 3
16.7%
O 2
11.1%
E 2
11.1%
R 2
11.1%
G 2
11.1%
N 1
 
5.6%
P 1
 
5.6%
W 1
 
5.6%
V 1
 
5.6%
J 1
 
5.6%
Other values (2) 2
11.1%
Decimal Number
ValueCountFrequency (%)
1 2
33.3%
9 2
33.3%
4 1
16.7%
0 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
. 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 236
100.0%
Close Punctuation
ValueCountFrequency (%)
) 236
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2441
83.1%
Common 480
 
16.3%
Latin 18
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
314
 
12.9%
144
 
5.9%
134
 
5.5%
91
 
3.7%
78
 
3.2%
77
 
3.2%
69
 
2.8%
52
 
2.1%
45
 
1.8%
42
 
1.7%
Other values (249) 1395
57.1%
Latin
ValueCountFrequency (%)
C 3
16.7%
O 2
11.1%
E 2
11.1%
R 2
11.1%
G 2
11.1%
N 1
 
5.6%
P 1
 
5.6%
W 1
 
5.6%
V 1
 
5.6%
J 1
 
5.6%
Other values (2) 2
11.1%
Common
ValueCountFrequency (%)
( 236
49.2%
) 236
49.2%
1 2
 
0.4%
9 2
 
0.4%
, 1
 
0.2%
4 1
 
0.2%
0 1
 
0.2%
. 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2441
83.1%
ASCII 498
 
16.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
314
 
12.9%
144
 
5.9%
134
 
5.5%
91
 
3.7%
78
 
3.2%
77
 
3.2%
69
 
2.8%
52
 
2.1%
45
 
1.8%
42
 
1.7%
Other values (249) 1395
57.1%
ASCII
ValueCountFrequency (%)
( 236
47.4%
) 236
47.4%
C 3
 
0.6%
1 2
 
0.4%
9 2
 
0.4%
O 2
 
0.4%
E 2
 
0.4%
R 2
 
0.4%
G 2
 
0.4%
N 1
 
0.2%
Other values (10) 10
 
2.0%
Distinct389
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-11T02:23:19.083395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length42
Mean length28.4875
Min length19

Characters and Unicode

Total characters11395
Distinct characters194
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique379 ?
Unique (%)94.8%

Sample

1st row부산광역시 금정구 중앙대로 2350, 2층 (노포동)
2nd row부산광역시 금정구 삼어로 219 (금사동)
3rd row부산광역시 금정구 체육공원로 631-15 (두구동)
4th row부산광역시 금정구 개좌로272번길 21-13 (회동동)
5th row부산광역시 금정구 중앙대로 1981 (남산동)
ValueCountFrequency (%)
금정구 400
 
17.9%
부산광역시 399
 
17.8%
남산동 77
 
3.4%
부곡동 64
 
2.9%
구서동 62
 
2.8%
중앙대로 33
 
1.5%
2층 33
 
1.5%
장전동 31
 
1.4%
금강로 29
 
1.3%
금사동 26
 
1.2%
Other values (577) 1085
48.5%
2023-12-11T02:23:20.098866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1839
 
16.1%
542
 
4.8%
513
 
4.5%
508
 
4.5%
507
 
4.4%
482
 
4.2%
425
 
3.7%
407
 
3.6%
( 402
 
3.5%
) 402
 
3.5%
Other values (184) 5368
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6672
58.6%
Space Separator 1839
 
16.1%
Decimal Number 1830
 
16.1%
Open Punctuation 402
 
3.5%
Close Punctuation 402
 
3.5%
Other Punctuation 155
 
1.4%
Dash Punctuation 90
 
0.8%
Lowercase Letter 3
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
542
 
8.1%
513
 
7.7%
508
 
7.6%
507
 
7.6%
482
 
7.2%
425
 
6.4%
407
 
6.1%
401
 
6.0%
399
 
6.0%
397
 
6.0%
Other values (163) 2091
31.3%
Decimal Number
ValueCountFrequency (%)
1 375
20.5%
2 283
15.5%
3 213
11.6%
0 184
10.1%
5 166
9.1%
6 151
8.3%
4 151
8.3%
9 120
 
6.6%
7 114
 
6.2%
8 73
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
d 1
33.3%
m 1
33.3%
c 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 128
82.6%
27
 
17.4%
Uppercase Letter
ValueCountFrequency (%)
D 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
1839
100.0%
Open Punctuation
ValueCountFrequency (%)
( 402
100.0%
Close Punctuation
ValueCountFrequency (%)
) 402
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 90
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6672
58.6%
Common 4718
41.4%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
542
 
8.1%
513
 
7.7%
508
 
7.6%
507
 
7.6%
482
 
7.2%
425
 
6.4%
407
 
6.1%
401
 
6.0%
399
 
6.0%
397
 
6.0%
Other values (163) 2091
31.3%
Common
ValueCountFrequency (%)
1839
39.0%
( 402
 
8.5%
) 402
 
8.5%
1 375
 
7.9%
2 283
 
6.0%
3 213
 
4.5%
0 184
 
3.9%
5 166
 
3.5%
6 151
 
3.2%
4 151
 
3.2%
Other values (6) 552
 
11.7%
Latin
ValueCountFrequency (%)
D 1
20.0%
d 1
20.0%
m 1
20.0%
c 1
20.0%
B 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6672
58.6%
ASCII 4696
41.2%
None 27
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1839
39.2%
( 402
 
8.6%
) 402
 
8.6%
1 375
 
8.0%
2 283
 
6.0%
3 213
 
4.5%
0 184
 
3.9%
5 166
 
3.5%
6 151
 
3.2%
4 151
 
3.2%
Other values (10) 530
 
11.3%
Hangul
ValueCountFrequency (%)
542
 
8.1%
513
 
7.7%
508
 
7.6%
507
 
7.6%
482
 
7.2%
425
 
6.4%
407
 
6.1%
401
 
6.0%
399
 
6.0%
397
 
6.0%
Other values (163) 2091
31.3%
None
ValueCountFrequency (%)
27
100.0%

Interactions

2023-12-11T02:23:15.029797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:23:15.346347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:23:15.527388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상호영업소재지(도로명주소)
01(유)새천년건설부산광역시 금정구 중앙대로 2350, 2층 (노포동)
12(주)가람디자인부산광역시 금정구 삼어로 219 (금사동)
23(주)가온개발부산광역시 금정구 체육공원로 631-15 (두구동)
34(주)강윤부산광역시 금정구 개좌로272번길 21-13 (회동동)
45(주)건보산업부산광역시 금정구 중앙대로 1981 (남산동)
56(주)고센건설부산광역시 금정구 금정로 191,3층(장전동, 고센빌딩)
67(주)공간산업개발부산광역시 금정구 노포사송로 123, 지1층 (노포동)
78(주)관문건설부산광역시 금정구 팔송로45번길 37-1(남산동)
89(주)광일테크부산광역시 금정구 금강로578번길 32 4층 (구서동)
910(주)국제배관부산광역시 금정구 금샘로 556 (남산동)
순번상호영업소재지(도로명주소)
390391해광설비공사부산광역시 금정구 서동로 95 (서동)
391392해든조경주식회사부산광역시 금정구 남산로 48 2층 (남산동)
392393해림건설(주)부산광역시 금정구 서부곡로 19 덕영빌딩 3층 (부곡동)
393394현대설비부산광역시 금정구 수림로66번길 24 (장전동)
394395현대티지부산광역시 금정구 체육공원로29번길 24 (구서동)
395396현송건설주식회사부산광역시 금정구 금강로 179 9층 (장전동)
396397화성설비부산광역시 금정구 남산로37번길 20-2 (남산동)
397398회영설비부산광역시 금정구 금사로 168 (회동동)
398399효성드라이비트(주)부산광역시 금정구 오시게로 62, 505호(부곡동, 남성하이빌)
399400휘람건설(주)부산광역시 금정구 금샘로582번길 24 105호(아신빌라) (남산동)