Overview

Dataset statistics

Number of variables: 3
Number of observations: 390
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.7 KiB
Average record size in memory: 25.3 B

Variable types

Numeric: 1
Text: 2

Dataset

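Description: 부산광역시금정구_전문건설업체현황_20220219 (Geumjeong-gu, Busan Metropolitan City: status of specialty construction companies, 20220219)
Author: 부산광역시 금정구 (Geumjeong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3055603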

Reproduction

Analysis started: 2023-12-10 17:23:21.996236
Analysis finished: 2023-12-10 17:23:23.044991
Duration: 1.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (sequence number)
Real number (ℝ)

Distinct: 387
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 195.40769
Minimum: 1
Maximum: 387
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 20.45
Q1: 98.25
Median: 195.5
Q3: 292.75
95-th percentile: 370.55
Maximum: 387
Range: 386
Interquartile range (IQR): 194.5

Descriptive statistics

Standard deviation: 112.57347
Coefficient of variation (CV): 0.57609541
Kurtosis: -1.2054634
Mean: 195.40769
Median Absolute Deviation (MAD): 97.5
Skewness: -0.0044069025
Sum: 76209
Variance: 12672.787
Monotonicity: Not monotonic
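
Several of the figures above follow from one another, and a quick check helps confirm the panel was extracted correctly. The sketch below recomputes the mean, coefficient of variation, variance, range, and IQR from the other reported values for 순번. It assumes the usual definitions (mean = sum / count, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative and are not part of the report.

```python
# Values copied from the report for the 순번 column.
n = 390                      # number of observations
total = 76209                # Sum
std = 112.57347              # Standard deviation
minimum, maximum = 1, 387    # Minimum, Maximum
q1, q3 = 98.25, 292.75       # Q1, Q3

mean = total / n             # expected to match the reported Mean
cv = std / mean              # coefficient of variation
variance = std ** 2          # variance is the squared standard deviation
value_range = maximum - minimum
iqr = q3 - q1                # interquartile range

print(f"mean={mean:.5f} cv={cv:.8f} variance={variance:.3f}")
print(f"range={value_range} iqr={iqr}")
```

Small differences in the last printed digit are expected, because the inputs are themselves rounded values from the report.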
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
378 | 2 | 0.5%
376 | 2 | 0.5%
377 | 2 | 0.5%
253 | 1 | 0.3%
262 | 1 | 0.3%
261 | 1 | 0.3%
260 | 1 | 0.3%
259 | 1 | 0.3%
258 | 1 | 0.3%
257 | 1 | 0.3%
Other values (377) | 377 | 96.7%

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Value | Count | Frequency (%)
387 | 1 | 0.3%
386 | 1 | 0.3%
385 | 1 | 0.3%
384 | 1 | 0.3%
383 | 1 | 0.3%
382 | 1 | 0.3%
381 | 1 | 0.3%
380 | 1 | 0.3%
379 | 1 | 0.3%
378 | 2 | 0.5%

상호 (business name)
Text

Distinct: 389
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 15
Median length: 13
Mean length: 7.2589744
Min length: 2

Characters and Unicode

Total characters: 2831
Distinct characters: 271
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
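
As an illustration of those character properties, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. This is a stand-in and not the report's own implementation. The names are copied from this report's sample rows; the two-letter codes map to the report's labels as `Lo` = Other Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `So` = Other Symbol.

```python
import unicodedata
from collections import Counter

# Business names copied from the report's sample rows; the last one ends in
# the enclosed symbol ㈜ (U+321C) rather than the three characters "(주)".
names = [
    "(유)새천년건설",
    "(주)가람디자인",
    "(주)가온개발",
    "(주)강윤",
    "(주)건보산업",
    "용두토건㈜",
]

# Count the general category of every character across all names.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for category, count in category_counts.most_common():
    print(category, count)
```

The symbol ㈜ is classified as `So`, so a name such as 용두토건㈜ is a plausible source of the single Other Symbol character reported for this column, though the report itself does not say which value contributes it. `unicodedata` does not expose script or block names, so those two breakdowns are not reproduced here.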

Unique

Unique: 388
Unique (%): 99.5%
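
Distinct and Unique measure different things: as understood here, Distinct counts the different values present, while Unique counts the values that occur exactly once. The figures above fit that reading, since 388 values occurring once plus one value occurring twice give 389 distinct values over 390 rows. A minimal sketch under that assumption (the function name and toy list are hypothetical):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Toy input: one name repeated, as 해림건설(주) is in the dataset sample.
toy = ["해림건설(주)", "해림건설(주)", "현대설비", "현대티지"]
result = distinct_and_unique(toy)
print(result)
```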

Sample

1st row: (유)새천년건설
2nd row: (주)가람디자인
3rd row: (주)가온개발
4th row: (주)강윤
5th row: (주)건보산업
Value | Count | Frequency (%)
해림건설(주 | 2 | 0.5%
대명토건(주 | 1 | 0.3%
세경조경 | 1 | 0.3%
신흥설비 | 1 | 0.3%
신이영산업개발(주 | 1 | 0.3%
신우냉,난방 | 1 | 0.3%
신아건축인테리어 | 1 | 0.3%
수창산업개발주식회사 | 1 | 0.3%
수광조경개발 | 1 | 0.3%
송림건설(주 | 1 | 0.3%
Other values (379) | 379 | 97.2%

Most occurring characters

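Value | Count | Frequency (%)
(character missing) | 301 | 10.6%
( | 236 | 8.3%
) | 236 | 8.3%
(character missing) | 141 | 5.0%
(character missing) | 128 | 4.5%
(character missing) | 80 | 2.8%
(character missing) | 66 | 2.3%
(character missing) | 65 | 2.3%
(character missing) | 61 | 2.2%
(character missing) | 56 | 2.0%
Other values (261) | 1461 | 51.6%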

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2334 | 82.4%
Open Punctuation | 236 | 8.3%
Close Punctuation | 236 | 8.3%
Uppercase Letter | 19 | 0.7%
Decimal Number | 3 | 0.1%
Other Punctuation | 2 | 0.1%
Other Symbol | 1 | < 0.1%

Most frequent character per category

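Other Letter
Value | Count | Frequency (%)
(character missing) | 301 | 12.9%
(character missing) | 141 | 6.0%
(character missing) | 128 | 5.5%
(character missing) | 80 | 3.4%
(character missing) | 66 | 2.8%
(character missing) | 65 | 2.8%
(character missing) | 61 | 2.6%
(character missing) | 56 | 2.4%
(character missing) | 43 | 1.8%
(character missing) | 38 | 1.6%
Other values (242) | 1355 | 58.1%

Uppercase Letter
Value | Count | Frequency (%)
C | 4 | 21.1%
R | 2 | 10.5%
E | 2 | 10.5%
G | 2 | 10.5%
N | 2 | 10.5%
O | 1 | 5.3%
V | 1 | 5.3%
W | 1 | 5.3%
T | 1 | 5.3%
I | 1 | 5.3%
Other values (2) | 2 | 10.5%

Decimal Number
Value | Count | Frequency (%)
1 | 2 | 66.7%
9 | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 50.0%
. | 1 | 50.0%

Open Punctuation
Value | Count | Frequency (%)
( | 236 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 236 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(character missing) | 1 | 100.0%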

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2335 | 82.5%
Common | 477 | 16.8%
Latin | 19 | 0.7%

Most frequent character per script

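Hangul
Value | Count | Frequency (%)
(character missing) | 301 | 12.9%
(character missing) | 141 | 6.0%
(character missing) | 128 | 5.5%
(character missing) | 80 | 3.4%
(character missing) | 66 | 2.8%
(character missing) | 65 | 2.8%
(character missing) | 61 | 2.6%
(character missing) | 56 | 2.4%
(character missing) | 43 | 1.8%
(character missing) | 38 | 1.6%
Other values (243) | 1356 | 58.1%

Latin
Value | Count | Frequency (%)
C | 4 | 21.1%
R | 2 | 10.5%
E | 2 | 10.5%
G | 2 | 10.5%
N | 2 | 10.5%
O | 1 | 5.3%
V | 1 | 5.3%
W | 1 | 5.3%
T | 1 | 5.3%
I | 1 | 5.3%
Other values (2) | 2 | 10.5%

Common
Value | Count | Frequency (%)
( | 236 | 49.5%
) | 236 | 49.5%
1 | 2 | 0.4%
, | 1 | 0.2%
. | 1 | 0.2%
9 | 1 | 0.2%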

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2334 | 82.4%
ASCII | 496 | 17.5%
None | 1 | < 0.1%

Most frequent character per block

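Hangul
Value | Count | Frequency (%)
(character missing) | 301 | 12.9%
(character missing) | 141 | 6.0%
(character missing) | 128 | 5.5%
(character missing) | 80 | 3.4%
(character missing) | 66 | 2.8%
(character missing) | 65 | 2.8%
(character missing) | 61 | 2.6%
(character missing) | 56 | 2.4%
(character missing) | 43 | 1.8%
(character missing) | 38 | 1.6%
Other values (242) | 1355 | 58.1%

ASCII
Value | Count | Frequency (%)
( | 236 | 47.6%
) | 236 | 47.6%
C | 4 | 0.8%
R | 2 | 0.4%
E | 2 | 0.4%
G | 2 | 0.4%
1 | 2 | 0.4%
N | 2 | 0.4%
O | 1 | 0.2%
V | 1 | 0.2%
Other values (8) | 8 | 1.6%

None
Value | Count | Frequency (%)
(character missing) | 1 | 100.0%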

영업소재지 (business address)
Text

Distinct: 378
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 53
Median length: 42
Mean length: 28.625641
Min length: 19

Characters and Unicode

Total characters: 11164
Distinct characters: 197
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 367
Unique (%): 94.1%

Sample

1st row: 부산광역시 금정구 중앙대로 2350, 2층 (노포동)
2nd row: 부산광역시 금정구 삼어로 219 (금사동)
3rd row: 부산광역시 금정구 체육공원로 631-15 (두구동)
4th row: 부산광역시 금정구 개좌로272번길 21-13 (회동동)
5th row: 부산광역시 금정구 중앙대로 1981 (남산동)
Value | Count | Frequency (%)
금정구 | 390 | 17.8%
부산광역시 | 389 | 17.8%
남산동 | 77 | 3.5%
구서동 | 62 | 2.8%
부곡동 | 57 | 2.6%
중앙대로 | 37 | 1.7%
2층 | 34 | 1.6%
금강로 | 30 | 1.4%
장전동 | 29 | 1.3%
서동 | 27 | 1.2%
Other values (580) | 1054 | 48.2%

Most occurring characters

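Value | Count | Frequency (%)
(space) | 1796 | 16.1%
(character missing) | 535 | 4.8%
(character missing) | 501 | 4.5%
(character missing) | 495 | 4.4%
(character missing) | 488 | 4.4%
(character missing) | 460 | 4.1%
(character missing) | 412 | 3.7%
(character missing) | 396 | 3.5%
(character missing) | 391 | 3.5%
(character missing) | 389 | 3.5%
Other values (187) | 5301 | 47.5%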

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6522 | 58.4%
Decimal Number | 1806 | 16.2%
Space Separator | 1796 | 16.1%
Open Punctuation | 386 | 3.5%
Close Punctuation | 386 | 3.5%
Other Punctuation | 178 | 1.6%
Dash Punctuation | 85 | 0.8%
Lowercase Letter | 3 | < 0.1%
Uppercase Letter | 2 | < 0.1%

Most frequent character per category

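Other Letter
Value | Count | Frequency (%)
(character missing) | 535 | 8.2%
(character missing) | 501 | 7.7%
(character missing) | 495 | 7.6%
(character missing) | 488 | 7.5%
(character missing) | 460 | 7.1%
(character missing) | 412 | 6.3%
(character missing) | 396 | 6.1%
(character missing) | 391 | 6.0%
(character missing) | 389 | 6.0%
(character missing) | 388 | 5.9%
Other values (166) | 2067 | 31.7%

Decimal Number
Value | Count | Frequency (%)
1 | 365 | 20.2%
2 | 285 | 15.8%
3 | 207 | 11.5%
0 | 184 | 10.2%
5 | 151 | 8.4%
4 | 144 | 8.0%
6 | 144 | 8.0%
9 | 124 | 6.9%
7 | 117 | 6.5%
8 | 85 | 4.7%

Lowercase Letter
Value | Count | Frequency (%)
c | 1 | 33.3%
m | 1 | 33.3%
d | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
, | 148 | 83.1%
(character missing) | 30 | 16.9%

Uppercase Letter
Value | Count | Frequency (%)
D | 1 | 50.0%
B | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1796 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 386 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 386 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 85 | 100.0%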

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6522 | 58.4%
Common | 4637 | 41.5%
Latin | 5 | < 0.1%

Most frequent character per script

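Hangul
Value | Count | Frequency (%)
(character missing) | 535 | 8.2%
(character missing) | 501 | 7.7%
(character missing) | 495 | 7.6%
(character missing) | 488 | 7.5%
(character missing) | 460 | 7.1%
(character missing) | 412 | 6.3%
(character missing) | 396 | 6.1%
(character missing) | 391 | 6.0%
(character missing) | 389 | 6.0%
(character missing) | 388 | 5.9%
Other values (166) | 2067 | 31.7%

Common
Value | Count | Frequency (%)
(space) | 1796 | 38.7%
( | 386 | 8.3%
) | 386 | 8.3%
1 | 365 | 7.9%
2 | 285 | 6.1%
3 | 207 | 4.5%
0 | 184 | 4.0%
5 | 151 | 3.3%
, | 148 | 3.2%
4 | 144 | 3.1%
Other values (6) | 585 | 12.6%

Latin
Value | Count | Frequency (%)
D | 1 | 20.0%
B | 1 | 20.0%
c | 1 | 20.0%
m | 1 | 20.0%
d | 1 | 20.0%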

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6522 | 58.4%
ASCII | 4612 | 41.3%
None | 30 | 0.3%

Most frequent character per block

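ASCII
Value | Count | Frequency (%)
(space) | 1796 | 38.9%
( | 386 | 8.4%
) | 386 | 8.4%
1 | 365 | 7.9%
2 | 285 | 6.2%
3 | 207 | 4.5%
0 | 184 | 4.0%
5 | 151 | 3.3%
, | 148 | 3.2%
4 | 144 | 3.1%
Other values (10) | 560 | 12.1%

Hangul
Value | Count | Frequency (%)
(character missing) | 535 | 8.2%
(character missing) | 501 | 7.7%
(character missing) | 495 | 7.6%
(character missing) | 488 | 7.5%
(character missing) | 460 | 7.1%
(character missing) | 412 | 6.3%
(character missing) | 396 | 6.1%
(character missing) | 391 | 6.0%
(character missing) | 389 | 6.0%
(character missing) | 388 | 5.9%
Other values (166) | 2067 | 31.7%

None
Value | Count | Frequency (%)
(character missing) | 30 | 100.0%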

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
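
To make the two nullity views concrete, the sketch below computes a per-column count of present values and a row-by-row text rendering of the nullity matrix. It is a plain-Python illustration and not how the report draws its charts; the function names are hypothetical, and the third row is invented to show a gap, since this dataset reports zero missing cells.

```python
def non_missing_counts(rows, columns):
    """Count the non-missing (not None) values in each column."""
    return {
        col: sum(1 for row in rows if row.get(col) is not None)
        for col in columns
    }

def nullity_matrix(rows, columns):
    """One text line per row: '#' marks a present value, '.' a missing one."""
    return [
        "".join("#" if row.get(col) is not None else "." for col in columns)
        for row in rows
    ]

columns = ["순번", "상호", "영업소재지"]
rows = [
    # The first two rows are copied from the report's sample.
    {"순번": 1, "상호": "(유)새천년건설", "영업소재지": "부산광역시 금정구 중앙대로 2350, 2층 (노포동)"},
    {"순번": 2, "상호": "(주)가람디자인", "영업소재지": "부산광역시 금정구 삼어로 219 (금사동)"},
    # Hypothetical row with missing cells, added only for illustration.
    {"순번": 3, "상호": None, "영업소재지": None},
]

counts = non_missing_counts(rows, columns)
matrix = nullity_matrix(rows, columns)
print(counts)
print("\n".join(matrix))
```

For the actual dataset every column would show 390 present values and every matrix row would be fully filled.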

Sample

(index) | 순번 | 상호 | 영업소재지
0 | 1 | (유)새천년건설 | 부산광역시 금정구 중앙대로 2350, 2층 (노포동)
1 | 2 | (주)가람디자인 | 부산광역시 금정구 삼어로 219 (금사동)
2 | 3 | (주)가온개발 | 부산광역시 금정구 체육공원로 631-15 (두구동)
3 | 4 | (주)강윤 | 부산광역시 금정구 개좌로272번길 21-13 (회동동)
4 | 5 | (주)건보산업 | 부산광역시 금정구 중앙대로 1981 (남산동)
5 | 6 | (주)고센건설 | 부산광역시 금정구 금정로 191,3층(장전동, 고센빌딩)
6 | 7 | (주)공간산업개발 | 부산광역시 금정구 노포사송로 123, 지1층 (노포동)
7 | 8 | (주)관문건설 | 부산광역시 금정구 팔송로45번길 37-1(남산동)
8 | 9 | (주)광일테크 | 부산광역시 금정구 금강로578번길 32 4층 (구서동)
9 | 10 | (주)국제배관 | 부산광역시 금정구 두실로 72-1 (남산동)

(index) | 순번 | 상호 | 영업소재지
380 | 378 | 해림건설(주) | 부산광역시 금정구 서부곡로 19 덕영빌딩 3층 (부곡동)
381 | 379 | 해림건설(주) | 부산광역시 금정구 서부곡로 19 덕영빌딩 3층 (부곡동)
382 | 380 | 현대설비 | 부산광역시 금정구 수림로66번길 24 (장전동)
383 | 381 | 현대티지 | 부산광역시 금정구 체육공원로29번길 24 (구서동)
384 | 382 | 현송건설주식회사 | 부산광역시 금정구 금강로 179 9층 (장전동)
385 | 383 | 화성설비 | 부산광역시 금정구 남산로37번길 20-2 (남산동)
386 | 384 | 회영설비 | 부산광역시 금정구 금사로 168 (회동동)
387 | 385 | 효성드라이비트(주) | 부산광역시 금정구 오시게로 62, 505호(부곡동, 남성하이빌)
388 | 386 | 휘람건설(주) | 부산광역시 금정구 금샘로582번길 24 105호(아신빌라) (남산동)
389 | 387 | 용두토건㈜ | 부산광역시 금정구 금강로 585, 5층