Overview

Dataset statistics

Number of variables4
Number of observations36
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory36.7 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시_기장군_건축사사무소현황_20230215
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15034735

Alerts

연번 has unique valuesUnique
대표자 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:00:35.592751
Analysis finished2023-12-10 17:00:36.300034
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.5
Minimum1
Maximum36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-11T02:00:36.425817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.75
Q19.75
median18.5
Q327.25
95-th percentile34.25
Maximum36
Range35
Interquartile range (IQR)17.5

Descriptive statistics

Standard deviation10.535654
Coefficient of variation (CV)0.5694948
Kurtosis-1.2
Mean18.5
Median Absolute Deviation (MAD)9
Skewness0
Sum666
Variance111
MonotonicityStrictly increasing
2023-12-11T02:00:36.663970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
1 1
 
2.8%
20 1
 
2.8%
22 1
 
2.8%
23 1
 
2.8%
24 1
 
2.8%
25 1
 
2.8%
26 1
 
2.8%
27 1
 
2.8%
28 1
 
2.8%
29 1
 
2.8%
Other values (26) 26
72.2%
ValueCountFrequency (%)
1 1
2.8%
2 1
2.8%
3 1
2.8%
4 1
2.8%
5 1
2.8%
6 1
2.8%
7 1
2.8%
8 1
2.8%
9 1
2.8%
10 1
2.8%
ValueCountFrequency (%)
36 1
2.8%
35 1
2.8%
34 1
2.8%
33 1
2.8%
32 1
2.8%
31 1
2.8%
30 1
2.8%
29 1
2.8%
28 1
2.8%
27 1
2.8%
Distinct33
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-11T02:00:37.007395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length9.7777778
Min length7

Characters and Unicode

Total characters352
Distinct characters68
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)83.3%

Sample

1st row아름 건축사사무소
2nd row건축사사무소 장원
3rd row일탑 건축사사무소
4th row건축사사무소 소슬
5th row와이디건축사사무소
ValueCountFrequency (%)
건축사사무소 19
33.3%
눌원종합건축사사무소 2
 
3.5%
동림건축사사무소 2
 
3.5%
주)서원건축사사무소 2
 
3.5%
인재건축사사무소 1
 
1.8%
원광 1
 
1.8%
라인건축 1
 
1.8%
태림 1
 
1.8%
1
 
1.8%
건보건축사사무소 1
 
1.8%
Other values (26) 26
45.6%
2023-12-11T02:00:37.522260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
72
20.5%
38
10.8%
38
10.8%
37
10.5%
36
10.2%
22
 
6.2%
6
 
1.7%
( 5
 
1.4%
) 5
 
1.4%
5
 
1.4%
Other values (58) 88
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 316
89.8%
Space Separator 22
 
6.2%
Open Punctuation 5
 
1.4%
Close Punctuation 5
 
1.4%
Uppercase Letter 3
 
0.9%
Other Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
22.8%
38
12.0%
38
12.0%
37
11.7%
36
11.4%
6
 
1.9%
5
 
1.6%
5
 
1.6%
5
 
1.6%
4
 
1.3%
Other values (51) 70
22.2%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
N 1
33.3%
G 1
33.3%
Space Separator
ValueCountFrequency (%)
22
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 317
90.1%
Common 32
 
9.1%
Latin 3
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
22.7%
38
12.0%
38
12.0%
37
11.7%
36
11.4%
6
 
1.9%
5
 
1.6%
5
 
1.6%
5
 
1.6%
4
 
1.3%
Other values (52) 71
22.4%
Common
ValueCountFrequency (%)
22
68.8%
( 5
 
15.6%
) 5
 
15.6%
Latin
ValueCountFrequency (%)
E 1
33.3%
N 1
33.3%
G 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 316
89.8%
ASCII 35
 
9.9%
None 1
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
72
22.8%
38
12.0%
38
12.0%
37
11.7%
36
11.4%
6
 
1.9%
5
 
1.6%
5
 
1.6%
5
 
1.6%
4
 
1.3%
Other values (51) 70
22.2%
ASCII
ValueCountFrequency (%)
22
62.9%
( 5
 
14.3%
) 5
 
14.3%
E 1
 
2.9%
N 1
 
2.9%
G 1
 
2.9%
None
ValueCountFrequency (%)
1
100.0%

대표자
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-11T02:00:37.869380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters108
Distinct characters60
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row이두열
2nd row박주영
3rd row박평섭
4th row최미예
5th row김대원
ValueCountFrequency (%)
이두열 1
 
2.8%
박주영 1
 
2.8%
전준호 1
 
2.8%
권종규 1
 
2.8%
안기석 1
 
2.8%
김임수 1
 
2.8%
조서영 1
 
2.8%
이상석 1
 
2.8%
김동철 1
 
2.8%
심태홍 1
 
2.8%
Other values (26) 26
72.2%
2023-12-11T02:00:38.391177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
5
 
4.6%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (50) 67
62.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
5
 
4.6%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (50) 67
62.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 108
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
5
 
4.6%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (50) 67
62.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 108
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
5
 
4.6%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (50) 67
62.0%

주소
Text

Distinct31
Distinct (%)86.1%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-11T02:00:38.926696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length43
Mean length35.361111
Min length27

Characters and Unicode

Total characters1273
Distinct characters82
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)75.0%

Sample

1st row부산광역시 기장군 기장읍 기장대로 515 3층 (기장읍)
2nd row부산광역시 기장군 기장읍 차성남로89번길 7 (기장읍)
3rd row부산광역시 기장군 기장읍 차성동로116번길 10 4층 (기장읍)
4th row부산광역시 기장군 철마면 고촌로34번길 23-1 (철마면)
5th row부산광역시 기장군 일광면 일광로 90 2층 (일광면)
ValueCountFrequency (%)
기장읍 54
20.9%
부산광역시 36
14.0%
기장군 36
14.0%
기장대로 11
 
4.3%
3층 11
 
4.3%
철마면 6
 
2.3%
차성로418번길 5
 
1.9%
일광면 4
 
1.6%
28 4
 
1.6%
2동 4
 
1.6%
Other values (63) 87
33.7%
2023-12-11T02:00:39.753018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
254
20.0%
104
 
8.2%
102
 
8.0%
60
 
4.7%
41
 
3.2%
( 37
 
2.9%
) 37
 
2.9%
36
 
2.8%
36
 
2.8%
36
 
2.8%
Other values (72) 530
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 749
58.8%
Space Separator 254
 
20.0%
Decimal Number 179
 
14.1%
Open Punctuation 37
 
2.9%
Close Punctuation 37
 
2.9%
Dash Punctuation 7
 
0.5%
Other Punctuation 6
 
0.5%
Uppercase Letter 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
13.9%
102
13.6%
60
 
8.0%
41
 
5.5%
36
 
4.8%
36
 
4.8%
36
 
4.8%
36
 
4.8%
36
 
4.8%
34
 
4.5%
Other values (52) 228
30.4%
Decimal Number
ValueCountFrequency (%)
2 33
18.4%
3 29
16.2%
1 26
14.5%
6 16
8.9%
4 16
8.9%
5 15
8.4%
0 15
8.4%
8 12
 
6.7%
9 10
 
5.6%
7 7
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
B 1
25.0%
A 1
25.0%
T 1
25.0%
O 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 5
83.3%
/ 1
 
16.7%
Space Separator
ValueCountFrequency (%)
254
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 749
58.8%
Common 520
40.8%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
13.9%
102
13.6%
60
 
8.0%
41
 
5.5%
36
 
4.8%
36
 
4.8%
36
 
4.8%
36
 
4.8%
36
 
4.8%
34
 
4.5%
Other values (52) 228
30.4%
Common
ValueCountFrequency (%)
254
48.8%
( 37
 
7.1%
) 37
 
7.1%
2 33
 
6.3%
3 29
 
5.6%
1 26
 
5.0%
6 16
 
3.1%
4 16
 
3.1%
5 15
 
2.9%
0 15
 
2.9%
Other values (6) 42
 
8.1%
Latin
ValueCountFrequency (%)
B 1
25.0%
A 1
25.0%
T 1
25.0%
O 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 749
58.8%
ASCII 524
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
254
48.5%
( 37
 
7.1%
) 37
 
7.1%
2 33
 
6.3%
3 29
 
5.5%
1 26
 
5.0%
6 16
 
3.1%
4 16
 
3.1%
5 15
 
2.9%
0 15
 
2.9%
Other values (10) 46
 
8.8%
Hangul
ValueCountFrequency (%)
104
13.9%
102
13.6%
60
 
8.0%
41
 
5.5%
36
 
4.8%
36
 
4.8%
36
 
4.8%
36
 
4.8%
36
 
4.8%
34
 
4.5%
Other values (52) 228
30.4%

Interactions

2023-12-11T02:00:35.911120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:00:39.941805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사무소명대표자주소
연번1.0001.0001.0000.958
사무소명1.0001.0001.0001.000
대표자1.0001.0001.0001.000
주소0.9581.0001.0001.000

Missing values

2023-12-11T02:00:36.096841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:00:36.239664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사무소명대표자주소
01아름 건축사사무소이두열부산광역시 기장군 기장읍 기장대로 515 3층 (기장읍)
12건축사사무소 장원박주영부산광역시 기장군 기장읍 차성남로89번길 7 (기장읍)
23일탑 건축사사무소박평섭부산광역시 기장군 기장읍 차성동로116번길 10 4층 (기장읍)
34건축사사무소 소슬최미예부산광역시 기장군 철마면 고촌로34번길 23-1 (철마면)
45와이디건축사사무소김대원부산광역시 기장군 일광면 일광로 90 2층 (일광면)
56선우 건축사사무소김준기부산광역시 기장군 기장읍 기장대로 456 (기장읍)
67태산 건축사사무소전동수부산광역시 기장군 기장읍 기장대로 515 3층 (기장읍)
78일산 건축사사무소박평호부산광역시 기장군 기장읍 차성로418번길 12 (기장읍)
89눌원종합건축사사무소박기환부산광역시 기장군 기장읍 차성로418번길 28 3층 (교리, 형제빌딩 2동) (기장읍)
910눌원종합건축사사무소원현규부산광역시 기장군 기장읍 차성로418번길 28 3층 (교리, 형제빌딩 2동) (기장읍)
연번사무소명대표자주소
2627동림건축사사무소전준호부산광역시 기장군 기장읍 차성로 296 (기장읍)
2728건축사사무소 홍심태홍부산광역시 기장군 기장읍 기장대로 521 (기장읍)
2829인재건축사사무소윤상천부산광역시 기장군 기장읍 대라리 대지 95-6 3층 사무실
2930(주)씨앤피 건축사사무소박경순부산광역시 기장군 기장읍 기장대로 563 현대아파트상가 에이동 301호 (기장읍)
3031엠건축사사무소문찬도부산광역시 기장군 기장읍 차성로 290 307호 (기장읍)
3132조은건축사사무소장필규부산광역시 기장군 기장읍 청강로 22 2층 (기장읍)
3233건축사사무소 태성ENG김행철부산광역시 기장군 기장읍 기장대로 469, 3층 (청강리)
3334건축사사무소 아키현김효현부산광역시 기장군 장안읍 정관로 1142, 에이동 403호 (장안읍)
3435㈜센터라인 종합건축사사무소김규섭부산광역시 기장군 정관읍 정관2로9 상가213동 비105호 이지더원이차아파트 (정관읍)
3536혜우 건축사사무소김치우부산광역시 기장군 정관읍 방곡4로 24 1층(정관읍)