Overview

Dataset statistics

Number of variables: 7
Number of observations: 104
Missing cells: 6
Missing cells (%): 0.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.9 KiB
Average record size in memory: 58.3 B
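
The percentages and sizes above follow from the raw counts. The sketch below re-derives them, assuming that the missing-cell percentage is taken over all cells (variables times observations) and that the total size is the average record size times the number of observations; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 7
n_observations = 104
missing_cells = 6
avg_record_bytes = 58.3

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size = average record size * number of records.
total_kib = avg_record_bytes * n_observations / 1024

print(f"total cells:   {total_cells}")         # 728
print(f"missing cells: {missing_pct:.1f}%")    # 0.8%
print(f"total size:    {total_kib:.1f} KiB")   # 5.9 KiB
```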

Variable types

Numeric: 1
Text: 4
Categorical: 2

Dataset

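Description: CSV data on architectural firms (건축사사무소) in Gumi-si, Gyeongsangbuk-do, listing each firm's name, representative, address, and contact number.
Author: Gumi-si, Gyeongsangbuk-do (경상북도 구미시)
URL: https://www.data.go.kr/data/3077606/fileData.do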

Alerts

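관리기관전화번호 (managing agency phone number) has constant value "054-480-5512" [Constant]
관리기관명 (managing agency name) has constant value "경상북도 구미시청" [Constant]
연락처 (contact number) has 6 (5.8%) missing values [Missing]
연번 (serial number) has unique values [Unique]
건축사사무소 명 (architectural firm name) has unique values [Unique]
대표자 (representative) has unique values [Unique]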

Reproduction

Analysis started: 2024-04-21 02:51:31.457368
Analysis finished: 2024-04-21 02:51:32.913917
Duration: 1.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
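
A report like this one can be regenerated with a few lines of Python. This is a minimal sketch, not the configuration actually used here: the function name `build_report`, the report title, the output file name, and the `cp949` encoding are all assumptions (the encoding is only a common default for Korean CSV files).

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html",
                 encoding: str = "cp949") -> Path:
    """Profile a CSV file and write an HTML report; returns the output path."""
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; try "utf-8" if decoding fails.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative, not the one used for this report.
    report = ProfileReport(df, title="Gumi-si architectural firms")
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("firms.csv")`, where `firms.csv` is a hypothetical local copy of the dataset, would write `report.html` to the working directory.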

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 104
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 52.5
Minimum: 1
Maximum: 104
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.15
Q1: 26.75
Median: 52.5
Q3: 78.25
95-th percentile: 98.85
Maximum: 104
Range: 103
Interquartile range (IQR): 51.5
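
The quantiles above can be checked with the standard library alone. The sketch assumes that 연번 is exactly the run 1..104 (consistent with 104 distinct values, minimum 1, maximum 104) and that quantiles are linearly interpolated between order statistics, which is what `method="inclusive"` does.

```python
import statistics

# Assumption: the serial-number column is the run 1..104.
serial = list(range(1, 105))

# method="inclusive" interpolates linearly between order statistics.
q1, median, q3 = statistics.quantiles(serial, n=4, method="inclusive")
twentieths = statistics.quantiles(serial, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(serial) - min(serial)
iqr = q3 - q1

print(p5, q1, median, q3, p95)   # 6.15 26.75 52.5 78.25 98.85
print(value_range, iqr)          # 103 51.5
```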

Descriptive statistics

Standard deviation: 30.166206
Coefficient of variation (CV): 0.5745944
Kurtosis: -1.2
Mean: 52.5
Median Absolute Deviation (MAD): 26
Skewness: 0
Sum: 5460
Variance: 910
Monotonicity: Strictly increasing
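
Most of the descriptive statistics above can be re-derived by hand. The sketch assumes again that 연번 is the run 1..104, and that the variance uses the sample (n - 1) denominator; skewness and kurtosis are left out because their exact estimator is not stated in the report.

```python
import math
import statistics

# Assumption: the serial-number column is the run 1..104.
serial = list(range(1, 105))

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)   # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
med = statistics.median(serial)
mad = statistics.median(abs(x - med) for x in serial)

# Strictly increasing: every value is larger than its predecessor.
strictly_increasing = all(a < b for a, b in zip(serial, serial[1:]))

print(total, mean, variance)             # 5460 52.5 910
print(round(std, 6), round(cv, 7), mad)  # 30.166206 0.5745944 26.0
print(strictly_increasing)               # True
```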
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.0%
54 | 1 | 1.0%
78 | 1 | 1.0%
77 | 1 | 1.0%
76 | 1 | 1.0%
75 | 1 | 1.0%
74 | 1 | 1.0%
73 | 1 | 1.0%
72 | 1 | 1.0%
71 | 1 | 1.0%
Other values (94) | 94 | 90.4%
Value | Count | Frequency (%)
1 | 1 | 1.0%
2 | 1 | 1.0%
3 | 1 | 1.0%
4 | 1 | 1.0%
5 | 1 | 1.0%
6 | 1 | 1.0%
7 | 1 | 1.0%
8 | 1 | 1.0%
9 | 1 | 1.0%
10 | 1 | 1.0%

Value | Count | Frequency (%)
104 | 1 | 1.0%
103 | 1 | 1.0%
102 | 1 | 1.0%
101 | 1 | 1.0%
100 | 1 | 1.0%
99 | 1 | 1.0%
98 | 1 | 1.0%
97 | 1 | 1.0%
96 | 1 | 1.0%
95 | 1 | 1.0%

건축사사무소 명
Text

UNIQUE 

Distinct: 104
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 17
Median length: 9
Mean length: 9.8173077
Min length: 6

Characters and Unicode

Total characters: 1021
Distinct characters: 131
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 104
Unique (%): 100.0%

Sample

1st row: 건축사(사) 동신
2nd row: 삼원 건축사(사)
3rd row: 강 건축사(사)
4th row: 신명 건축사(사)
5th row: (주)종합건축사사무소 구일
ValueCountFrequency (%)
건축사사무소 45
23.4%
건축사(사 36
 
18.8%
한빛 1
 
0.5%
도원 1
 
0.5%
비채 1
 
0.5%
아크 1
 
0.5%
1
 
0.5%
㈜탑이엔씨 1
 
0.5%
어울림 1
 
0.5%
연지 1
 
0.5%
Other values (103) 103
53.6%

Most occurring characters

ValueCountFrequency (%)
206
20.2%
109
10.7%
108
10.6%
89
 
8.7%
( 58
 
5.7%
) 58
 
5.7%
54
 
5.3%
53
 
5.2%
16
 
1.6%
11
 
1.1%
Other values (121) 259
25.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 806
78.9%
Space Separator 89
 
8.7%
Open Punctuation 58
 
5.7%
Close Punctuation 58
 
5.7%
Other Symbol 5
 
0.5%
Uppercase Letter 2
 
0.2%
Decimal Number 2
 
0.2%
Other Punctuation 1
 
0.1%
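
The category names in the table above are Unicode general categories. As a rough illustration, the sketch below tallies them with Python's `unicodedata` module over the five sample rows listed for this variable, not the full column, so the counts are much smaller than in the table. The mapping from two-letter codes to the report's labels is spelled out in the comments.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for 건축사사무소 명 in this report.
sample_names = [
    "건축사(사) 동신",
    "삼원 건축사(사)",
    "강 건축사(사)",
    "신명 건축사(사)",
    "(주)종합건축사사무소 구일",
]

# Lo = Other Letter, Zs = Space Separator,
# Ps = Open Punctuation, Pe = Close Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for name in sample_names for ch in name
)
print(dict(category_counts))

# The single-character form ㈜ (U+3231) is a symbol, not a letter,
# which is where an "Other Symbol" row can come from.
stock_symbol_category = unicodedata.category("\u3231")
print(stock_symbol_category)  # So
```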

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
25.6%
109
13.5%
108
13.4%
54
 
6.7%
53
 
6.6%
16
 
2.0%
11
 
1.4%
9
 
1.1%
9
 
1.1%
8
 
1.0%
Other values (112) 223
27.7%
Uppercase Letter
ValueCountFrequency (%)
E 1
50.0%
C 1
50.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
89
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 811
79.4%
Common 208
 
20.4%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
25.4%
109
13.4%
108
13.3%
54
 
6.7%
53
 
6.5%
16
 
2.0%
11
 
1.4%
9
 
1.1%
9
 
1.1%
8
 
1.0%
Other values (113) 228
28.1%
Common
ValueCountFrequency (%)
89
42.8%
( 58
27.9%
) 58
27.9%
& 1
 
0.5%
2 1
 
0.5%
1 1
 
0.5%
Latin
ValueCountFrequency (%)
E 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 805
78.8%
ASCII 210
 
20.6%
None 5
 
0.5%
Compat Jamo 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
206
25.6%
109
13.5%
108
13.4%
54
 
6.7%
53
 
6.6%
16
 
2.0%
11
 
1.4%
9
 
1.1%
9
 
1.1%
8
 
1.0%
Other values (111) 222
27.6%
ASCII
ValueCountFrequency (%)
89
42.4%
( 58
27.6%
) 58
27.6%
E 1
 
0.5%
& 1
 
0.5%
C 1
 
0.5%
2 1
 
0.5%
1 1
 
0.5%
None
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

대표자
Text

UNIQUE 

Distinct: 104
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 4
Median length: 3
Mean length: 3.0288462
Min length: 2

Characters and Unicode

Total characters: 315
Distinct characters: 98
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 104
Unique (%): 100.0%

Sample

1st row: 박재광
2nd row: 임병욱
3rd row: 강대홍
4th row: 신종수
5th row: 김정겸
Value | Count | Frequency (%)
박재광 | 1 | 1.0%
임병욱 | 1 | 1.0%
서원덕 | 1 | 1.0%
이재환 | 1 | 1.0%
박정익 | 1 | 1.0%
송병한 | 1 | 1.0%
김정철 | 1 | 1.0%
이영욱 | 1 | 1.0%
조재성 | 1 | 1.0%
천용석 | 1 | 1.0%
Other values (94) | 94 | 90.4%

Most occurring characters

ValueCountFrequency (%)
20
 
6.3%
17
 
5.4%
13
 
4.1%
10
 
3.2%
9
 
2.9%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
7
 
2.2%
Other values (88) 208
66.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 311
98.7%
Space Separator 4
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
6.4%
17
 
5.5%
13
 
4.2%
10
 
3.2%
9
 
2.9%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
Other values (87) 204
65.6%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 311
98.7%
Common 4
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
6.4%
17
 
5.5%
13
 
4.2%
10
 
3.2%
9
 
2.9%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
Other values (87) 204
65.6%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 311
98.7%
ASCII 4
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
6.4%
17
 
5.5%
13
 
4.2%
10
 
3.2%
9
 
2.9%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
Other values (87) 204
65.6%
ASCII
ValueCountFrequency (%)
4
100.0%

주소
Text

Distinct: 101
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 48
Median length: 33
Mean length: 26.346154
Min length: 18

Characters and Unicode

Total characters: 2740
Distinct characters: 112
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 98
Unique (%): 94.2%

Sample

1st row: 경상북도 구미시 고아읍 원대로 113-7, 헤아림7호
2nd row: 경상북도 구미시 송원서로 13, 3층(송정동,삼원빌딩)
3rd row: 경상북도 구미시 금오시장로2길 27 (원평동)
4th row: 경상북도 구미시 송동로 80, 2층 (도량동)
5th row: 경상북도 구미시 송정대로26-1, 3층 (송정동)
Value | Count | Frequency (%)
경상북도 | 104 | 18.8%
구미시 | 104 | 18.8%
2층 | 17 | 3.1%
송원서로 | 17 | 3.1%
3층 | 17 | 3.1%
송정동 | 14 | 2.5%
원평동 | 7 | 1.3%
상사동로 | 6 | 1.1%
고아읍 | 5 | 0.9%
송정대로 | 5 | 0.9%
Other values (191) | 257 | 46.5%

Most occurring characters

ValueCountFrequency (%)
472
 
17.2%
113
 
4.1%
113
 
4.1%
112
 
4.1%
111
 
4.1%
108
 
3.9%
107
 
3.9%
105
 
3.8%
97
 
3.5%
2 95
 
3.5%
Other values (102) 1307
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1551
56.6%
Space Separator 472
 
17.2%
Decimal Number 454
 
16.6%
Other Punctuation 82
 
3.0%
Open Punctuation 76
 
2.8%
Close Punctuation 76
 
2.8%
Dash Punctuation 27
 
1.0%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
113
 
7.3%
113
 
7.3%
112
 
7.2%
111
 
7.2%
108
 
7.0%
107
 
6.9%
105
 
6.8%
97
 
6.3%
91
 
5.9%
69
 
4.4%
Other values (84) 525
33.8%
Decimal Number
ValueCountFrequency (%)
2 95
20.9%
1 90
19.8%
3 72
15.9%
0 44
9.7%
5 35
 
7.7%
6 31
 
6.8%
7 30
 
6.6%
4 27
 
5.9%
9 18
 
4.0%
8 12
 
2.6%
Other Punctuation
ValueCountFrequency (%)
, 81
98.8%
/ 1
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
A 1
50.0%
Space Separator
ValueCountFrequency (%)
472
100.0%
Open Punctuation
ValueCountFrequency (%)
( 76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 76
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1551
56.6%
Common 1187
43.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
113
 
7.3%
113
 
7.3%
112
 
7.2%
111
 
7.2%
108
 
7.0%
107
 
6.9%
105
 
6.8%
97
 
6.3%
91
 
5.9%
69
 
4.4%
Other values (84) 525
33.8%
Common
ValueCountFrequency (%)
472
39.8%
2 95
 
8.0%
1 90
 
7.6%
, 81
 
6.8%
( 76
 
6.4%
) 76
 
6.4%
3 72
 
6.1%
0 44
 
3.7%
5 35
 
2.9%
6 31
 
2.6%
Other values (6) 115
 
9.7%
Latin
ValueCountFrequency (%)
B 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1551
56.6%
ASCII 1189
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
472
39.7%
2 95
 
8.0%
1 90
 
7.6%
, 81
 
6.8%
( 76
 
6.4%
) 76
 
6.4%
3 72
 
6.1%
0 44
 
3.7%
5 35
 
2.9%
6 31
 
2.6%
Other values (8) 117
 
9.8%
Hangul
ValueCountFrequency (%)
113
 
7.3%
113
 
7.3%
112
 
7.2%
111
 
7.2%
108
 
7.0%
107
 
6.9%
105
 
6.8%
97
 
6.3%
91
 
5.9%
69
 
4.4%
Other values (84) 525
33.8%

연락처
Text

MISSING 

Distinct: 97
Distinct (%): 99.0%
Missing: 6
Missing (%): 5.8%
Memory size: 964.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 1176
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 96
Unique (%): 98.0%
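
The distinct and unique percentages for 연락처 only add up if they are taken over the 98 non-missing values rather than all 104 rows. The sketch below checks that reading; it is an inference from the numbers in this report, not a statement about the profiler's internals.

```python
# Figures copied from the 연락처 section of this report.
n_rows = 104
n_missing = 6
n_distinct = 97
n_unique = 96   # values that occur exactly once

n_present = n_rows - n_missing

missing_pct = round(100 * n_missing / n_rows, 1)
# Assumption: distinct/unique percentages use non-missing values as the base.
distinct_pct = round(100 * n_distinct / n_present, 1)
unique_pct = round(100 * n_unique / n_present, 1)

# 97 distinct values with 96 singletons leaves one value that appears twice:
# 96 * 1 + 1 * 2 = 98 non-missing entries.
repeated_values = n_distinct - n_unique
entries_accounted_for = n_unique + 2 * repeated_values

print(missing_pct, distinct_pct, unique_pct)   # 5.8 99.0 98.0
print(entries_accounted_for == n_present)      # True
```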

Sample

1st row: 054-452-4375
2nd row: 054-452-7903
3rd row: 054-452-4776
4th row: 054-453-0615
5th row: 054-455-9190
Value | Count | Frequency (%)
054-481-1668 | 2 | 2.0%
054-464-3877 | 1 | 1.0%
054-452-4375 | 1 | 1.0%
054-452-1339 | 1 | 1.0%
054-462-1190 | 1 | 1.0%
054-444-3094 | 1 | 1.0%
054-461-1025 | 1 | 1.0%
054-442-4539 | 1 | 1.0%
054-457-1788 | 1 | 1.0%
054-462-6513 | 1 | 1.0%
Other values (87) | 87 | 88.8%

Most occurring characters

ValueCountFrequency (%)
4 254
21.6%
- 196
16.7%
5 190
16.2%
0 156
13.3%
7 88
 
7.5%
1 61
 
5.2%
6 61
 
5.2%
3 49
 
4.2%
2 46
 
3.9%
8 44
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 980
83.3%
Dash Punctuation 196
 
16.7%
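
The category totals above are what a fixed `NNN-NNN-NNNN` layout predicts. The sketch below checks the five sample rows against that pattern and scales the per-value character counts to the 98 non-missing rows. The regular expression is an assumption inferred from the samples and from every value having length 12.

```python
import re
import unicodedata
from collections import Counter

# The five sample rows shown for 연락처 in this report.
sample_phones = [
    "054-452-4375",
    "054-452-7903",
    "054-452-4776",
    "054-453-0615",
    "054-455-9190",
]

# Assumed layout: 3 digits, dash, 3 digits, dash, 4 digits (12 characters).
PHONE_RE = re.compile(r"[0-9]{3}-[0-9]{3}-[0-9]{4}")
all_match = all(PHONE_RE.fullmatch(p) is not None for p in sample_phones)

# Nd = Decimal Number, Pd = Dash Punctuation.
per_value = Counter(unicodedata.category(ch) for ch in sample_phones[0])

n_present = 104 - 6   # rows minus missing values
totals = {cat: count * n_present for cat, count in per_value.items()}

print(all_match)              # True
print(totals)                 # {'Nd': 980, 'Pd': 196}
print(sum(totals.values()))   # 1176
```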

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 254
25.9%
5 190
19.4%
0 156
15.9%
7 88
 
9.0%
1 61
 
6.2%
6 61
 
6.2%
3 49
 
5.0%
2 46
 
4.7%
8 44
 
4.5%
9 31
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1176
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 254
21.6%
- 196
16.7%
5 190
16.2%
0 156
13.3%
7 88
 
7.5%
1 61
 
5.2%
6 61
 
5.2%
3 49
 
4.2%
2 46
 
3.9%
8 44
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1176
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 254
21.6%
- 196
16.7%
5 190
16.2%
0 156
13.3%
7 88
 
7.5%
1 61
 
5.2%
6 61
 
5.2%
3 49
 
4.2%
2 46
 
3.9%
8 44
 
3.7%

관리기관전화번호
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B
054-480-5512
104 

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 054-480-5512
2nd row: 054-480-5512
3rd row: 054-480-5512
4th row: 054-480-5512
5th row: 054-480-5512

Common Values

ValueCountFrequency (%)
054-480-5512 104
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
054-480-5512 104
100.0%

관리기관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B
경상북도 구미시청
104 

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상북도 구미시청
2nd row: 경상북도 구미시청
3rd row: 경상북도 구미시청
4th row: 경상북도 구미시청
5th row: 경상북도 구미시청

Common Values

ValueCountFrequency (%)
경상북도 구미시청 104
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
경상북도 104
50.0%
구미시청 104
50.0%

Interactions


Correlations

Variable | 연번 | 연락처
연번 | 1.000 | 0.923
연락처 | 0.923 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 건축사사무소 명 | 대표자 | 주소 | 연락처 | 관리기관전화번호 | 관리기관명
0 | 1 | 건축사(사) 동신 | 박재광 | 경상북도 구미시 고아읍 원대로 113-7, 헤아림7호 | 054-452-4375 | 054-480-5512 | 경상북도 구미시청
1 | 2 | 삼원 건축사(사) | 임병욱 | 경상북도 구미시 송원서로 13, 3층(송정동,삼원빌딩) | 054-452-7903 | 054-480-5512 | 경상북도 구미시청
2 | 3 | 강 건축사(사) | 강대홍 | 경상북도 구미시 금오시장로2길 27 (원평동) | 054-452-4776 | 054-480-5512 | 경상북도 구미시청
3 | 4 | 신명 건축사(사) | 신종수 | 경상북도 구미시 송동로 80, 2층 (도량동) | 054-453-0615 | 054-480-5512 | 경상북도 구미시청
4 | 5 | (주)종합건축사사무소 구일 | 김정겸 | 경상북도 구미시 송정대로26-1, 3층 (송정동) | 054-455-9190 | 054-480-5512 | 경상북도 구미시청
5 | 6 | 삼웅 건축사(사) | 한정우 | 경상북도 구미시 신시로 10길 135 신영타운 2층 | 054-457-2724 | 054-480-5512 | 경상북도 구미시청
6 | 7 | 승민 건축사(사) | 김민성 | 경상북도 구미시 형곡로 38, 2층(형곡동) | 054-457-7778 | 054-480-5512 | 경상북도 구미시청
7 | 8 | 건원 건축사(사) | 김용택 | 경상북도 구미시 송원서로 21(송정동 3층) | 054-457-8877 | 054-480-5512 | 경상북도 구미시청
8 | 9 | 그룹원건축사(사) | 손창호 | 경상북도 구미시 송원서로 9, 2층 (송정동) | 054-457-9262 | 054-480-5512 | 경상북도 구미시청
9 | 10 | 건축사(사) 도솔 | 김창호 | 경상북도 구미시 송원서로2길 13, 2층 | 054-458-2676 | 054-480-5512 | 경상북도 구미시청

(index) | 연번 | 건축사사무소 명 | 대표자 | 주소 | 연락처 | 관리기관전화번호 | 관리기관명
94 | 95 | 이루다 건축사사무소 | 장광수 | 경상북도 구미시 경은로 30 2층 | 054-604-0523 | 054-480-5512 | 경상북도 구미시청
95 | 96 | 대가 건축사사무소 | 최정윤 | 경상북도 구미시 송정대로 20, 3층 | 054-456-1755 | 054-480-5512 | 경상북도 구미시청
96 | 97 | 건축사사무소 그룹원 | 황혜선 | 경상북도 구미시 송원서로 9, 동진빌딩2층 | 054-457-8798 | 054-480-5512 | 경상북도 구미시청
97 | 98 | 건축사사무소 도형 | 박문섭 | 경상북도 구미시 형곡로36길 23-12, 200호(형곡동) | 054-452-6648 | 054-480-5512 | 경상북도 구미시청
98 | 99 | 제이원 건축사사무소 | 장진주 | 경상북도 구미시 송원서로 27, 2층(송정동) | 054-608-0029 | 054-480-5512 | 경상북도 구미시청
99 | 100 | 폴리머 건축사사무소 | 임현주 | 경상북도 구미시 문장로 142-30 | <NA> | 054-480-5512 | 경상북도 구미시청
100 | 101 | 태성종합기술 | 조형준 | 경상북도 구미시 백산로 167, 3층 | 054-464-1273 | 054-480-5512 | 경상북도 구미시청
101 | 102 | 건축사사무소 원건축 | 배상원 | 경상북도 구미시 고아읍 동평로 66 302호 | 054-442-8869 | 054-480-5512 | 경상북도 구미시청
102 | 103 | 건축사사무소 혜현 | 정민혜 | 경상북도 구미시 인동북길 91 2층 | <NA> | 054-480-5512 | 경상북도 구미시청
103 | 104 | 삼안 건축사사무소 | 안우찬 | 경상북도 구미시 산책길 55-31 3층 | <NA> | 054-480-5512 | 경상북도 구미시청