Overview

Dataset statistics

Number of variables: 5
Number of observations: 209
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.5 KiB
Average record size in memory: 41.6 B
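
As a quick consistency check, the average record size can be approximated by dividing the total in-memory size by the number of observations. The sketch below assumes 1 KiB = 1024 bytes and uses the rounded 8.5 KiB figure, so it only approximately reproduces the reported 41.6 B.

```python
# Figures copied from the Dataset statistics in this report.
total_size_kib = 8.5      # rounded total size in memory
n_observations = 209

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B per record")
```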

Variable types

Text: 4
Numeric: 1

Dataset

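Description: Data on the status of Korea Housing Builders Association member companies in Daejeon Metropolitan City, providing fields such as company name, address, registered postal code, and telephone number.
URL: https://www.data.go.kr/data/15061028/fileData.do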

Alerts

상호명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 15:04:26.293325
Analysis finished: 2023-12-12 15:04:26.827902
Duration: 0.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

상호명
Text

UNIQUE 

Distinct: 209
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 15
Median length: 12
Mean length: 8.1052632
Min length: 3

Characters and Unicode

Total characters: 1694
Distinct characters: 199
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
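
As a rough illustration of how such character properties can be read off, the sketch below uses Python's standard `unicodedata` module to count general categories over the first five sample values of 상호명. The helper name `category_counts` is hypothetical, and this is not necessarily how ydata-profiling builds its tables. In the output, `Lo` corresponds to "Other Letter", `Ps` to "Open Punctuation", and `Pe` to "Close Punctuation".

```python
import unicodedata
from collections import Counter

# First five sample values of 상호명, copied from this report.
names = ["(주)가양", "(주)가화건설", "(주)강산환경", "(주)건강", "(주)건강건설"]

def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Ps', 'Pe') over all characters."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

counts = category_counts(names)
print(counts)
```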

Unique

Unique: 209
Unique (%): 100.0%

Sample

1st row: (주)가양
2nd row: (주)가화건설
3rd row: (주)강산환경
4th row: (주)건강
5th row: (주)건강건설

Value | Count | Frequency (%)
주)가양 | 1 | 0.5%
주)아이에스종합건설 | 1 | 0.5%
이음종합건설(주 | 1 | 0.5%
웅비건설(주 | 1 | 0.5%
원창건설(주 | 1 | 0.5%
주)원평종합건설 | 1 | 0.5%
주)원플러스디앤씨 | 1 | 0.5%
유니원종합건설(주 | 1 | 0.5%
주)유원건설산업 | 1 | 0.5%
주)유토개발1차 | 1 | 0.5%
Other values (199) | 199 | 95.2%
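
The truncated tokens in the word table (for example `주)가양` rather than `(주)가양`) are consistent with splitting each value on whitespace and then stripping leading and trailing punctuation. The sketch below reproduces that behaviour with the standard library only. The function name `tokens` is hypothetical, and whether ydata-profiling tokenizes exactly this way is an assumption.

```python
import string
from collections import Counter

# Characters stripped from both ends of each token (ASCII punctuation and whitespace).
STRIP_CHARS = string.punctuation + string.whitespace

def tokens(value):
    """Split on whitespace, strip edge punctuation, and drop empty tokens."""
    stripped = (part.strip(STRIP_CHARS) for part in value.lower().split())
    return [tok for tok in stripped if tok]

# Sample values copied from this report.
sample = ["(주)가양", "이음종합건설(주)", "대전 유성구 대학로 55-35, 201호"]
word_counts = Counter(tok for value in sample for tok in tokens(value))
print(word_counts.most_common(3))
```

Only the outermost punctuation is removed, which is why the inner parenthesis of `(주)` survives in each token.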

Most occurring characters

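Value | Count | Frequency (%)
[character missing] | 216 | 12.8%
( | 203 | 12.0%
) | 203 | 12.0%
[character missing] | 114 | 6.7%
[character missing] | 102 | 6.0%
[character missing] | 46 | 2.7%
[character missing] | 43 | 2.5%
[character missing] | 34 | 2.0%
[character missing] | 34 | 2.0%
[character missing] | 33 | 1.9%
Other values (189) | 666 | 39.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1285 | 75.9%
Open Punctuation | 203 | 12.0%
Close Punctuation | 203 | 12.0%
Decimal Number | 3 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character missing] | 216 | 16.8%
[character missing] | 114 | 8.9%
[character missing] | 102 | 7.9%
[character missing] | 46 | 3.6%
[character missing] | 43 | 3.3%
[character missing] | 34 | 2.6%
[character missing] | 34 | 2.6%
[character missing] | 33 | 2.6%
[character missing] | 31 | 2.4%
[character missing] | 27 | 2.1%
Other values (185) | 605 | 47.1%

Decimal Number
Value | Count | Frequency (%)
1 | 2 | 66.7%
2 | 1 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 203 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 203 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1285 | 75.9%
Common | 409 | 24.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character missing] | 216 | 16.8%
[character missing] | 114 | 8.9%
[character missing] | 102 | 7.9%
[character missing] | 46 | 3.6%
[character missing] | 43 | 3.3%
[character missing] | 34 | 2.6%
[character missing] | 34 | 2.6%
[character missing] | 33 | 2.6%
[character missing] | 31 | 2.4%
[character missing] | 27 | 2.1%
Other values (185) | 605 | 47.1%

Common
Value | Count | Frequency (%)
( | 203 | 49.6%
) | 203 | 49.6%
1 | 2 | 0.5%
2 | 1 | 0.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1285 | 75.9%
ASCII | 409 | 24.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[character missing] | 216 | 16.8%
[character missing] | 114 | 8.9%
[character missing] | 102 | 7.9%
[character missing] | 46 | 3.6%
[character missing] | 43 | 3.3%
[character missing] | 34 | 2.6%
[character missing] | 34 | 2.6%
[character missing] | 33 | 2.6%
[character missing] | 31 | 2.4%
[character missing] | 27 | 2.1%
Other values (185) | 605 | 47.1%

ASCII
Value | Count | Frequency (%)
( | 203 | 49.6%
) | 203 | 49.6%
1 | 2 | 0.5%
2 | 1 | 0.2%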

등록지주소
Text

Distinct: 203
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 34
Median length: 29
Mean length: 21.009569
Min length: 11

Characters and Unicode

Total characters: 4391
Distinct characters: 143
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 197
Unique (%): 94.3%

Sample

1st row: 대전 유성구 대학로 55-35, 201호
2nd row: 대전 서구 문정로2번길 113, 둔산탑스빌 202호
3rd row: 대전 유성구 원내로 13, 지하층
4th row: 대전 중구 대종로 544, 106호
5th row: 대전 중구 대종로 544, 102호

Value | Count | Frequency (%)
대전 | 209 | 20.1%
서구 | 90 | 8.7%
유성구 | 64 | 6.2%
중구 | 25 | 2.4%
2층 | 18 | 1.7%
동구 | 17 | 1.6%
201호 | 17 | 1.6%
대덕구 | 13 | 1.2%
3층 | 11 | 1.1%
1층 | 11 | 1.1%
Other values (368) | 565 | 54.3%

Most occurring characters

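Value | Count | Frequency (%)
(space) | 831 | 18.9%
[character missing] | 291 | 6.6%
1 | 233 | 5.3%
[character missing] | 217 | 4.9%
[character missing] | 215 | 4.9%
2 | 201 | 4.6%
[character missing] | 199 | 4.5%
, | 180 | 4.1%
0 | 160 | 3.6%
3 | 148 | 3.4%
Other values (133) | 1716 | 39.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2137 | 48.7%
Decimal Number | 1208 | 27.5%
Space Separator | 831 | 18.9%
Other Punctuation | 180 | 4.1%
Dash Punctuation | 33 | 0.8%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character missing] | 291 | 13.6%
[character missing] | 217 | 10.2%
[character missing] | 215 | 10.1%
[character missing] | 199 | 9.3%
[character missing] | 125 | 5.8%
[character missing] | 109 | 5.1%
[character missing] | 106 | 5.0%
[character missing] | 96 | 4.5%
[character missing] | 74 | 3.5%
[character missing] | 74 | 3.5%
Other values (118) | 631 | 29.5%

Decimal Number
Value | Count | Frequency (%)
1 | 233 | 19.3%
2 | 201 | 16.6%
0 | 160 | 13.2%
3 | 148 | 12.3%
5 | 111 | 9.2%
4 | 100 | 8.3%
6 | 73 | 6.0%
7 | 71 | 5.9%
8 | 56 | 4.6%
9 | 55 | 4.6%

Space Separator
Value | Count | Frequency (%)
(space) | 831 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 180 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 33 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2254 | 51.3%
Hangul | 2137 | 48.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character missing] | 291 | 13.6%
[character missing] | 217 | 10.2%
[character missing] | 215 | 10.1%
[character missing] | 199 | 9.3%
[character missing] | 125 | 5.8%
[character missing] | 109 | 5.1%
[character missing] | 106 | 5.0%
[character missing] | 96 | 4.5%
[character missing] | 74 | 3.5%
[character missing] | 74 | 3.5%
Other values (118) | 631 | 29.5%

Common
Value | Count | Frequency (%)
(space) | 831 | 36.9%
1 | 233 | 10.3%
2 | 201 | 8.9%
, | 180 | 8.0%
0 | 160 | 7.1%
3 | 148 | 6.6%
5 | 111 | 4.9%
4 | 100 | 4.4%
6 | 73 | 3.2%
7 | 71 | 3.1%
Other values (5) | 146 | 6.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2254 | 51.3%
Hangul | 2137 | 48.7%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 831 | 36.9%
1 | 233 | 10.3%
2 | 201 | 8.9%
, | 180 | 8.0%
0 | 160 | 7.1%
3 | 148 | 6.6%
5 | 111 | 4.9%
4 | 100 | 4.4%
6 | 73 | 3.2%
7 | 71 | 3.1%
Other values (5) | 146 | 6.5%

Hangul
Value | Count | Frequency (%)
[character missing] | 291 | 13.6%
[character missing] | 217 | 10.2%
[character missing] | 215 | 10.1%
[character missing] | 199 | 9.3%
[character missing] | 125 | 5.8%
[character missing] | 109 | 5.1%
[character missing] | 106 | 5.0%
[character missing] | 96 | 4.5%
[character missing] | 74 | 3.5%
[character missing] | 74 | 3.5%
Other values (118) | 631 | 29.5%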

등록지우편번호
Real number (ℝ)

Distinct: 135
Distinct (%): 64.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34781.206
Minimum: 34012
Maximum: 35413
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.0 KiB

Quantile statistics

Minimum: 34012
5-th percentile: 34086.4
Q1: 34186
Median: 34884
Q3: 35262
95-th percentile: 35370
Maximum: 35413
Range: 1401
Interquartile range (IQR): 1076

Descriptive statistics

Standard deviation: 503.53812
Coefficient of variation (CV): 0.014477305
Kurtosis: -1.6756524
Mean: 34781.206
Median Absolute Deviation (MAD): 446
Skewness: -0.19845864
Sum: 7269272
Variance: 253550.64
Monotonicity: Not monotonic
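
Several of the figures reported for 등록지우편번호 follow from others by simple arithmetic. The sketch below recomputes them from the reported values, assuming the usual definitions: mean = sum / n, range = maximum - minimum, IQR = Q3 - Q1, variance = standard deviation squared, and CV = standard deviation / mean. It is a consistency check on the report, not a recomputation from the raw data.

```python
# Values copied from the statistics reported for 등록지우편번호.
n = 209
total = 7269272
std = 503.53812
minimum, maximum = 34012, 35413
q1, q3 = 34186, 35262

mean = total / n
derived = {
    "mean": mean,
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "variance": std ** 2,
    "cv": std / mean,
}
for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```

Small differences in the last digits are expected because the inputs are themselves rounded.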
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
34185 | 8 | 3.8%
34168 | 6 | 2.9%
35350 | 6 | 2.9%
35203 | 5 | 2.4%
35241 | 4 | 1.9%
34831 | 4 | 1.9%
35233 | 4 | 1.9%
35370 | 4 | 1.9%
35261 | 4 | 1.9%
34171 | 4 | 1.9%
Other values (125) | 160 | 76.6%

Minimum 10 values

Value | Count | Frequency (%)
34012 | 1 | 0.5%
34014 | 3 | 1.4%
34063 | 1 | 0.5%
34065 | 1 | 0.5%
34068 | 1 | 0.5%
34070 | 1 | 0.5%
34074 | 1 | 0.5%
34086 | 2 | 1.0%
34087 | 2 | 1.0%
34091 | 2 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
35413 | 1 | 0.5%
35412 | 1 | 0.5%
35401 | 2 | 1.0%
35399 | 1 | 0.5%
35387 | 1 | 0.5%
35382 | 1 | 0.5%
35379 | 1 | 0.5%
35370 | 4 | 1.9%
35368 | 3 | 1.4%
35363 | 1 | 0.5%

배달지주소
Text

Distinct: 201
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 43
Median length: 34
Mean length: 22.124402
Min length: 11

Characters and Unicode

Total characters: 4624
Distinct characters: 195
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 194
Unique (%): 92.8%

Sample

1st row: 대전 유성구 대학로 55-35, 201호
2nd row: 대전 서구 문정로2번길 113, 둔산탑스빌 202호
3rd row: 세종특별자치시 나성로 33-6, 705호
4th row: 대전 중구 대종로 544, 106호
5th row: 대전 중구 대종로 544, 102호

Value | Count | Frequency (%)
대전 | 192 | 17.9%
서구 | 87 | 8.1%
유성구 | 57 | 5.3%
2층 | 30 | 2.8%
중구 | 23 | 2.1%
동구 | 16 | 1.5%
201호 | 13 | 1.2%
대덕구 | 11 | 1.0%
1층 | 10 | 0.9%
4층 | 10 | 0.9%
Other values (416) | 624 | 58.2%

Most occurring characters

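Value | Count | Frequency (%)
(space) | 864 | 18.7%
[character missing] | 271 | 5.9%
1 | 246 | 5.3%
[character missing] | 214 | 4.6%
2 | 208 | 4.5%
[character missing] | 200 | 4.3%
[character missing] | 197 | 4.3%
, | 194 | 4.2%
0 | 159 | 3.4%
3 | 151 | 3.3%
Other values (185) | 1920 | 41.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2274 | 49.2%
Decimal Number | 1244 | 26.9%
Space Separator | 864 | 18.7%
Other Punctuation | 195 | 4.2%
Dash Punctuation | 34 | 0.7%
Uppercase Letter | 5 | 0.1%
Close Punctuation | 4 | 0.1%
Open Punctuation | 4 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character missing] | 271 | 11.9%
[character missing] | 214 | 9.4%
[character missing] | 200 | 8.8%
[character missing] | 197 | 8.7%
[character missing] | 119 | 5.2%
[character missing] | 114 | 5.0%
[character missing] | 106 | 4.7%
[character missing] | 94 | 4.1%
[character missing] | 87 | 3.8%
[character missing] | 71 | 3.1%
Other values (165) | 801 | 35.2%

Decimal Number
Value | Count | Frequency (%)
1 | 246 | 19.8%
2 | 208 | 16.7%
0 | 159 | 12.8%
3 | 151 | 12.1%
5 | 118 | 9.5%
4 | 101 | 8.1%
6 | 72 | 5.8%
7 | 70 | 5.6%
8 | 63 | 5.1%
9 | 56 | 4.5%

Uppercase Letter
Value | Count | Frequency (%)
B | 2 | 40.0%
K | 1 | 20.0%
G | 1 | 20.0%
T | 1 | 20.0%

Other Punctuation
Value | Count | Frequency (%)
, | 194 | 99.5%
& | 1 | 0.5%

Space Separator
Value | Count | Frequency (%)
(space) | 864 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 34 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2345 | 50.7%
Hangul | 2274 | 49.2%
Latin | 5 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character missing] | 271 | 11.9%
[character missing] | 214 | 9.4%
[character missing] | 200 | 8.8%
[character missing] | 197 | 8.7%
[character missing] | 119 | 5.2%
[character missing] | 114 | 5.0%
[character missing] | 106 | 4.7%
[character missing] | 94 | 4.1%
[character missing] | 87 | 3.8%
[character missing] | 71 | 3.1%
Other values (165) | 801 | 35.2%

Common
Value | Count | Frequency (%)
(space) | 864 | 36.8%
1 | 246 | 10.5%
2 | 208 | 8.9%
, | 194 | 8.3%
0 | 159 | 6.8%
3 | 151 | 6.4%
5 | 118 | 5.0%
4 | 101 | 4.3%
6 | 72 | 3.1%
7 | 70 | 3.0%
Other values (6) | 162 | 6.9%

Latin
Value | Count | Frequency (%)
B | 2 | 40.0%
K | 1 | 20.0%
G | 1 | 20.0%
T | 1 | 20.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2350 | 50.8%
Hangul | 2274 | 49.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 864 | 36.8%
1 | 246 | 10.5%
2 | 208 | 8.9%
, | 194 | 8.3%
0 | 159 | 6.8%
3 | 151 | 6.4%
5 | 118 | 5.0%
4 | 101 | 4.3%
6 | 72 | 3.1%
7 | 70 | 3.0%
Other values (10) | 167 | 7.1%

Hangul
Value | Count | Frequency (%)
[character missing] | 271 | 11.9%
[character missing] | 214 | 9.4%
[character missing] | 200 | 8.8%
[character missing] | 197 | 8.7%
[character missing] | 119 | 5.2%
[character missing] | 114 | 5.0%
[character missing] | 106 | 4.7%
[character missing] | 94 | 4.1%
[character missing] | 87 | 3.8%
[character missing] | 71 | 3.1%
Other values (165) | 801 | 35.2%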

배달지전화
Text

Distinct: 198
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 14
Median length: 12
Mean length: 12
Min length: 9

Characters and Unicode

Total characters: 2508
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 188
Unique (%): 90.0%

Sample

1st row: 042-822-0540
2nd row: 042-488-8404
3rd row: 1877-6557
4th row: 042-537-1122
5th row: 042-537-1122

Value | Count | Frequency (%)
042-582-0420 | 3 | 1.4%
042-256-0433 | 2 | 1.0%
042-536-4307 | 2 | 1.0%
042-528-1600 | 2 | 1.0%
042-482-8793 | 2 | 1.0%
042-536-9066 | 2 | 1.0%
042-824-1847 | 2 | 1.0%
042-630-9429 | 2 | 1.0%
042-581-9811 | 2 | 1.0%
042-537-1122 | 2 | 1.0%
Other values (188) | 188 | 90.0%

Most occurring characters

Value | Count | Frequency (%)
2 | 421 | 16.8%
- | 416 | 16.6%
0 | 393 | 15.7%
4 | 334 | 13.3%
8 | 192 | 7.7%
5 | 160 | 6.4%
3 | 152 | 6.1%
6 | 135 | 5.4%
1 | 132 | 5.3%
7 | 110 | 4.4%
Other values (2) | 63 | 2.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2091 | 83.4%
Dash Punctuation | 416 | 16.6%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 421 | 20.1%
0 | 393 | 18.8%
4 | 334 | 16.0%
8 | 192 | 9.2%
5 | 160 | 7.7%
3 | 152 | 7.3%
6 | 135 | 6.5%
1 | 132 | 6.3%
7 | 110 | 5.3%
9 | 62 | 3.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 416 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2508 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 421 | 16.8%
- | 416 | 16.6%
0 | 393 | 15.7%
4 | 334 | 13.3%
8 | 192 | 7.7%
5 | 160 | 6.4%
3 | 152 | 6.1%
6 | 135 | 5.4%
1 | 132 | 5.3%
7 | 110 | 4.4%
Other values (2) | 63 | 2.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2508 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 421 | 16.8%
- | 416 | 16.6%
0 | 393 | 15.7%
4 | 334 | 13.3%
8 | 192 | 7.7%
5 | 160 | 6.4%
3 | 152 | 6.1%
6 | 135 | 5.4%
1 | 132 | 5.3%
7 | 110 | 4.4%
Other values (2) | 63 | 2.5%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
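
The per-column nullity that these charts summarise can be sketched as a simple count of missing cells per column. The example below uses two rows copied from the Sample section; the helper name `missing_per_column` is hypothetical, and treating an empty string as missing is an assumption made for illustration (ydata-profiling may define missingness differently).

```python
# Column names and two rows copied from the Sample section of this report.
columns = ["상호명", "등록지주소", "등록지우편번호", "배달지주소", "배달지전화"]
rows = [
    ("(주)가양", "대전 유성구 대학로 55-35, 201호", 34168,
     "대전 유성구 대학로 55-35, 201호", "042-822-0540"),
    ("(주)강산환경", "대전 유성구 원내로 13, 지하층", 34228,
     "세종특별자치시 나성로 33-6, 705호", "1877-6557"),
]

def missing_per_column(columns, rows):
    """Return {column: number of cells that are None or an empty string}."""
    missing = {name: 0 for name in columns}
    for row in rows:
        for name, cell in zip(columns, row):
            # Assumption: None and "" both count as missing.
            if cell is None or cell == "":
                missing[name] += 1
    return missing

nullity = missing_per_column(columns, rows)
print(nullity)
```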

Sample

First rows

Index | 상호명 | 등록지주소 | 등록지우편번호 | 배달지주소 | 배달지전화
0 | (주)가양 | 대전 유성구 대학로 55-35, 201호 | 34168 | 대전 유성구 대학로 55-35, 201호 | 042-822-0540
1 | (주)가화건설 | 대전 서구 문정로2번길 113, 둔산탑스빌 202호 | 35262 | 대전 서구 문정로2번길 113, 둔산탑스빌 202호 | 042-488-8404
2 | (주)강산환경 | 대전 유성구 원내로 13, 지하층 | 34228 | 세종특별자치시 나성로 33-6, 705호 | 1877-6557
3 | (주)건강 | 대전 중구 대종로 544, 106호 | 34831 | 대전 중구 대종로 544, 106호 | 042-537-1122
4 | (주)건강건설 | 대전 중구 대종로 544, 102호 | 34831 | 대전 중구 대종로 544, 102호 | 042-537-1122
5 | 건국건설(주) | 대전 서구 대덕대로220번길 35, 5층 | 35233 | 대전 서구 대덕대로220번길 35, 5층 | 042-523-2664
6 | 건동이엔씨(주) | 대전 서구 동서대로 682, 505호 | 35350 | 대전 서구 동서대로 682, 505호 | 042-484-7222
7 | 계룡산업(주) | 대전 서구 월평중로 25 | 35222 | 대전 서구 월평중로 25, 4층 | 042-716-1234
8 | 공간종합건설(주) | 대전 서구 제비네1길 4, 4층 | 35334 | 대전 서구 제비네1길 4, 4층 | 042-623-6700
9 | (주)공실 | 대전 서구 월평중로13번길 36, 301호 | 35225 | 대전 서구 월평중로13번길 36, 301호 | 042-535-0724

Last rows

Index | 상호명 | 등록지주소 | 등록지우편번호 | 배달지주소 | 배달지전화
199 | (주)한빛진 | 대전 중구 동서대로1327번길 115, 2층 | 34822 | 대전 중구 동서대로1327번길 115, 2층 | 042-223-8400
200 | 한송건설(주) | 대전 서구 둔산대로117번길 29, 4층 | 35203 | 대전 서구 둔산대로117번길 29, 4층 | 042-472-5011
201 | 해천종합건설(주) | 대전 동구 현암로72번길 19 , 2층 | 34564 | 대전 동구 현암로72번길 19, 2층 | 042-633-8820
202 | 현강건설(주) | 대전 서구 문정로90번길 57, 제3층 305호 | 35263 | 대전 서구 문정로90번길 57, 제3층 305호 | 042-545-8861
203 | (주)현풍건설 | 대전 서구 구봉산북로 7, 301-1호 | 35370 | 대전 서구 구봉산북로 7, 301-1호 | 042-825-2659
204 | (주)혜담종합건설 | 대전 서구 도안북로93번길 10-19, 501호 | 35350 | 대전 서구 도안북로93번길 10-19, 501호 | 042-710-8318
205 | 홍익개발(주) | 대전 서구 벌곡로1379번길 6, 2층 | 35387 | 대전 서구 벌곡로1379번길 6, 2층 | 042-348-3800
206 | 홍진종합건설(주) | 대전 서구 도산로 128, 201호 | 35330 | 대전 서구 도산로 128, 201호 | 042-532-2335
207 | 환인종합건설(주) | 대전 유성구 온천동로65번길 26, 3층 | 34185 | 대전 유성구 온천동로65번길 26, 3층 | 042-826-1060
208 | 힐링건설(주) | 대전 동구 달기장1길 9, 비02호 비03호 | 34667 | 대전 동구 달기장1길 9, 상가1층 | 042-582-0420