Overview

Dataset statistics

Number of variables5
Number of observations270
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.2 KiB
Average record size in memory42.5 B

Variable types

Numeric2
Text2
Categorical1

Dataset

Description대전광역시 동구 관내 건설업등록 현황에 관한 데이터로서, 상호명과 업종구분, 영업소재지 등의 정보를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15067256/fileData.do

Alerts

연번 has unique valuesUnique
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:44:39.999840
Analysis finished2023-12-12 02:44:40.902400
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct270
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135.5
Minimum1
Maximum270
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2023-12-12T11:44:40.980158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.45
Q168.25
median135.5
Q3202.75
95-th percentile256.55
Maximum270
Range269
Interquartile range (IQR)134.5

Descriptive statistics

Standard deviation78.086491
Coefficient of variation (CV)0.57628406
Kurtosis-1.2
Mean135.5
Median Absolute Deviation (MAD)67.5
Skewness0
Sum36585
Variance6097.5
MonotonicityStrictly increasing
2023-12-12T11:44:41.148448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
187 1
 
0.4%
173 1
 
0.4%
174 1
 
0.4%
175 1
 
0.4%
176 1
 
0.4%
177 1
 
0.4%
178 1
 
0.4%
179 1
 
0.4%
180 1
 
0.4%
Other values (260) 260
96.3%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
270 1
0.4%
269 1
0.4%
268 1
0.4%
267 1
0.4%
266 1
0.4%
265 1
0.4%
264 1
0.4%
263 1
0.4%
262 1
0.4%
261 1
0.4%

상호
Text

UNIQUE 

Distinct270
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T11:44:41.401093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.3296296
Min length3

Characters and Unicode

Total characters1979
Distinct characters237
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique270 ?
Unique (%)100.0%

Sample

1st row(유)금영토건
2nd row(주)가보건설
3rd row(주)거우
4th row(주)굿모닝기업
5th row(주)금상특수건설
ValueCountFrequency (%)
유)금영토건 1
 
0.4%
정진더편한생활 1
 
0.4%
제일시스템 1
 
0.4%
원강산업(주 1
 
0.4%
원준종합설비 1
 
0.4%
유지공영(주 1
 
0.4%
유진도시가스 1
 
0.4%
유창건설주식회사 1
 
0.4%
유한회사건웅이엔씨 1
 
0.4%
유한회사국민이엔씨 1
 
0.4%
Other values (260) 260
96.3%
2023-12-12T11:44:41.819450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164
 
8.3%
125
 
6.3%
( 102
 
5.2%
) 102
 
5.2%
91
 
4.6%
85
 
4.3%
61
 
3.1%
58
 
2.9%
56
 
2.8%
56
 
2.8%
Other values (227) 1079
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1761
89.0%
Open Punctuation 102
 
5.2%
Close Punctuation 102
 
5.2%
Uppercase Letter 5
 
0.3%
Other Symbol 4
 
0.2%
Other Punctuation 4
 
0.2%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
 
9.3%
125
 
7.1%
91
 
5.2%
85
 
4.8%
61
 
3.5%
58
 
3.3%
56
 
3.2%
56
 
3.2%
36
 
2.0%
35
 
2.0%
Other values (216) 994
56.4%
Uppercase Letter
ValueCountFrequency (%)
G 1
20.0%
N 1
20.0%
E 1
20.0%
C 1
20.0%
B 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
. 1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 102
100.0%
Close Punctuation
ValueCountFrequency (%)
) 102
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1765
89.2%
Common 208
 
10.5%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
 
9.3%
125
 
7.1%
91
 
5.2%
85
 
4.8%
61
 
3.5%
58
 
3.3%
56
 
3.2%
56
 
3.2%
36
 
2.0%
35
 
2.0%
Other values (217) 998
56.5%
Latin
ValueCountFrequency (%)
G 1
16.7%
N 1
16.7%
E 1
16.7%
C 1
16.7%
n 1
16.7%
B 1
16.7%
Common
ValueCountFrequency (%)
( 102
49.0%
) 102
49.0%
, 3
 
1.4%
. 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1761
89.0%
ASCII 214
 
10.8%
None 4
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
164
 
9.3%
125
 
7.1%
91
 
5.2%
85
 
4.8%
61
 
3.5%
58
 
3.3%
56
 
3.2%
56
 
3.2%
36
 
2.0%
35
 
2.0%
Other values (216) 994
56.4%
ASCII
ValueCountFrequency (%)
( 102
47.7%
) 102
47.7%
, 3
 
1.4%
G 1
 
0.5%
N 1
 
0.5%
E 1
 
0.5%
. 1
 
0.5%
C 1
 
0.5%
n 1
 
0.5%
B 1
 
0.5%
None
ValueCountFrequency (%)
4
100.0%

업종
Categorical

Distinct47
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
가스난방공사업
101 
기계가스설비공사업
23 
실내건축공사업
21 
도장ㆍ습식ㆍ방수ㆍ석공사업
13 
상ㆍ하수도설비공사업
13 
Other values (42)
99 

Length

Max length64
Median length47
Mean length11.792593
Min length7

Unique

Unique24 ?
Unique (%)8.9%

Sample

1st row철근ㆍ콘크리트공사업, 지반조성ㆍ포장공사업, 시설물유지관리업, 금속창호ㆍ지붕건축물조립공사업, 도장ㆍ습식ㆍ방수ㆍ석공사업
2nd row상ㆍ하수도설비공사업, 지반조성ㆍ포장공사업
3rd row금속창호ㆍ지붕건축물조립공사업
4th row실내건축공사업, 시설물유지관리업
5th row시설물유지관리업

Common Values

ValueCountFrequency (%)
가스난방공사업 101
37.4%
기계가스설비공사업 23
 
8.5%
실내건축공사업 21
 
7.8%
도장ㆍ습식ㆍ방수ㆍ석공사업 13
 
4.8%
상ㆍ하수도설비공사업 13
 
4.8%
금속창호ㆍ지붕건축물조립공사업 10
 
3.7%
구조물해체ㆍ비계공사업 9
 
3.3%
철근ㆍ콘크리트공사업 8
 
3.0%
조경식재ㆍ시설물공사업 7
 
2.6%
시설물유지관리업 6
 
2.2%
Other values (37) 59
21.9%

Length

2023-12-12T11:44:41.996232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가스난방공사업 104
31.0%
실내건축공사업 32
 
9.6%
상ㆍ하수도설비공사업 31
 
9.3%
시설물유지관리업 31
 
9.3%
기계가스설비공사업 30
 
9.0%
도장ㆍ습식ㆍ방수ㆍ석공사업 28
 
8.4%
금속창호ㆍ지붕건축물조립공사업 20
 
6.0%
지반조성ㆍ포장공사업 18
 
5.4%
철근ㆍ콘크리트공사업 14
 
4.2%
조경식재ㆍ시설물공사업 12
 
3.6%
Other values (2) 15
 
4.5%
Distinct126
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34604.6
Minimum34502
Maximum34712
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2023-12-12T11:44:42.121733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34502
5-th percentile34517.35
Q134557
median34592.5
Q334656
95-th percentile34710
Maximum34712
Range210
Interquartile range (IQR)99

Descriptive statistics

Standard deviation62.790286
Coefficient of variation (CV)0.0018145069
Kurtosis-1.0904429
Mean34604.6
Median Absolute Deviation (MAD)45.5
Skewness0.38239635
Sum9343242
Variance3942.6201
MonotonicityNot monotonic
2023-12-12T11:44:42.270416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34565 9
 
3.3%
34712 8
 
3.0%
34706 7
 
2.6%
34705 6
 
2.2%
34535 6
 
2.2%
34547 6
 
2.2%
34564 6
 
2.2%
34577 5
 
1.9%
34623 5
 
1.9%
34710 5
 
1.9%
Other values (116) 207
76.7%
ValueCountFrequency (%)
34502 1
 
0.4%
34505 2
0.7%
34507 2
0.7%
34508 3
1.1%
34510 1
 
0.4%
34511 2
0.7%
34512 1
 
0.4%
34514 1
 
0.4%
34516 1
 
0.4%
34519 1
 
0.4%
ValueCountFrequency (%)
34712 8
3.0%
34711 4
1.5%
34710 5
1.9%
34708 3
 
1.1%
34706 7
2.6%
34705 6
2.2%
34704 1
 
0.4%
34703 2
 
0.7%
34702 1
 
0.4%
34700 2
 
0.7%
Distinct267
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T11:44:42.573577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length38
Mean length26.822222
Min length16

Characters and Unicode

Total characters7242
Distinct characters151
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique264 ?
Unique (%)97.8%

Sample

1st row대전광역시 동구 한밭대로1297번길 9 505호 (용전동)
2nd row대전광역시 동구 미래길 76 1층 101호 (소제동)
3rd row대전광역시 동구 대전천동로 710 3층 (삼성동)
4th row대전 동구 하소로 4(하소동)
5th row대전광역시 동구 태전로 52 (삼성동)
ValueCountFrequency (%)
동구 270
 
18.0%
대전광역시 269
 
17.9%
가양동 57
 
3.8%
삼성동 40
 
2.7%
1층 35
 
2.3%
용전동 20
 
1.3%
2층 20
 
1.3%
홍도동 20
 
1.3%
낭월동 17
 
1.1%
대전로 14
 
0.9%
Other values (419) 741
49.3%
2023-12-12T11:44:43.357904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1233
17.0%
592
 
8.2%
373
 
5.2%
355
 
4.9%
1 297
 
4.1%
275
 
3.8%
272
 
3.8%
) 271
 
3.7%
( 271
 
3.7%
269
 
3.7%
Other values (141) 3034
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4057
56.0%
Decimal Number 1290
 
17.8%
Space Separator 1233
 
17.0%
Close Punctuation 271
 
3.7%
Open Punctuation 271
 
3.7%
Dash Punctuation 64
 
0.9%
Other Punctuation 53
 
0.7%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
592
14.6%
373
 
9.2%
355
 
8.8%
275
 
6.8%
272
 
6.7%
269
 
6.6%
269
 
6.6%
249
 
6.1%
132
 
3.3%
110
 
2.7%
Other values (124) 1161
28.6%
Decimal Number
ValueCountFrequency (%)
1 297
23.0%
2 189
14.7%
5 126
9.8%
0 122
9.5%
4 118
 
9.1%
3 110
 
8.5%
6 104
 
8.1%
7 81
 
6.3%
8 74
 
5.7%
9 69
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 48
90.6%
5
 
9.4%
Space Separator
ValueCountFrequency (%)
1233
100.0%
Close Punctuation
ValueCountFrequency (%)
) 271
100.0%
Open Punctuation
ValueCountFrequency (%)
( 271
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4057
56.0%
Common 3182
43.9%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
592
14.6%
373
 
9.2%
355
 
8.8%
275
 
6.8%
272
 
6.7%
269
 
6.6%
269
 
6.6%
249
 
6.1%
132
 
3.3%
110
 
2.7%
Other values (124) 1161
28.6%
Common
ValueCountFrequency (%)
1233
38.7%
1 297
 
9.3%
) 271
 
8.5%
( 271
 
8.5%
2 189
 
5.9%
5 126
 
4.0%
0 122
 
3.8%
4 118
 
3.7%
3 110
 
3.5%
6 104
 
3.3%
Other values (6) 341
 
10.7%
Latin
ValueCountFrequency (%)
B 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4057
56.0%
ASCII 3180
43.9%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1233
38.8%
1 297
 
9.3%
) 271
 
8.5%
( 271
 
8.5%
2 189
 
5.9%
5 126
 
4.0%
0 122
 
3.8%
4 118
 
3.7%
3 110
 
3.5%
6 104
 
3.3%
Other values (6) 339
 
10.7%
Hangul
ValueCountFrequency (%)
592
14.6%
373
 
9.2%
355
 
8.8%
275
 
6.8%
272
 
6.7%
269
 
6.6%
269
 
6.6%
249
 
6.1%
132
 
3.3%
110
 
2.7%
Other values (124) 1161
28.6%
None
ValueCountFrequency (%)
5
100.0%

Interactions

2023-12-12T11:44:40.548676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:44:40.357759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:44:40.639625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:44:40.460859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:44:43.460001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종우편번호 (도로명주소)
연번1.0000.5400.398
업종0.5401.0000.384
우편번호 (도로명주소)0.3980.3841.000
2023-12-12T11:44:43.568264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호 (도로명주소)업종
연번1.0000.0160.199
우편번호 (도로명주소)0.0161.0000.122
업종0.1990.1221.000

Missing values

2023-12-12T11:44:40.753792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:44:40.853662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호업종우편번호 (도로명주소)영업소재지(도로명주소)
01(유)금영토건철근ㆍ콘크리트공사업, 지반조성ㆍ포장공사업, 시설물유지관리업, 금속창호ㆍ지붕건축물조립공사업, 도장ㆍ습식ㆍ방수ㆍ석공사업34540대전광역시 동구 한밭대로1297번길 9 505호 (용전동)
12(주)가보건설상ㆍ하수도설비공사업, 지반조성ㆍ포장공사업34605대전광역시 동구 미래길 76 1층 101호 (소제동)
23(주)거우금속창호ㆍ지붕건축물조립공사업34620대전광역시 동구 대전천동로 710 3층 (삼성동)
34(주)굿모닝기업실내건축공사업, 시설물유지관리업34712대전 동구 하소로 4(하소동)
45(주)금상특수건설시설물유지관리업34623대전광역시 동구 태전로 52 (삼성동)
56(주)금양건설시설물유지관리업34540대전광역시 동구 송촌남로11번길 50 도시형생활주택네오플러스 101호 (용전동)
67(주)기석건설상ㆍ하수도설비공사업, 도장ㆍ습식ㆍ방수ㆍ석공사업, 시설물유지관리업34685대전광역시 동구 옥천로96번길 96-77 (판암동)
78(주)다원아이디실내건축공사업, 시설물유지관리업34510대전광역시 동구 우암로312번길 54 1층 (가양동)
89(주)대덕따슴이가스난방공사업34703대전광역시 동구 곤룡로 68-12 (낭월동)
910(주)대림원조경식재ㆍ시설물공사업, 상ㆍ하수도설비공사업34547대전광역시 동구 계족로 440, 2층 204호 (용전동)
연번상호업종우편번호 (도로명주소)영업소재지(도로명주소)
260261현대설비가스난방공사업34547대전광역시 동구 계족로 466 (용전동)
261262현대이엔지가스난방공사업34678대전광역시 동구 판암로 73 (판암동)
262263현민건설주식회사철근ㆍ콘크리트공사업, 구조물해체ㆍ비계공사업34629대전광역시 동구 대전로 788-1 2층 (중동)
263264현민주식회사시설물유지관리업, 실내건축공사업, 금속창호ㆍ지붕건축물조립공사업, 상ㆍ하수도설비공사업34508대전광역시 동구 동대전로 262 (가양동)
264265현우설비산업가스난방공사업34632대전광역시 동구 대전로 696 2층 (인동)
265266형원건설산업(주)상ㆍ하수도설비공사업34535대전광역시 동구 흥룡로 42 (가양동)
266267형제난방공사가스난방공사업34532대전광역시 동구 흥룡로 23 (가양동)
267268혜성설비가스난방공사업34558대전광역시 동구 동서대로1601번길 21 (홍도동)
268269호원건설(주)도장ㆍ습식ㆍ방수ㆍ석공사업, 시설물유지관리업34680대전광역시 동구 신기로 84 (가오동) 602호
269270화목보일러종합전시장가스난방공사업34602대전광역시 동구 미화5길 1 (소제동)