Overview

Dataset statistics

Number of variables7
Number of observations251
Missing cells12
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.3 KiB
Average record size in memory58.5 B

Variable types

Numeric2
Text3
Categorical1
DateTime1

Dataset

Description충청북도 단양군의 전문건설업 등록 현황 정보러 순번, 업체명, 업종, 우편번호 및 연락처, 주소, 데이터기준일 등의 항목을 포함하고 있음.
Author충청북도 단양군
URLhttps://www.data.go.kr/data/3071376/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
우편번호(도로명주소) has 4 (1.6%) missing valuesMissing
전화번호 has 8 (3.2%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:29:24.822837
Analysis finished2024-04-21 01:29:26.815151
Duration1.99 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct251
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126
Minimum1
Maximum251
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2024-04-21T10:29:26.949209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.5
Q163.5
median126
Q3188.5
95-th percentile238.5
Maximum251
Range250
Interquartile range (IQR)125

Descriptive statistics

Standard deviation72.601653
Coefficient of variation (CV)0.57620359
Kurtosis-1.2
Mean126
Median Absolute Deviation (MAD)63
Skewness0
Sum31626
Variance5271
MonotonicityStrictly increasing
2024-04-21T10:29:27.210898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
174 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
Other values (241) 241
96.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
251 1
0.4%
250 1
0.4%
249 1
0.4%
248 1
0.4%
247 1
0.4%
246 1
0.4%
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%

상호
Text

Distinct142
Distinct (%)56.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-04-21T10:29:27.993721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length7
Mean length7.250996
Min length4

Characters and Unicode

Total characters1820
Distinct characters147
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)27.1%

Sample

1st row(유)산수건설
2nd row(유)산수건설
3rd row(유)함성건설
4th row(주)강원기계건설
5th row(주)강원기계건설
ValueCountFrequency (%)
두원건설(주 4
 
1.6%
주)신단양건설 4
 
1.6%
한일건설(주 4
 
1.6%
삼풍건설(주 4
 
1.6%
대유건설(주 4
 
1.6%
주)계명 4
 
1.6%
주)삼덕건설 4
 
1.6%
태진건설(주 4
 
1.6%
온달건설(주 4
 
1.6%
하늘건설(주 3
 
1.2%
Other values (132) 212
84.5%
2024-04-21T10:29:29.024956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
219
 
12.0%
( 210
 
11.5%
) 210
 
11.5%
172
 
9.5%
169
 
9.3%
31
 
1.7%
26
 
1.4%
26
 
1.4%
25
 
1.4%
24
 
1.3%
Other values (137) 708
38.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1400
76.9%
Open Punctuation 210
 
11.5%
Close Punctuation 210
 
11.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
219
 
15.6%
172
 
12.3%
169
 
12.1%
31
 
2.2%
26
 
1.9%
26
 
1.9%
25
 
1.8%
24
 
1.7%
23
 
1.6%
22
 
1.6%
Other values (135) 663
47.4%
Open Punctuation
ValueCountFrequency (%)
( 210
100.0%
Close Punctuation
ValueCountFrequency (%)
) 210
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1400
76.9%
Common 420
 
23.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
219
 
15.6%
172
 
12.3%
169
 
12.1%
31
 
2.2%
26
 
1.9%
26
 
1.9%
25
 
1.8%
24
 
1.7%
23
 
1.6%
22
 
1.6%
Other values (135) 663
47.4%
Common
ValueCountFrequency (%)
( 210
50.0%
) 210
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1400
76.9%
ASCII 420
 
23.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
219
 
15.6%
172
 
12.3%
169
 
12.1%
31
 
2.2%
26
 
1.9%
26
 
1.9%
25
 
1.8%
24
 
1.7%
23
 
1.6%
22
 
1.6%
Other values (135) 663
47.4%
ASCII
ValueCountFrequency (%)
( 210
50.0%
) 210
50.0%

업종
Categorical

Distinct11
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
철근ㆍ콘크리트공사업
68 
도장ㆍ습식ㆍ방수ㆍ석공사업
56 
지반조성ㆍ포장공사업
31 
상ㆍ하수도설비공사업
24 
조경식재ㆍ시설물공사업
16 
Other values (6)
56 

Length

Max length15
Median length13
Mean length10.665339
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row철근ㆍ콘크리트공사업
2nd row상ㆍ하수도설비공사업
3rd row시설물유지관리업
4th row철근ㆍ콘크리트공사업
5th row도장ㆍ습식ㆍ방수ㆍ석공사업

Common Values

ValueCountFrequency (%)
철근ㆍ콘크리트공사업 68
27.1%
도장ㆍ습식ㆍ방수ㆍ석공사업 56
22.3%
지반조성ㆍ포장공사업 31
12.4%
상ㆍ하수도설비공사업 24
 
9.6%
조경식재ㆍ시설물공사업 16
 
6.4%
가스난방공사업 16
 
6.4%
금속창호ㆍ지붕건축물조립공사업 14
 
5.6%
시설물유지관리업 13
 
5.2%
실내건축공사업 5
 
2.0%
구조물해체ㆍ비계공사업 5
 
2.0%

Length

2024-04-21T10:29:29.271587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
철근ㆍ콘크리트공사업 68
27.1%
도장ㆍ습식ㆍ방수ㆍ석공사업 56
22.3%
지반조성ㆍ포장공사업 31
12.4%
상ㆍ하수도설비공사업 24
 
9.6%
조경식재ㆍ시설물공사업 16
 
6.4%
가스난방공사업 16
 
6.4%
금속창호ㆍ지붕건축물조립공사업 14
 
5.6%
시설물유지관리업 13
 
5.2%
실내건축공사업 5
 
2.0%
구조물해체ㆍ비계공사업 5
 
2.0%

우편번호(도로명주소)
Real number (ℝ)

MISSING 

Distinct17
Distinct (%)6.9%
Missing4
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean27010.356
Minimum27000
Maximum27027
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2024-04-21T10:29:29.482246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum27000
5-th percentile27002
Q127005
median27011
Q327013
95-th percentile27021
Maximum27027
Range27
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.9995419
Coefficient of variation (CV)0.00022212006
Kurtosis0.43620535
Mean27010.356
Median Absolute Deviation (MAD)5
Skewness0.64122973
Sum6671558
Variance35.994503
MonotonicityNot monotonic
2024-04-21T10:29:29.685391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
27013 56
22.3%
27005 44
17.5%
27010 27
10.8%
27018 23
9.2%
27012 15
 
6.0%
27011 14
 
5.6%
27006 12
 
4.8%
27004 12
 
4.8%
27027 9
 
3.6%
27000 9
 
3.6%
Other values (7) 26
10.4%
ValueCountFrequency (%)
27000 9
 
3.6%
27001 2
 
0.8%
27002 5
 
2.0%
27003 5
 
2.0%
27004 12
 
4.8%
27005 44
17.5%
27006 12
 
4.8%
27007 3
 
1.2%
27008 2
 
0.8%
27009 2
 
0.8%
ValueCountFrequency (%)
27027 9
 
3.6%
27021 7
 
2.8%
27018 23
9.2%
27013 56
22.3%
27012 15
 
6.0%
27011 14
 
5.6%
27010 27
10.8%
27009 2
 
0.8%
27008 2
 
0.8%
27007 3
 
1.2%
Distinct123
Distinct (%)49.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-04-21T10:29:30.501894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length22.163347
Min length18

Characters and Unicode

Total characters5563
Distinct characters86
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)22.3%

Sample

1st row충청북도 단양군 매포읍 도곡파랑로 586
2nd row충청북도 단양군 매포읍 도곡파랑로 586
3rd row충청북도 단양군 단양읍 삼봉로 163 1층
4th row충청북도 단양군 단양읍 단양로 940
5th row충청북도 단양군 단양읍 단양로 940
ValueCountFrequency (%)
단양군 251
18.6%
충청북도 246
18.2%
단양읍 124
 
9.2%
매포읍 85
 
6.3%
단양로 31
 
2.3%
영춘면 23
 
1.7%
삼봉로 21
 
1.6%
2층 15
 
1.1%
1층 14
 
1.0%
온달평강로 13
 
1.0%
Other values (171) 529
39.1%
2024-04-21T10:29:31.556809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1101
19.8%
412
 
7.4%
409
 
7.4%
271
 
4.9%
251
 
4.5%
251
 
4.5%
251
 
4.5%
246
 
4.4%
209
 
3.8%
1 207
 
3.7%
Other values (76) 1955
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3496
62.8%
Space Separator 1101
 
19.8%
Decimal Number 905
 
16.3%
Dash Punctuation 47
 
0.8%
Other Punctuation 7
 
0.1%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
412
11.8%
409
11.7%
271
 
7.8%
251
 
7.2%
251
 
7.2%
251
 
7.2%
246
 
7.0%
209
 
6.0%
145
 
4.1%
105
 
3.0%
Other values (59) 946
27.1%
Decimal Number
ValueCountFrequency (%)
1 207
22.9%
2 184
20.3%
3 95
10.5%
4 76
 
8.4%
9 74
 
8.2%
0 71
 
7.8%
6 59
 
6.5%
8 51
 
5.6%
7 46
 
5.1%
5 42
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 4
57.1%
3
42.9%
Space Separator
ValueCountFrequency (%)
1101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3496
62.8%
Common 2066
37.1%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
412
11.8%
409
11.7%
271
 
7.8%
251
 
7.2%
251
 
7.2%
251
 
7.2%
246
 
7.0%
209
 
6.0%
145
 
4.1%
105
 
3.0%
Other values (59) 946
27.1%
Common
ValueCountFrequency (%)
1101
53.3%
1 207
 
10.0%
2 184
 
8.9%
3 95
 
4.6%
4 76
 
3.7%
9 74
 
3.6%
0 71
 
3.4%
6 59
 
2.9%
8 51
 
2.5%
- 47
 
2.3%
Other values (6) 101
 
4.9%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3496
62.8%
ASCII 2064
37.1%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1101
53.3%
1 207
 
10.0%
2 184
 
8.9%
3 95
 
4.6%
4 76
 
3.7%
9 74
 
3.6%
0 71
 
3.4%
6 59
 
2.9%
8 51
 
2.5%
- 47
 
2.3%
Other values (6) 99
 
4.8%
Hangul
ValueCountFrequency (%)
412
11.8%
409
11.7%
271
 
7.8%
251
 
7.2%
251
 
7.2%
251
 
7.2%
246
 
7.0%
209
 
6.0%
145
 
4.1%
105
 
3.0%
Other values (59) 946
27.1%
None
ValueCountFrequency (%)
3
100.0%

전화번호
Text

MISSING 

Distinct130
Distinct (%)53.5%
Missing8
Missing (%)3.2%
Memory size2.1 KiB
2024-04-21T10:29:32.365528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.061728
Min length12

Characters and Unicode

Total characters2931
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)23.9%

Sample

1st row043-421-1726
2nd row043-421-1726
3rd row070-4773-1328
4th row043-422-5749
5th row043-422-5749
ValueCountFrequency (%)
043-423-9668 6
 
2.5%
043-423-2777 5
 
2.1%
043-422-3030 4
 
1.6%
043-422-3096 4
 
1.6%
043-423-7556 4
 
1.6%
043-423-2329 4
 
1.6%
043-422-3133 4
 
1.6%
043-421-7110 4
 
1.6%
043-421-1357 4
 
1.6%
043-423-5869 4
 
1.6%
Other values (120) 200
82.3%
2024-04-21T10:29:33.386960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 501
17.1%
- 486
16.6%
3 445
15.2%
0 412
14.1%
2 376
12.8%
1 172
 
5.9%
7 129
 
4.4%
5 121
 
4.1%
8 118
 
4.0%
9 90
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2445
83.4%
Dash Punctuation 486
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 501
20.5%
3 445
18.2%
0 412
16.9%
2 376
15.4%
1 172
 
7.0%
7 129
 
5.3%
5 121
 
4.9%
8 118
 
4.8%
9 90
 
3.7%
6 81
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 486
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2931
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 501
17.1%
- 486
16.6%
3 445
15.2%
0 412
14.1%
2 376
12.8%
1 172
 
5.9%
7 129
 
4.4%
5 121
 
4.1%
8 118
 
4.0%
9 90
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2931
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 501
17.1%
- 486
16.6%
3 445
15.2%
0 412
14.1%
2 376
12.8%
1 172
 
5.9%
7 129
 
4.4%
5 121
 
4.1%
8 118
 
4.0%
9 90
 
3.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2022-06-20 00:00:00
Maximum2022-06-20 00:00:00
2024-04-21T10:29:33.577128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:29:33.736992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-21T10:29:25.711863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:29:25.223941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:29:25.960071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:29:25.465308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:29:33.854325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종우편번호(도로명주소)
연번1.0000.1160.487
업종0.1161.0000.000
우편번호(도로명주소)0.4870.0001.000
2024-04-21T10:29:34.003752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호(도로명주소)업종
연번1.0000.1460.017
우편번호(도로명주소)0.1461.0000.031
업종0.0170.0311.000

Missing values

2024-04-21T10:29:26.158972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:29:26.362227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T10:29:26.735342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호업종우편번호(도로명주소)영업소재지(도로명주소)전화번호데이터기준일자
01(유)산수건설철근ㆍ콘크리트공사업27001충청북도 단양군 매포읍 도곡파랑로 586043-421-17262022-06-20
12(유)산수건설상ㆍ하수도설비공사업27001충청북도 단양군 매포읍 도곡파랑로 586043-421-17262022-06-20
23(유)함성건설시설물유지관리업27013충청북도 단양군 단양읍 삼봉로 163 1층070-4773-13282022-06-20
34(주)강원기계건설철근ㆍ콘크리트공사업27027충청북도 단양군 단양읍 단양로 940043-422-57492022-06-20
45(주)강원기계건설도장ㆍ습식ㆍ방수ㆍ석공사업27027충청북도 단양군 단양읍 단양로 940043-422-57492022-06-20
56(주)강원기계건설금속창호ㆍ지붕건축물조립공사업27027충청북도 단양군 단양읍 단양로 940043-422-57492022-06-20
67(주)계림건설철근ㆍ콘크리트공사업27005충청북도 단양군 매포읍 평동2로 21043-421-80882022-06-20
78(주)계림건설도장ㆍ습식ㆍ방수ㆍ석공사업27005충청북도 단양군 매포읍 평동2로 21043-421-80882022-06-20
89(주)계림건설시설물유지관리업27005충청북도 단양군 매포읍 평동2로 21043-421-80882022-06-20
910(주)계명지반조성ㆍ포장공사업27013충청북도 단양군 단양읍 상진로 32043-422-30302022-06-20
연번상호업종우편번호(도로명주소)영업소재지(도로명주소)전화번호데이터기준일자
241242하늘건설(주)금속창호ㆍ지붕건축물조립공사업27005충북 단양군 매포읍 단양로 1936070-7766-02032022-06-20
242243하늘건설(주)지반조성ㆍ포장공사업27005충북 단양군 매포읍 단양로 1936070-7766-02032022-06-20
243244한강건설(주)철근ㆍ콘크리트공사업27004충청북도 단양군 매포읍 단양로 2000 1층043-423-02332022-06-20
244245한강건설(주)지반조성ㆍ포장공사업27004충청북도 단양군 매포읍 단양로 2000 1층043-423-02332022-06-20
245246한일건설(주)도장ㆍ습식ㆍ방수ㆍ석공사업<NA>충청북도 단양군 단양읍 별곡7길 4, 부강아파트 상가 104호043-421-71102022-06-20
246247한일건설(주)철근ㆍ콘크리트공사업<NA>충청북도 단양군 단양읍 별곡7길 4, 부강아파트 상가 104호043-421-71102022-06-20
247248한일건설(주)상ㆍ하수도설비공사업<NA>충청북도 단양군 단양읍 별곡7길 4, 부강아파트 상가 104호043-421-71102022-06-20
248249한일건설(주)지반조성ㆍ포장공사업<NA>충청북도 단양군 단양읍 별곡7길 4, 부강아파트 상가 104호043-421-71102022-06-20
249250현대가스가스난방공사업27027충청북도 단양군 단양읍 단양로 939043-422-24632022-06-20
250251현대종합설비가스난방공사업27010충청북도 단양군 단양읍 도전12길 14043-423-07402022-06-20