Overview

Dataset statistics

Number of variables: 6
Number of observations: 719
Missing cells: 14
Missing cells (%): 0.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 34.5 KiB
Average record size in memory: 49.2 B
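
The percentage and per-record figures above follow from the raw counts. The sketch below reproduces that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes. The variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 6
n_observations = 719
missing_cells = 14
total_size_kib = 34.5  # rounded value as displayed in the report

# Total number of cells in the table.
total_cells = n_variables * n_observations

# Share of cells that are missing, in percent.
missing_pct = 100 * missing_cells / total_cells

# Approximate bytes per record. This can differ slightly from the
# reported 49.2 B because 34.5 KiB is itself a rounded figure.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(total_cells)
print(round(missing_pct, 1))
print(round(avg_record_bytes, 1))
```

The small gap between the recomputed record size and the displayed 49.2 B is consistent with rounding of the displayed total size.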

Variable types

Text: 3
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

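Description: Data on the status of specialty construction businesses in Gunsan City, Jeonbuk Special Self-Governing Province (전북특별자치도 군산시), providing company name, business type, postal code, company address, and company contact information.
Author: 전북특별자치도 군산시 (Gunsan City, Jeonbuk Special Self-Governing Province)
URL: https://www.data.go.kr/data/15126556/fileData.do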

Alerts

데이터기준일자 has constant value "2024-02-02" (Constant)
전화번호 has 13 (1.8%) missing values (Missing)

Reproduction

Analysis started: 2024-03-14 17:11:38.948896
Analysis finished: 2024-03-14 17:11:40.674321
Duration: 1.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업체명 (company name)
Text

Distinct: 484
Distinct (%): 67.3%
Missing: 0
Missing (%): 0.0%
Memory size: 5.7 KiB

Length

Max length: 16
Median length: 13
Mean length: 7.3532684
Min length: 2

Characters and Unicode

Total characters: 5287
Distinct characters: 264
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
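
As a rough illustration of how such per-category counts can be derived, the sketch below uses Python's standard `unicodedata` module to tally the general category of each character in one value from this dataset. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# A company name taken from the dataset sample.
counts = category_counts("(유)가람조경건설")

# "Lo" is Other Letter, "Ps" is Open Punctuation, "Pe" is Close Punctuation.
print(counts)
```

The two-letter codes correspond to the category names used in the tables of this report: Hangul syllables fall under Other Letter, while the parentheses are Open and Close Punctuation.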

Unique

Unique: 330
Unique (%): 45.9%

Sample

1st row: (유)가람조경건설
2nd row: (유)가탑엔지니어링
3rd row: (유)거상컨택
4th row: (유)거상컨택
5th row: (유)건진토건

Value | Count | Frequency (%)
주)조풍건설 | 9 | 1.2%
유)한성산기 | 6 | 0.8%
주)승주건설 | 6 | 0.8%
유)전일건설 | 5 | 0.7%
화성산업개발(주 | 5 | 0.7%
해전산업(주 | 4 | 0.6%
엘이오건설(주 | 4 | 0.6%
주)남산건설 | 4 | 0.6%
주)선암 | 4 | 0.6%
주식회사토림 | 4 | 0.6%
Other values (475) | 669 | 92.9%

Most occurring characters

ValueCountFrequency (%)
( 566
 
10.7%
) 566
 
10.7%
357
 
6.8%
326
 
6.2%
319
 
6.0%
310
 
5.9%
122
 
2.3%
111
 
2.1%
95
 
1.8%
87
 
1.6%
Other values (254) 2428
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4140
78.3%
Open Punctuation 566
 
10.7%
Close Punctuation 566
 
10.7%
Uppercase Letter 8
 
0.2%
Decimal Number 5
 
0.1%
Other Symbol 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
357
 
8.6%
326
 
7.9%
319
 
7.7%
310
 
7.5%
122
 
2.9%
111
 
2.7%
95
 
2.3%
87
 
2.1%
85
 
2.1%
76
 
1.8%
Other values (239) 2252
54.4%
Uppercase Letter
ValueCountFrequency (%)
C 1
12.5%
G 1
12.5%
N 1
12.5%
E 1
12.5%
R 1
12.5%
M 1
12.5%
T 1
12.5%
S 1
12.5%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
9 1
 
20.0%
2 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 566
100.0%
Close Punctuation
ValueCountFrequency (%)
) 566
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4141
78.3%
Common 1138
 
21.5%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
357
 
8.6%
326
 
7.9%
319
 
7.7%
310
 
7.5%
122
 
2.9%
111
 
2.7%
95
 
2.3%
87
 
2.1%
85
 
2.1%
76
 
1.8%
Other values (240) 2253
54.4%
Latin
ValueCountFrequency (%)
C 1
12.5%
G 1
12.5%
N 1
12.5%
E 1
12.5%
R 1
12.5%
M 1
12.5%
T 1
12.5%
S 1
12.5%
Common
ValueCountFrequency (%)
( 566
49.7%
) 566
49.7%
1 3
 
0.3%
9 1
 
0.1%
1
 
0.1%
2 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4140
78.3%
ASCII 1146
 
21.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 566
49.4%
) 566
49.4%
1 3
 
0.3%
C 1
 
0.1%
G 1
 
0.1%
N 1
 
0.1%
E 1
 
0.1%
9 1
 
0.1%
R 1
 
0.1%
1
 
0.1%
Other values (4) 4
 
0.3%
Hangul
ValueCountFrequency (%)
357
 
8.6%
326
 
7.9%
319
 
7.7%
310
 
7.5%
122
 
2.9%
111
 
2.7%
95
 
2.3%
87
 
2.1%
85
 
2.1%
76
 
1.8%
Other values (239) 2252
54.4%
None
ValueCountFrequency (%)
1
100.0%

업종 (business type)
Categorical

Distinct: 12
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 5.7 KiB

철근ㆍ콘크리트공사업: 114
기계가스설비공사업: 94
지반조성ㆍ포장공사업: 92
가스난방공사업: 83
상ㆍ하수도설비공사업: 70
Other values (7): 266

Length

Max length: 17
Median length: 13
Mean length: 10.261474
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 조경식재ㆍ시설물공사업
2nd row: 철강구조물공사업
3rd row: 상ㆍ하수도설비공사업
4th row: 철근ㆍ콘크리트공사업
5th row: 상ㆍ하수도설비공사업

Common Values

Value | Count | Frequency (%)
철근ㆍ콘크리트공사업 | 114 | 15.9%
기계가스설비공사업 | 94 | 13.1%
지반조성ㆍ포장공사업 | 92 | 12.8%
가스난방공사업 | 83 | 11.5%
상ㆍ하수도설비공사업 | 70 | 9.7%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 68 | 9.5%
구조물해체ㆍ비계공사업 | 48 | 6.7%
조경식재ㆍ시설물공사업 | 47 | 6.5%
도장ㆍ습식ㆍ방수ㆍ석공사업 | 39 | 5.4%
실내건축공사업 | 29 | 4.0%
Other values (2) | 35 | 4.9%

Length

Histogram of lengths of the category

우편번호 (postal code)
Real number (ℝ)

Distinct: 122
Distinct (%): 17.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 54088.954
Minimum: 54001
Maximum: 54178
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.4 KiB

Quantile statistics

Minimum: 54001
5-th percentile: 54003
Q1: 54045
Median: 54078
Q3: 54150
95-th percentile: 54172
Maximum: 54178
Range: 177
Interquartile range (IQR): 105

Descriptive statistics

Standard deviation: 57.586504
Coefficient of variation (CV): 0.0010646629
Kurtosis: -1.294735
Mean: 54088.954
Median Absolute Deviation (MAD): 51
Skewness: 0.063127461
Sum: 38889958
Variance: 3316.2054
Monotonicity: Not monotonic
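
Several of the quantile and descriptive statistics are simple functions of one another. The sketch below checks those relationships against the rounded values printed in this report, so agreement is only expected up to rounding. The variable names are illustrative.

```python
# Values copied from the statistics reported for this variable.
minimum, maximum = 54001, 54178
q1, q3 = 54045, 54150
mean, std = 54088.954, 57.586504
n_observations = 719

# Range and interquartile range are plain differences.
value_range = maximum - minimum
iqr = q3 - q1

# Variance is the square of the standard deviation.
variance = std ** 2

# Coefficient of variation is the standard deviation over the mean.
cv = std / mean

# The sum is the mean times the number of (non-missing) observations.
total = mean * n_observations

print(value_range, iqr)
print(round(variance, 4), round(cv, 10), round(total))
```

The very small coefficient of variation reflects that all postal codes share the same 54xxx prefix, so the spread is tiny relative to the magnitude of the values.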
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
54078 | 36 | 5.0%
54004 | 35 | 4.9%
54172 | 30 | 4.2%
54076 | 25 | 3.5%
54168 | 24 | 3.3%
54161 | 24 | 3.3%
54002 | 23 | 3.2%
54167 | 20 | 2.8%
54056 | 18 | 2.5%
54160 | 18 | 2.5%
Other values (112) | 466 | 64.8%

Value | Count | Frequency (%)
54001 | 8 | 1.1%
54002 | 23 | 3.2%
54003 | 6 | 0.8%
54004 | 35 | 4.9%
54005 | 4 | 0.6%
54006 | 1 | 0.1%
54007 | 10 | 1.4%
54008 | 6 | 0.8%
54009 | 6 | 0.8%
54010 | 1 | 0.1%

Value | Count | Frequency (%)
54178 | 1 | 0.1%
54177 | 3 | 0.4%
54176 | 9 | 1.3%
54175 | 5 | 0.7%
54172 | 30 | 4.2%
54170 | 1 | 0.1%
54169 | 2 | 0.3%
54168 | 24 | 3.3%
54167 | 20 | 2.8%
54166 | 9 | 1.3%

도로명주소 (road-name address)
Text

Distinct: 466
Distinct (%): 64.9%
Missing: 1
Missing (%): 0.1%
Memory size: 5.7 KiB

Length

Max length: 63
Median length: 47
Mean length: 27.479109
Min length: 11

Characters and Unicode

Total characters: 19730
Distinct characters: 218
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 312
Unique (%): 43.5%

Sample

1st row: 전북특별자치도 군산시 조촌안4길 23-8 (조촌동)
2nd row: 전북특별자치도 군산시 외항로 425 (산북동)
3rd row: 전북특별자치도 군산시 조촌안3길 12-5 2층 (조촌동)
4th row: 전북특별자치도 군산시 조촌안3길 12-5 2층 (조촌동)
5th row: 전북특별자치도 군산시 백릉3길 2 (경장동)

Value | Count | Frequency (%)
군산시 | 718 | 18.4%
전북특별자치도 | 717 | 18.4%
산북동 | 85 | 2.2%
조촌동 | 75 | 1.9%
오식도동 | 59 | 1.5%
소룡동 | 50 | 1.3%
2층 | 49 | 1.3%
나운동 | 40 | 1.0%
옥구읍 | 26 | 0.7%
공항로 | 26 | 0.7%
Other values (639) | 2047 | 52.6%

Most occurring characters

ValueCountFrequency (%)
3174
 
16.1%
898
 
4.6%
837
 
4.2%
801
 
4.1%
735
 
3.7%
728
 
3.7%
728
 
3.7%
723
 
3.7%
721
 
3.7%
717
 
3.6%
Other values (208) 9668
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12498
63.3%
Space Separator 3174
 
16.1%
Decimal Number 2664
 
13.5%
Open Punctuation 578
 
2.9%
Close Punctuation 578
 
2.9%
Dash Punctuation 139
 
0.7%
Other Punctuation 97
 
0.5%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
898
 
7.2%
837
 
6.7%
801
 
6.4%
735
 
5.9%
728
 
5.8%
728
 
5.8%
723
 
5.8%
721
 
5.8%
717
 
5.7%
717
 
5.7%
Other values (189) 4893
39.2%
Decimal Number
ValueCountFrequency (%)
1 578
21.7%
2 448
16.8%
3 346
13.0%
4 264
9.9%
0 243
9.1%
5 240
9.0%
6 165
 
6.2%
7 135
 
5.1%
8 130
 
4.9%
9 115
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 62
63.9%
. 26
26.8%
9
 
9.3%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
3174
100.0%
Open Punctuation
ValueCountFrequency (%)
( 578
100.0%
Close Punctuation
ValueCountFrequency (%)
) 578
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 139
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12498
63.3%
Common 7230
36.6%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
898
 
7.2%
837
 
6.7%
801
 
6.4%
735
 
5.9%
728
 
5.8%
728
 
5.8%
723
 
5.8%
721
 
5.8%
717
 
5.7%
717
 
5.7%
Other values (189) 4893
39.2%
Common
ValueCountFrequency (%)
3174
43.9%
( 578
 
8.0%
1 578
 
8.0%
) 578
 
8.0%
2 448
 
6.2%
3 346
 
4.8%
4 264
 
3.7%
0 243
 
3.4%
5 240
 
3.3%
6 165
 
2.3%
Other values (7) 616
 
8.5%
Latin
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12498
63.3%
ASCII 7223
36.6%
None 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3174
43.9%
( 578
 
8.0%
1 578
 
8.0%
) 578
 
8.0%
2 448
 
6.2%
3 346
 
4.8%
4 264
 
3.7%
0 243
 
3.4%
5 240
 
3.3%
6 165
 
2.3%
Other values (8) 609
 
8.4%
Hangul
ValueCountFrequency (%)
898
 
7.2%
837
 
6.7%
801
 
6.4%
735
 
5.9%
728
 
5.8%
728
 
5.8%
723
 
5.8%
721
 
5.8%
717
 
5.7%
717
 
5.7%
Other values (189) 4893
39.2%
None
ValueCountFrequency (%)
9
100.0%

전화번호 (phone number)
Text

MISSING 

Distinct: 462
Distinct (%): 65.4%
Missing: 13
Missing (%): 1.8%
Memory size: 5.7 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.035411
Min length: 12

Characters and Unicode

Total characters: 8497
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 306
Unique (%): 43.3%

Sample

1st row: 063-445-0500
2nd row: 070-7812-0808
3rd row: 070-7812-0808
4th row: 070-7812-0808
5th row: 063-253-2691
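
The sample values follow a hyphen-separated layout, and the report lists only digits and hyphens in this column. A format check along those lines could be sketched as below. The pattern and the helper `is_valid_phone` are assumptions made for illustration, inferred from the sample rows rather than taken from any specification.

```python
import re

# Hypothetical pattern inferred from the sample rows: an area or service
# prefix starting with 0 (2-3 digits), then 3-4 digits, then 4 digits.
PHONE_RE = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def is_valid_phone(value: str) -> bool:
    """Return True if `value` matches the assumed phone number layout."""
    return PHONE_RE.fullmatch(value) is not None


# Values taken from the sample rows of this variable.
samples = ["063-445-0500", "070-7812-0808", "063-253-2691"]
results = [is_valid_phone(s) for s in samples]
print(results)
```

A check like this would flag values that lose their hyphens or contain stray characters, while the 13 missing entries would need to be handled separately.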

Value | Count | Frequency (%)
063-465-3034 | 9 | 1.3%
063-467-4452 | 6 | 0.8%
063-441-9100 | 6 | 0.8%
063-451-4990 | 5 | 0.7%
063-454-8850 | 5 | 0.7%
063-463-4775 | 5 | 0.7%
063-466-9440 | 4 | 0.6%
063-452-6336 | 4 | 0.6%
063-717-0536 | 4 | 0.6%
063-466-8405 | 4 | 0.6%
Other values (452) | 654 | 92.6%

Most occurring characters

ValueCountFrequency (%)
- 1412
16.6%
6 1296
15.3%
0 1219
14.3%
4 1060
12.5%
3 1058
12.5%
5 557
 
6.6%
1 517
 
6.1%
7 429
 
5.0%
2 409
 
4.8%
8 294
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7085
83.4%
Dash Punctuation 1412
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 1296
18.3%
0 1219
17.2%
4 1060
15.0%
3 1058
14.9%
5 557
7.9%
1 517
 
7.3%
7 429
 
6.1%
2 409
 
5.8%
8 294
 
4.1%
9 246
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 1412
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8497
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1412
16.6%
6 1296
15.3%
0 1219
14.3%
4 1060
12.5%
3 1058
12.5%
5 557
 
6.6%
1 517
 
6.1%
7 429
 
5.0%
2 409
 
4.8%
8 294
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8497
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1412
16.6%
6 1296
15.3%
0 1219
14.3%
4 1060
12.5%
3 1058
12.5%
5 557
 
6.6%
1 517
 
6.1%
7 429
 
5.0%
2 409
 
4.8%
8 294
 
3.5%

데이터기준일자 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.7 KiB
Minimum: 2024-02-02 00:00:00
Maximum: 2024-02-02 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 업종 | 우편번호
업종 | 1.000 | 0.386
우편번호 | 0.386 | 1.000

Variable | 우편번호 | 업종
우편번호 | 1.000 | 0.174
업종 | 0.174 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Index | 업체명 | 업종 | 우편번호 | 도로명주소 | 전화번호 | 데이터기준일자
0 | (유)가람조경건설 | 조경식재ㆍ시설물공사업 | 54076 | 전북특별자치도 군산시 조촌안4길 23-8 (조촌동) | 063-445-0500 | 2024-02-02
1 | (유)가탑엔지니어링 | 철강구조물공사업 | 54159 | 전북특별자치도 군산시 외항로 425 (산북동) | 063-442-6996 | 2024-02-02
2 | (유)거상컨택 | 상ㆍ하수도설비공사업 | 54076 | 전북특별자치도 군산시 조촌안3길 12-5 2층 (조촌동) | 070-7812-0808 | 2024-02-02
3 | (유)거상컨택 | 철근ㆍ콘크리트공사업 | 54076 | 전북특별자치도 군산시 조촌안3길 12-5 2층 (조촌동) | 070-7812-0808 | 2024-02-02
4 | (유)건진토건 | 상ㆍ하수도설비공사업 | 54077 | 전북특별자치도 군산시 백릉3길 2 (경장동) | 063-253-2691 | 2024-02-02
5 | (유)건진토건 | 철근ㆍ콘크리트공사업 | 54077 | 전북특별자치도 군산시 백릉3길 2 (경장동) | 063-253-2691 | 2024-02-02
6 | (유)경림조경산업 | 조경식재ㆍ시설물공사업 | 54125 | 전북특별자치도 군산시 신설로 40-5 (나운동) | 063-631-2500 | 2024-02-02
7 | (유)경원산업 | 기계가스설비공사업 | 54048 | 전북특별자치도 군산시 나포면 십자들로 810 | 063-453-1851 | 2024-02-02
8 | (유)광성건설 | 철근ㆍ콘크리트공사업 | 54142 | 전북특별자치도 군산시 한밭1길 17 2층, 201호 (나운동) | 063-451-8751 | 2024-02-02
9 | (유)광성건설 | 상ㆍ하수도설비공사업 | 54142 | 전북특별자치도 군산시 한밭1길 17 2층, 201호 (나운동) | 063-451-8751 | 2024-02-02

Index | 업체명 | 업종 | 우편번호 | 도로명주소 | 전화번호 | 데이터기준일자
709 | 형제설비사 | 가스난방공사업 | 54120 | 전북특별자치도 군산시 월명로 339 1층 (서흥남동) | 063-462-1185 | 2024-02-02
710 | 호야홈텍(주) | 가스난방공사업 | 54004 | 전북특별자치도 군산시 외항로 937 (오식도동) | 063-467-9800 | 2024-02-02
711 | 화성산업개발(주) | 지반조성ㆍ포장공사업 | 54079 | 전북특별자치도 군산시 법원로 81 2층 (조촌동) | 063-454-8850 | 2024-02-02
712 | 화성산업개발(주) | 상ㆍ하수도설비공사업 | 54079 | 전북특별자치도 군산시 법원로 81 2층 (조촌동) | 063-454-8850 | 2024-02-02
713 | 화성산업개발(주) | 철근ㆍ콘크리트공사업 | 54079 | 전북특별자치도 군산시 법원로 81 2층 (조촌동) | 063-454-8850 | 2024-02-02
714 | 화성산업개발(주) | 수중ㆍ준설공사업 | 54079 | 전북특별자치도 군산시 법원로 81 2층 (조촌동) | 063-454-8850 | 2024-02-02
715 | 화성산업개발(주) | 구조물해체ㆍ비계공사업 | 54079 | 전북특별자치도 군산시 법원로 81 2층 (조촌동) | 063-454-8850 | 2024-02-02
716 | 회현설비 | 가스난방공사업 | 54177 | 전북특별자치도 군산시 회현면 회현로 177 | 063-466-5391 | 2024-02-02
717 | 효창건설(주) | 지반조성ㆍ포장공사업 | 54033 | 군산시 조촌로 135 | 063-443-1991 | 2024-02-02
718 | 희남건축 | 가스난방공사업 | 54114 | 전북특별자치도 군산시 대학로 86 (금광동) | 063-462-9869 | 2024-02-02