Overview

Dataset statistics

Number of variables15
Number of observations310
Missing cells179
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory37.1 KiB
Average record size in memory122.4 B

Variable types

Numeric2
Text6
Categorical5
DateTime2

Dataset

Description울산광역시의 종합건설업체 현황에 대한 데이터로 업체명, 대표자, 소재지, 업종, 정상 운영여부, 우편번호 등의 정보를 제공
URLhttps://www.data.go.kr/data/15047631/fileData.do

Alerts

업종상태 has constant value ""Constant
지역 has constant value ""Constant
업체현황 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
업종 is highly overall correlated with 업체현황High correlation
조직형태 is highly overall correlated with 업체현황High correlation
번호 is highly overall correlated with 업체현황High correlation
우편번호 is highly overall correlated with 업체현황High correlation
조직형태 is highly imbalanced (90.7%)Imbalance
갱신일자 has 173 (55.8%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:46:01.252609
Analysis finished2023-12-12 11:46:03.488510
Duration2.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct310
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean155.5
Minimum1
Maximum310
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2023-12-12T20:46:03.574216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile16.45
Q178.25
median155.5
Q3232.75
95-th percentile294.55
Maximum310
Range309
Interquartile range (IQR)154.5

Descriptive statistics

Standard deviation89.633513
Coefficient of variation (CV)0.57642131
Kurtosis-1.2
Mean155.5
Median Absolute Deviation (MAD)77.5
Skewness0
Sum48205
Variance8034.1667
MonotonicityStrictly increasing
2023-12-12T20:46:03.727612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
206 1
 
0.3%
213 1
 
0.3%
212 1
 
0.3%
211 1
 
0.3%
210 1
 
0.3%
209 1
 
0.3%
208 1
 
0.3%
207 1
 
0.3%
205 1
 
0.3%
Other values (300) 300
96.8%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
310 1
0.3%
309 1
0.3%
308 1
0.3%
307 1
0.3%
306 1
0.3%
305 1
0.3%
304 1
0.3%
303 1
0.3%
302 1
0.3%
301 1
0.3%
Distinct283
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T20:46:04.028029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9
Mean length8.283871
Min length5

Characters and Unicode

Total characters2568
Distinct characters188
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique259 ?
Unique (%)83.5%

Sample

1st row(유)이레건설
2nd row(유)화영종합건설
3rd row(주)갑인
4th row(주)거강건설
5th row(주)건진
ValueCountFrequency (%)
중산건설(주 3
 
1.0%
부강종합건설(주 3
 
1.0%
주)태성건설 3
 
1.0%
주)은풍건설 2
 
0.6%
주)대호토건 2
 
0.6%
성단종합건설(주 2
 
0.6%
주)대득건설 2
 
0.6%
배토건설(주 2
 
0.6%
주)서호건설 2
 
0.6%
주)조은아이건설 2
 
0.6%
Other values (273) 287
92.6%
2023-12-12T20:46:04.547141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
310
 
12.1%
( 296
 
11.5%
) 296
 
11.5%
228
 
8.9%
215
 
8.4%
147
 
5.7%
145
 
5.6%
34
 
1.3%
34
 
1.3%
25
 
1.0%
Other values (178) 838
32.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1968
76.6%
Open Punctuation 296
 
11.5%
Close Punctuation 296
 
11.5%
Other Symbol 5
 
0.2%
Uppercase Letter 2
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
310
 
15.8%
228
 
11.6%
215
 
10.9%
147
 
7.5%
145
 
7.4%
34
 
1.7%
34
 
1.7%
25
 
1.3%
25
 
1.3%
24
 
1.2%
Other values (173) 781
39.7%
Open Punctuation
ValueCountFrequency (%)
( 296
100.0%
Close Punctuation
ValueCountFrequency (%)
) 296
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1973
76.8%
Common 593
 
23.1%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
310
 
15.7%
228
 
11.6%
215
 
10.9%
147
 
7.5%
145
 
7.3%
34
 
1.7%
34
 
1.7%
25
 
1.3%
25
 
1.3%
24
 
1.2%
Other values (174) 786
39.8%
Common
ValueCountFrequency (%)
( 296
49.9%
) 296
49.9%
& 1
 
0.2%
Latin
ValueCountFrequency (%)
C 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1968
76.6%
ASCII 595
 
23.2%
None 5
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
310
 
15.8%
228
 
11.6%
215
 
10.9%
147
 
7.5%
145
 
7.4%
34
 
1.7%
34
 
1.7%
25
 
1.3%
25
 
1.3%
24
 
1.2%
Other values (173) 781
39.7%
ASCII
ValueCountFrequency (%)
( 296
49.7%
) 296
49.7%
C 2
 
0.3%
& 1
 
0.2%
None
ValueCountFrequency (%)
5
100.0%
Distinct276
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T20:46:04.965451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.1935484
Min length2

Characters and Unicode

Total characters990
Distinct characters151
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique248 ?
Unique (%)80.0%

Sample

1st row김인김형진
2nd row이미경
3rd row김홍윤
4th row서현석
5th row남완식
ValueCountFrequency (%)
김정수 4
 
1.3%
김재홍 3
 
1.0%
박성우신국재 3
 
1.0%
박숭호 3
 
1.0%
심정섭 3
 
1.0%
손찬영 2
 
0.6%
김호석 2
 
0.6%
원윤주 2
 
0.6%
박태원 2
 
0.6%
이정협 2
 
0.6%
Other values (266) 284
91.6%
2023-12-12T20:46:05.444175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
82
 
8.3%
45
 
4.5%
39
 
3.9%
36
 
3.6%
31
 
3.1%
27
 
2.7%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
Other values (141) 645
65.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 990
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
 
8.3%
45
 
4.5%
39
 
3.9%
36
 
3.6%
31
 
3.1%
27
 
2.7%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
Other values (141) 645
65.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 990
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
 
8.3%
45
 
4.5%
39
 
3.9%
36
 
3.6%
31
 
3.1%
27
 
2.7%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
Other values (141) 645
65.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 990
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
82
 
8.3%
45
 
4.5%
39
 
3.9%
36
 
3.6%
31
 
3.1%
27
 
2.7%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
Other values (141) 645
65.2%

조직형태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
주식회사
303 
회사법인
 
4
유한회사
 
2
개인
 
1

Length

Max length4
Median length4
Mean length3.9935484
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row유한회사
2nd row유한회사
3rd row주식회사
4th row주식회사
5th row주식회사

Common Values

ValueCountFrequency (%)
주식회사 303
97.7%
회사법인 4
 
1.3%
유한회사 2
 
0.6%
개인 1
 
0.3%

Length

2023-12-12T20:46:05.600811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:46:05.726535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주식회사 303
97.7%
회사법인 4
 
1.3%
유한회사 2
 
0.6%
개인 1
 
0.3%
Distinct93
Distinct (%)30.2%
Missing2
Missing (%)0.6%
Memory size2.6 KiB
2023-12-12T20:46:05.944427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length11.308442
Min length11

Characters and Unicode

Total characters3483
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)16.2%

Sample

1st row710000000 원
2nd row430000000 원
3rd row320000000 원
4th row700000000 원
5th row700000000 원
ValueCountFrequency (%)
308
50.0%
500000000 56
 
9.1%
350000000 20
 
3.2%
1200000000 19
 
3.1%
300000000 15
 
2.4%
700000000 13
 
2.1%
400000000 8
 
1.3%
600000000 8
 
1.3%
850000000 7
 
1.1%
950000000 7
 
1.1%
Other values (84) 155
25.2%
2023-12-12T20:46:06.300060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2301
66.1%
308
 
8.8%
308
 
8.8%
5 168
 
4.8%
1 118
 
3.4%
3 58
 
1.7%
2 58
 
1.7%
6 43
 
1.2%
7 39
 
1.1%
4 29
 
0.8%
Other values (2) 53
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2867
82.3%
Space Separator 308
 
8.8%
Other Letter 308
 
8.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2301
80.3%
5 168
 
5.9%
1 118
 
4.1%
3 58
 
2.0%
2 58
 
2.0%
6 43
 
1.5%
7 39
 
1.4%
4 29
 
1.0%
9 29
 
1.0%
8 24
 
0.8%
Space Separator
ValueCountFrequency (%)
308
100.0%
Other Letter
ValueCountFrequency (%)
308
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3175
91.2%
Hangul 308
 
8.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2301
72.5%
308
 
9.7%
5 168
 
5.3%
1 118
 
3.7%
3 58
 
1.8%
2 58
 
1.8%
6 43
 
1.4%
7 39
 
1.2%
4 29
 
0.9%
9 29
 
0.9%
Hangul
ValueCountFrequency (%)
308
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3175
91.2%
Hangul 308
 
8.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2301
72.5%
308
 
9.7%
5 168
 
5.3%
1 118
 
3.7%
3 58
 
1.8%
2 58
 
1.8%
6 43
 
1.4%
7 39
 
1.2%
4 29
 
0.9%
9 29
 
0.9%
Hangul
ValueCountFrequency (%)
308
100.0%

업종
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
건축공사업
172 
토목건축공사업
60 
토목공사업
50 
조경공사업
19 
산업ㆍ환경설비공사업
 
9

Length

Max length10
Median length5
Mean length5.5322581
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토목공사업
2nd row건축공사업
3rd row건축공사업
4th row토목공사업
5th row토목공사업

Common Values

ValueCountFrequency (%)
건축공사업 172
55.5%
토목건축공사업 60
 
19.4%
토목공사업 50
 
16.1%
조경공사업 19
 
6.1%
산업ㆍ환경설비공사업 9
 
2.9%

Length

2023-12-12T20:46:06.426022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:46:06.540529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건축공사업 172
55.5%
토목건축공사업 60
 
19.4%
토목공사업 50
 
16.1%
조경공사업 19
 
6.1%
산업ㆍ환경설비공사업 9
 
2.9%
Distinct294
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T20:46:06.840044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length6.8967742
Min length2

Characters and Unicode

Total characters2138
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique279 ?
Unique (%)90.0%

Sample

1st row15­0338
2nd row07­0378
3rd row07­0381
4th row07­0022
5th row07­0095
ValueCountFrequency (%)
07­0012 3
 
1.0%
07­0001 2
 
0.6%
07­0113 2
 
0.6%
07­0011 2
 
0.6%
07­0021 2
 
0.6%
07­0074 2
 
0.6%
70039 2
 
0.6%
07­0079 2
 
0.6%
07­0091 2
 
0.6%
07­0008 2
 
0.6%
Other values (284) 289
93.2%
2023-12-12T20:46:07.303113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 716
33.5%
7 332
15.5%
­ 292
13.7%
1 164
 
7.7%
3 142
 
6.6%
2 131
 
6.1%
4 89
 
4.2%
8 66
 
3.1%
9 63
 
2.9%
6 63
 
2.9%
Other values (7) 80
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1824
85.3%
Format 292
 
13.7%
Other Letter 22
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 716
39.3%
7 332
18.2%
1 164
 
9.0%
3 142
 
7.8%
2 131
 
7.2%
4 89
 
4.9%
8 66
 
3.6%
9 63
 
3.5%
6 63
 
3.5%
5 58
 
3.2%
Other Letter
ValueCountFrequency (%)
7
31.8%
6
27.3%
4
18.2%
3
13.6%
1
 
4.5%
1
 
4.5%
Format
ValueCountFrequency (%)
­ 292
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2116
99.0%
Hangul 22
 
1.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 716
33.8%
7 332
15.7%
­ 292
13.8%
1 164
 
7.8%
3 142
 
6.7%
2 131
 
6.2%
4 89
 
4.2%
8 66
 
3.1%
9 63
 
3.0%
6 63
 
3.0%
Hangul
ValueCountFrequency (%)
7
31.8%
6
27.3%
4
18.2%
3
13.6%
1
 
4.5%
1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1824
85.3%
None 292
 
13.7%
Hangul 22
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 716
39.3%
7 332
18.2%
1 164
 
9.0%
3 142
 
7.8%
2 131
 
7.2%
4 89
 
4.9%
8 66
 
3.6%
9 63
 
3.5%
6 63
 
3.5%
5 58
 
3.2%
None
ValueCountFrequency (%)
­ 292
100.0%
Hangul
ValueCountFrequency (%)
7
31.8%
6
27.3%
4
18.2%
3
13.6%
1
 
4.5%
1
 
4.5%
Distinct236
Distinct (%)76.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum1994-12-06 00:00:00
Maximum2023-02-08 00:00:00
2023-12-12T20:46:07.479439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:46:07.639042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업종상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
정상
310 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상
2nd row정상
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 310
100.0%

Length

2023-12-12T20:46:07.776474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:46:07.867028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 310
100.0%

갱신일자
Date

MISSING 

Distinct98
Distinct (%)71.5%
Missing173
Missing (%)55.8%
Memory size2.6 KiB
Minimum2015-03-12 00:00:00
Maximum2018-05-01 00:00:00
2023-12-12T20:46:07.967290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:46:08.115884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

지역
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
울산
310 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산
2nd row울산
3rd row울산
4th row울산
5th row울산

Common Values

ValueCountFrequency (%)
울산 310
100.0%

Length

2023-12-12T20:46:08.249683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:46:08.363239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산 310
100.0%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct151
Distinct (%)49.2%
Missing3
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean44720.586
Minimum44017
Maximum45015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2023-12-12T20:46:08.468187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum44017
5-th percentile44242.7
Q144545.5
median44717
Q344949.5
95-th percentile45011
Maximum45015
Range998
Interquartile range (IQR)404

Descriptive statistics

Standard deviation256.30176
Coefficient of variation (CV)0.0057311806
Kurtosis-0.45758409
Mean44720.586
Median Absolute Deviation (MAD)218
Skewness-0.7235671
Sum13729220
Variance65690.59
MonotonicityNot monotonic
2023-12-12T20:46:08.607023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
44985 11
 
3.5%
44254 8
 
2.6%
44925 7
 
2.3%
44976 7
 
2.3%
45013 7
 
2.3%
44936 6
 
1.9%
44924 5
 
1.6%
44696 5
 
1.6%
44928 5
 
1.6%
44690 5
 
1.6%
Other values (141) 241
77.7%
ValueCountFrequency (%)
44017 1
0.3%
44032 1
0.3%
44056 1
0.3%
44090 1
0.3%
44105 1
0.3%
44107 1
0.3%
44108 1
0.3%
44205 1
0.3%
44217 1
0.3%
44224 2
0.6%
ValueCountFrequency (%)
45015 2
 
0.6%
45014 3
1.0%
45013 7
2.3%
45012 1
 
0.3%
45011 4
1.3%
45006 1
 
0.3%
45003 2
 
0.6%
45002 1
 
0.3%
44998 1
 
0.3%
44996 2
 
0.6%
Distinct279
Distinct (%)90.3%
Missing1
Missing (%)0.3%
Memory size2.6 KiB
2023-12-12T20:46:09.024314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length41
Mean length26.721683
Min length19

Characters and Unicode

Total characters8257
Distinct characters231
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)81.9%

Sample

1st row울산광역시 동구 꽃바위로 216 2층 201호 (방어동)
2nd row울산광역시 북구 호계동 1020-1
3rd row울산광역시 중구 내황7길 65 202호(반구동 리버뷰상가)
4th row울산광역시 울주군 범서읍 구영로 166
5th row울산광역시 울주군 삼남읍 도호1길 23 401호
ValueCountFrequency (%)
울산광역시 309
 
17.9%
울주군 130
 
7.5%
남구 95
 
5.5%
중구 51
 
3.0%
범서읍 37
 
2.1%
2층 34
 
2.0%
북구 26
 
1.5%
3층 19
 
1.1%
온양읍 16
 
0.9%
청량읍 15
 
0.9%
Other values (569) 992
57.5%
2023-12-12T20:46:09.594485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1462
 
17.7%
441
 
5.3%
388
 
4.7%
313
 
3.8%
312
 
3.8%
309
 
3.7%
1 302
 
3.7%
2 266
 
3.2%
225
 
2.7%
195
 
2.4%
Other values (221) 4044
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4938
59.8%
Space Separator 1462
 
17.7%
Decimal Number 1403
 
17.0%
Open Punctuation 194
 
2.3%
Close Punctuation 194
 
2.3%
Dash Punctuation 66
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
441
 
8.9%
388
 
7.9%
313
 
6.3%
312
 
6.3%
309
 
6.3%
225
 
4.6%
195
 
3.9%
179
 
3.6%
175
 
3.5%
131
 
2.7%
Other values (207) 2270
46.0%
Decimal Number
ValueCountFrequency (%)
1 302
21.5%
2 266
19.0%
0 173
12.3%
3 167
11.9%
4 124
8.8%
6 92
 
6.6%
7 87
 
6.2%
5 74
 
5.3%
8 68
 
4.8%
9 50
 
3.6%
Space Separator
ValueCountFrequency (%)
1462
100.0%
Open Punctuation
ValueCountFrequency (%)
( 194
100.0%
Close Punctuation
ValueCountFrequency (%)
) 194
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4938
59.8%
Common 3319
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
441
 
8.9%
388
 
7.9%
313
 
6.3%
312
 
6.3%
309
 
6.3%
225
 
4.6%
195
 
3.9%
179
 
3.6%
175
 
3.5%
131
 
2.7%
Other values (207) 2270
46.0%
Common
ValueCountFrequency (%)
1462
44.0%
1 302
 
9.1%
2 266
 
8.0%
( 194
 
5.8%
) 194
 
5.8%
0 173
 
5.2%
3 167
 
5.0%
4 124
 
3.7%
6 92
 
2.8%
7 87
 
2.6%
Other values (4) 258
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4938
59.8%
ASCII 3319
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1462
44.0%
1 302
 
9.1%
2 266
 
8.0%
( 194
 
5.8%
) 194
 
5.8%
0 173
 
5.2%
3 167
 
5.0%
4 124
 
3.7%
6 92
 
2.8%
7 87
 
2.6%
Other values (4) 258
 
7.8%
Hangul
ValueCountFrequency (%)
441
 
8.9%
388
 
7.9%
313
 
6.3%
312
 
6.3%
309
 
6.3%
225
 
4.6%
195
 
3.9%
179
 
3.6%
175
 
3.5%
131
 
2.7%
Other values (207) 2270
46.0%
Distinct280
Distinct (%)90.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T20:46:09.854032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length42
Mean length24.96129
Min length15

Characters and Unicode

Total characters7738
Distinct characters205
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)81.6%

Sample

1st row울산광역시 동구 방어동 1022-1 2층 201호
2nd row울산광역시 북구 당수골25길 35(호계동)
3rd row울산광역시 중구 반구동 797-7 202호(리버뷰상가)
4th row울산광역시 울주군 범서읍 구영리 387-2
5th row울산광역시 울주군 삼남읍 신화리 1610-3 401호
ValueCountFrequency (%)
울산광역시 297
 
18.4%
울주군 131
 
8.1%
남구 95
 
5.9%
중구 51
 
3.2%
범서읍 37
 
2.3%
2층 37
 
2.3%
삼산동 28
 
1.7%
북구 26
 
1.6%
신정동 20
 
1.2%
3층 20
 
1.2%
Other values (511) 872
54.0%
2023-12-12T20:46:10.324645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1349
 
17.4%
442
 
5.7%
372
 
4.8%
1 368
 
4.8%
301
 
3.9%
297
 
3.8%
297
 
3.8%
- 241
 
3.1%
2 233
 
3.0%
222
 
2.9%
Other values (195) 3616
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4359
56.3%
Decimal Number 1673
 
21.6%
Space Separator 1349
 
17.4%
Dash Punctuation 241
 
3.1%
Close Punctuation 57
 
0.7%
Open Punctuation 57
 
0.7%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
442
 
10.1%
372
 
8.5%
301
 
6.9%
297
 
6.8%
297
 
6.8%
222
 
5.1%
212
 
4.9%
132
 
3.0%
131
 
3.0%
126
 
2.9%
Other values (179) 1827
41.9%
Decimal Number
ValueCountFrequency (%)
1 368
22.0%
2 233
13.9%
3 183
10.9%
0 180
10.8%
4 179
10.7%
6 134
 
8.0%
5 124
 
7.4%
7 97
 
5.8%
8 95
 
5.7%
9 80
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
1349
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 241
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4359
56.3%
Common 3377
43.6%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
442
 
10.1%
372
 
8.5%
301
 
6.9%
297
 
6.8%
297
 
6.8%
222
 
5.1%
212
 
4.9%
132
 
3.0%
131
 
3.0%
126
 
2.9%
Other values (179) 1827
41.9%
Common
ValueCountFrequency (%)
1349
39.9%
1 368
 
10.9%
- 241
 
7.1%
2 233
 
6.9%
3 183
 
5.4%
0 180
 
5.3%
4 179
 
5.3%
6 134
 
4.0%
5 124
 
3.7%
7 97
 
2.9%
Other values (4) 289
 
8.6%
Latin
ValueCountFrequency (%)
L 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4359
56.3%
ASCII 3379
43.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1349
39.9%
1 368
 
10.9%
- 241
 
7.1%
2 233
 
6.9%
3 183
 
5.4%
0 180
 
5.3%
4 179
 
5.3%
6 134
 
4.0%
5 124
 
3.7%
7 97
 
2.9%
Other values (6) 291
 
8.6%
Hangul
ValueCountFrequency (%)
442
 
10.1%
372
 
8.5%
301
 
6.9%
297
 
6.8%
297
 
6.8%
222
 
5.1%
212
 
4.9%
132
 
3.0%
131
 
3.0%
126
 
2.9%
Other values (179) 1827
41.9%

업체현황
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
<NA>
258 
이전처리 접수 완료
52 

Length

Max length10
Median length4
Mean length5.0064516
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이전처리 접수 완료
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 258
83.2%
이전처리 접수 완료 52
 
16.8%

Length

2023-12-12T20:46:10.492319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:46:10.644863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 258
62.3%
이전처리 52
 
12.6%
접수 52
 
12.6%
완료 52
 
12.6%

Interactions

2023-12-12T20:46:02.358352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:46:02.161562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:46:02.453530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:46:02.254111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:46:10.775519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호조직형태자본금업종갱신일자우편번호
번호1.0000.2220.6610.0000.8920.143
조직형태0.2221.0000.8560.1181.0000.191
자본금0.6610.8561.0000.8880.9760.756
업종0.0000.1180.8881.0000.0000.250
갱신일자0.8921.0000.9760.0001.0000.925
우편번호0.1430.1910.7560.2500.9251.000
2023-12-12T20:46:10.926132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체현황업종조직형태
업체현황1.0001.0001.000
업종1.0001.0000.096
조직형태1.0000.0961.000
2023-12-12T20:46:11.102459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호우편번호조직형태업종업체현황
번호1.000-0.0520.1330.0001.000
우편번호-0.0521.0000.0950.1021.000
조직형태0.1330.0951.0000.0961.000
업종0.0000.1020.0961.0001.000
업체현황1.0001.0001.0001.0001.000

Missing values

2023-12-12T20:46:02.634820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:46:03.237017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T20:46:03.415367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호업체명대표자조직형태자본금업종등록번호등록일자업종상태갱신일자지역우편번호도로명주소지번주소업체현황
01(유)이레건설김인김형진유한회사710000000 원토목공사업15­03382001-02-23정상2016-11-24울산44108울산광역시 동구 꽃바위로 216 2층 201호 (방어동)울산광역시 동구 방어동 1022-1 2층 201호이전처리 접수 완료
12(유)화영종합건설이미경유한회사430000000 원건축공사업07­03782022-01-01정상<NA>울산44225울산광역시 북구 호계동 1020-1울산광역시 북구 당수골25길 35(호계동)<NA>
23(주)갑인김홍윤주식회사320000000 원건축공사업07­03812022-01-01정상<NA>울산44505울산광역시 중구 내황7길 65 202호(반구동 리버뷰상가)울산광역시 중구 반구동 797-7 202호(리버뷰상가)<NA>
34(주)거강건설서현석주식회사700000000 원토목공사업07­00222000-09-26정상2016-01-15울산44925울산광역시 울주군 범서읍 구영로 166울산광역시 울주군 범서읍 구영리 387-2<NA>
45(주)건진남완식주식회사700000000 원토목공사업07­00952016-06-01정상<NA>울산44951울산광역시 울주군 삼남읍 도호1길 23 401호울산광역시 울주군 삼남읍 신화리 1610-3 401호<NA>
56(주)건호이엔씨정성호주식회사1510000000 원토목건축공사업07­00862018-04-23정상<NA>울산44783울산광역시 남구 용잠로74번길 48(부곡동)울산광역시 남구 부곡동 22-5<NA>
67(주)경도종합건설고병태주식회사500000000 원건축공사업07­3202017-07-07정상<NA>울산44686울산광역시 남구 팔등로 130 2층(신정동)울산광역시 남구 신정동 61 2층<NA>
78(주)경동이앤에스김경배주식회사9900000000 원산업ㆍ환경설비공사업07­00082013-05-10정상2016-06-29울산44780울산광역시 남구 장생포로 304 (매암동)울산광역시 남구 장생포로 304(매암동)<NA>
89(주)경동이앤에스김경배주식회사9900000000 원토목건축공사업07­00772014-05-07정상2016-06-29울산44780울산광역시 남구 장생포로 304 (매암동)울산광역시 남구 장생포로 304(매암동)<NA>
910(주)경동종합건설김성진주식회사350000000 원건축공사업07­03512020-09-11정상<NA>울산44713울산광역시 남구 삼산로301번길 8-16 4층(삼산동 고려빌딩)울산광역시 남구 삼산동 1565-5 4층<NA>
번호업체명대표자조직형태자본금업종등록번호등록일자업종상태갱신일자지역우편번호도로명주소지번주소업체현황
300301한원종합건설(주)이민우주식회사500000000 원건축공사업07­01772005-12-27정상2015-04-24울산44695울산광역시 남구 번영로 177 4층 (달동)울산광역시 남구 달동 572-1<NA>
301302해창종합건설(주)신재용주식회사550000000 원건축공사업07­00692001-04-20정상2016-12-09울산44687울산광역시 남구 번영로 195 (신정동 동문아뮤티) 501호울산광역시 남구 번영로 195 501호(신정동 동문아뮤티)<NA>
302303현대제이케이건설주식회사김옥희회사법인<NA>건축공사업07­04152022-05-30정상<NA>울산44924울산광역시 울주군 범서읍 점촌1길 18-6울산광역시 울주군 범서읍 구영리 724-8<NA>
303304현대중공업(주)한영석이상균주식회사353800000000 원산업ㆍ환경설비공사업261997-10-13정상2015-07-07울산44032울산광역시 동구 방어진순환도로 1000 (전하동) (전하동)울산광역시 동구 전하동 1 (전하동)<NA>
304305현림건설(주)박대영주식회사500000000 원건축공사업07­01372002-12-05정상2015-04-24울산<NA>울산광역시 남구 신정로126번길 7 (신정동)울산광역시 남구 신정로 126번길 7(신정동)<NA>
305306현중조경산업(주)박민규주식회사700000000 원조경공사업07­00142004-12-30정상2017-05-12울산44017울산광역시 동구 방어진순환도로 1035 지하1층 (서부동)울산광역시 동구 서부동 279-15 지하1층이전처리 접수 완료
306307호연엔지니어링(주)장용호주식회사875000000 원토목공사업07­01102021-09-28정상<NA>울산44983울산광역시 울주군 청량읍 삼정로 586 3층울산광역시 울주군 청량읍 덕하리 183-3 3층<NA>
307308홍재종합건설(주)이영섭주식회사500000000 원건축공사업07­01782006-04-28정상2015-09-16울산44417울산광역시 중구 성안10길 4 (성안동) 2층울산광역시 중구 성안10길 4 2층(성안동)<NA>
308309화안개발(주)박성언주식회사450000000 원토목공사업07­01272022-06-13정상<NA>울산44938울산광역시 울주군 언양읍 북문3길 5-6울산 울주군 언양읍 동부리 365-9<NA>
309310힐티엔지니어링(주)유교식회사법인500000000 원건축공사업07­03792022-01-01정상<NA>울산44420울산광역시 중구 성안2길 35 (성안동)울산광역시 중구 성안동 512-12<NA>