Overview

Dataset statistics

Number of variables14
Number of observations2609
Missing cells2500
Missing cells (%)6.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory290.6 KiB
Average record size in memory114.1 B

Variable types

Text8
Numeric2
Categorical3
Boolean1

Dataset

Description순천시 지역 창업 생태계 데이터를 구축하여 지역기반 스타트업을 활성화하고자 순천시 기업 목록별 현황 을 제공합니다.
Author전라남도 순천시
URLhttps://www.data.go.kr/data/15111415/fileData.do

Alerts

기초지자체 is highly overall correlated with 우편번호 and 2 other fieldsHigh correlation
행정동 is highly overall correlated with 우편번호 and 2 other fieldsHigh correlation
시군구 is highly overall correlated with 우편번호 and 2 other fieldsHigh correlation
우편번호 is highly overall correlated with 기초지자체 and 2 other fieldsHigh correlation
기초지자체 is highly imbalanced (85.0%)Imbalance
시군구 is highly imbalanced (98.1%)Imbalance
본사여부 is highly imbalanced (78.9%)Imbalance
전화번호 has 816 (31.3%) missing valuesMissing
팩스번호 has 1587 (60.8%) missing valuesMissing
설립일자 has 90 (3.4%) missing valuesMissing
우편번호 is highly skewed (γ1 = -29.00371833)Skewed
관리번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:20:38.658802
Analysis finished2023-12-12 00:20:41.091783
Duration2.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리번호
Text

UNIQUE 

Distinct2609
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
2023-12-12T09:20:41.313588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length16
Min length16

Characters and Unicode

Total characters41744
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2609 ?
Unique (%)100.0%

Sample

1st rowREQ-003-01-00001
2nd rowREQ-003-01-00002
3rd rowREQ-003-01-00003
4th rowREQ-003-01-00004
5th rowREQ-003-01-00005
ValueCountFrequency (%)
req-003-01-00001 1
 
< 0.1%
req-003-01-01753 1
 
< 0.1%
req-003-01-01735 1
 
< 0.1%
req-003-01-01743 1
 
< 0.1%
req-003-01-01736 1
 
< 0.1%
req-003-01-01737 1
 
< 0.1%
req-003-01-01738 1
 
< 0.1%
req-003-01-01739 1
 
< 0.1%
req-003-01-01740 1
 
< 0.1%
req-003-01-01741 1
 
< 0.1%
Other values (2599) 2599
99.6%
2023-12-12T09:20:41.707646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 12263
29.4%
- 7827
18.8%
1 4430
 
10.6%
3 3430
 
8.2%
R 2609
 
6.2%
E 2609
 
6.2%
Q 2609
 
6.2%
2 1431
 
3.4%
4 821
 
2.0%
5 821
 
2.0%
Other values (4) 2894
 
6.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26090
62.5%
Dash Punctuation 7827
 
18.8%
Uppercase Letter 7827
 
18.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12263
47.0%
1 4430
 
17.0%
3 3430
 
13.1%
2 1431
 
5.5%
4 821
 
3.1%
5 821
 
3.1%
6 731
 
2.8%
7 721
 
2.8%
8 721
 
2.8%
9 721
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
R 2609
33.3%
E 2609
33.3%
Q 2609
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 7827
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 33917
81.2%
Latin 7827
 
18.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12263
36.2%
- 7827
23.1%
1 4430
 
13.1%
3 3430
 
10.1%
2 1431
 
4.2%
4 821
 
2.4%
5 821
 
2.4%
6 731
 
2.2%
7 721
 
2.1%
8 721
 
2.1%
Latin
ValueCountFrequency (%)
R 2609
33.3%
E 2609
33.3%
Q 2609
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41744
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12263
29.4%
- 7827
18.8%
1 4430
 
10.6%
3 3430
 
8.2%
R 2609
 
6.2%
E 2609
 
6.2%
Q 2609
 
6.2%
2 1431
 
3.4%
4 821
 
2.0%
5 821
 
2.0%
Other values (4) 2894
 
6.9%
Distinct2596
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
2023-12-12T09:20:41.932186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length17
Mean length8.0038329
Min length2

Characters and Unicode

Total characters20882
Distinct characters578
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2589 ?
Unique (%)99.2%

Sample

1st row(명)뉴삼우관광전세
2nd row(명)중앙택시
3rd row(명)한일전기
4th row(유)119농산
5th row(유)1급조은정비공업사
ValueCountFrequency (%)
농협은행(주 6
 
0.2%
주)농업회사법인 6
 
0.2%
농업회사법인 5
 
0.2%
주)광주은행 4
 
0.2%
킴스체인 2
 
0.1%
대성시스템(주 2
 
0.1%
유)에덴 2
 
0.1%
주)대경 2
 
0.1%
협동조합 2
 
0.1%
주)마루 2
 
0.1%
Other values (2595) 2597
98.7%
2023-12-12T09:20:42.282398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 2457
 
11.8%
) 2457
 
11.8%
2124
 
10.2%
472
 
2.3%
461
 
2.2%
327
 
1.6%
306
 
1.5%
304
 
1.5%
295
 
1.4%
260
 
1.2%
Other values (568) 11419
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15889
76.1%
Open Punctuation 2457
 
11.8%
Close Punctuation 2457
 
11.8%
Space Separator 51
 
0.2%
Decimal Number 17
 
0.1%
Other Punctuation 8
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2124
 
13.4%
472
 
3.0%
461
 
2.9%
327
 
2.1%
306
 
1.9%
304
 
1.9%
295
 
1.9%
260
 
1.6%
257
 
1.6%
253
 
1.6%
Other values (552) 10830
68.2%
Decimal Number
ValueCountFrequency (%)
1 8
47.1%
5 2
 
11.8%
0 1
 
5.9%
4 1
 
5.9%
9 1
 
5.9%
3 1
 
5.9%
8 1
 
5.9%
6 1
 
5.9%
2 1
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
S 1
33.3%
R 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 2457
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2457
100.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Other Punctuation
ValueCountFrequency (%)
. 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15889
76.1%
Common 4990
 
23.9%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2124
 
13.4%
472
 
3.0%
461
 
2.9%
327
 
2.1%
306
 
1.9%
304
 
1.9%
295
 
1.9%
260
 
1.6%
257
 
1.6%
253
 
1.6%
Other values (552) 10830
68.2%
Common
ValueCountFrequency (%)
( 2457
49.2%
) 2457
49.2%
51
 
1.0%
. 8
 
0.2%
1 8
 
0.2%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
9 1
 
< 0.1%
3 1
 
< 0.1%
Other values (3) 3
 
0.1%
Latin
ValueCountFrequency (%)
D 1
33.3%
S 1
33.3%
R 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15889
76.1%
ASCII 4993
 
23.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 2457
49.2%
) 2457
49.2%
51
 
1.0%
. 8
 
0.2%
1 8
 
0.2%
5 2
 
< 0.1%
0 1
 
< 0.1%
D 1
 
< 0.1%
S 1
 
< 0.1%
R 1
 
< 0.1%
Other values (6) 6
 
0.1%
Hangul
ValueCountFrequency (%)
2124
 
13.4%
472
 
3.0%
461
 
2.9%
327
 
2.1%
306
 
1.9%
304
 
1.9%
295
 
1.9%
260
 
1.6%
257
 
1.6%
253
 
1.6%
Other values (552) 10830
68.2%

사업자번호
Real number (ℝ)

Distinct2602
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4995467 × 109
Minimum4.1681762 × 108
Maximum8.9987014 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.1 KiB
2023-12-12T09:20:42.425147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.1681762 × 108
5-th percentile1.9124013 × 109
Q14.1681081 × 109
median4.1681669 × 109
Q34.3781008 × 109
95-th percentile8.0646006 × 109
Maximum8.9987014 × 109
Range8.5818838 × 109
Interquartile range (IQR)2.0999264 × 108

Descriptive statistics

Standard deviation1.5602912 × 109
Coefficient of variation (CV)0.34676632
Kurtosis1.2130143
Mean4.4995467 × 109
Median Absolute Deviation (MAD)9366598
Skewness0.8030977
Sum1.1739317 × 1013
Variance2.4345088 × 1018
MonotonicityNot monotonic
2023-12-12T09:20:42.574144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4168100937 2
 
0.1%
8348702120 2
 
0.1%
4098187824 2
 
0.1%
4168110231 2
 
0.1%
4168174332 2
 
0.1%
8078700630 2
 
0.1%
4388100878 2
 
0.1%
5378100796 1
 
< 0.1%
4168529155 1
 
< 0.1%
3668601427 1
 
< 0.1%
Other values (2592) 2592
99.3%
ValueCountFrequency (%)
416817624 1
< 0.1%
1018669422 1
< 0.1%
1028132035 1
< 0.1%
1028701553 1
< 0.1%
1048700604 1
< 0.1%
1058190048 1
< 0.1%
1058610001 1
< 0.1%
1078135022 1
< 0.1%
1078160405 1
< 0.1%
1078790575 1
< 0.1%
ValueCountFrequency (%)
8998701394 1
< 0.1%
8988101268 1
< 0.1%
8978100247 1
< 0.1%
8968600915 1
< 0.1%
8958802122 1
< 0.1%
8958700146 1
< 0.1%
8948701180 1
< 0.1%
8948501129 1
< 0.1%
8938601273 1
< 0.1%
8938100599 1
< 0.1%
Distinct2325
Distinct (%)89.1%
Missing1
Missing (%)< 0.1%
Memory size20.5 KiB
2023-12-12T09:20:42.902832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length3
Mean length3.1564417
Min length2

Characters and Unicode

Total characters8232
Distinct characters261
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2098 ?
Unique (%)80.4%

Sample

1st row박금수
2nd row이연심
3rd row유상준
4th row황아름
5th row김무현
ValueCountFrequency (%)
박선식 6
 
0.2%
권준학 6
 
0.2%
김정숙 4
 
0.2%
정영균 4
 
0.2%
박화수 4
 
0.2%
정대식 4
 
0.2%
송종욱 4
 
0.2%
김광웅 4
 
0.2%
권상웅 4
 
0.2%
임병진 4
 
0.2%
Other values (2315) 2564
98.3%
2023-12-12T09:20:43.338108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
596
 
7.2%
349
 
4.2%
342
 
4.2%
248
 
3.0%
193
 
2.3%
164
 
2.0%
157
 
1.9%
134
 
1.6%
125
 
1.5%
118
 
1.4%
Other values (251) 5806
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8080
98.2%
Other Punctuation 80
 
1.0%
Space Separator 72
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
596
 
7.4%
349
 
4.3%
342
 
4.2%
248
 
3.1%
193
 
2.4%
164
 
2.0%
157
 
1.9%
134
 
1.7%
125
 
1.5%
118
 
1.5%
Other values (249) 5654
70.0%
Other Punctuation
ValueCountFrequency (%)
/ 80
100.0%
Space Separator
ValueCountFrequency (%)
72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8080
98.2%
Common 152
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
596
 
7.4%
349
 
4.3%
342
 
4.2%
248
 
3.1%
193
 
2.4%
164
 
2.0%
157
 
1.9%
134
 
1.7%
125
 
1.5%
118
 
1.5%
Other values (249) 5654
70.0%
Common
ValueCountFrequency (%)
/ 80
52.6%
72
47.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8080
98.2%
ASCII 152
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
596
 
7.4%
349
 
4.3%
342
 
4.2%
248
 
3.1%
193
 
2.4%
164
 
2.0%
157
 
1.9%
134
 
1.7%
125
 
1.5%
118
 
1.5%
Other values (249) 5654
70.0%
ASCII
ValueCountFrequency (%)
/ 80
52.6%
72
47.4%

전화번호
Text

MISSING 

Distinct1687
Distinct (%)94.1%
Missing816
Missing (%)31.3%
Memory size20.5 KiB
2023-12-12T09:20:43.625202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.063023
Min length9

Characters and Unicode

Total characters21629
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1599 ?
Unique (%)89.2%

Sample

1st row061-742-5200
2nd row061-754-4500
3rd row061-745-0909
4th row061-723-1192
5th row061-723-5123
ValueCountFrequency (%)
061-722-3777 5
 
0.3%
061-751-8800 5
 
0.3%
061-743-6000 4
 
0.2%
061-723-6200 4
 
0.2%
061-743-8404 3
 
0.2%
061-725-2790 3
 
0.2%
061-723-5320 3
 
0.2%
061-727-9500 3
 
0.2%
061-726-7513 3
 
0.2%
061-722-0881 3
 
0.2%
Other values (1677) 1757
98.0%
2023-12-12T09:20:43.995372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3606
16.7%
0 3238
15.0%
1 2842
13.1%
6 2481
11.5%
7 2425
11.2%
2 1697
7.8%
5 1613
7.5%
4 1320
 
6.1%
3 1029
 
4.8%
8 753
 
3.5%
Other values (2) 625
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18022
83.3%
Dash Punctuation 3606
 
16.7%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3238
18.0%
1 2842
15.8%
6 2481
13.8%
7 2425
13.5%
2 1697
9.4%
5 1613
9.0%
4 1320
7.3%
3 1029
 
5.7%
8 753
 
4.2%
9 624
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 3606
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 21629
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3606
16.7%
0 3238
15.0%
1 2842
13.1%
6 2481
11.5%
7 2425
11.2%
2 1697
7.8%
5 1613
7.5%
4 1320
 
6.1%
3 1029
 
4.8%
8 753
 
3.5%
Other values (2) 625
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 21629
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3606
16.7%
0 3238
15.0%
1 2842
13.1%
6 2481
11.5%
7 2425
11.2%
2 1697
7.8%
5 1613
7.5%
4 1320
 
6.1%
3 1029
 
4.8%
8 753
 
3.5%
Other values (2) 625
 
2.9%

팩스번호
Text

MISSING 

Distinct951
Distinct (%)93.1%
Missing1587
Missing (%)60.8%
Memory size20.5 KiB
2023-12-12T09:20:44.234279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.035225
Min length11

Characters and Unicode

Total characters12300
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique893 ?
Unique (%)87.4%

Sample

1st row061-742-6662
2nd row061-741-1199
3rd row061-746-0103
4th row061-724-6262
5th row061-724-6262
ValueCountFrequency (%)
061-743-8400 3
 
0.3%
061-723-7010 3
 
0.3%
061-751-5134 3
 
0.3%
061-744-6227 3
 
0.3%
0303-0958-6007 3
 
0.3%
061-725-6551 3
 
0.3%
061-910-7100 3
 
0.3%
061-751-5116 3
 
0.3%
062-673-3957 3
 
0.3%
061-743-3335 3
 
0.3%
Other values (941) 992
97.1%
2023-12-12T09:20:44.652889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2044
16.6%
0 1627
13.2%
1 1568
12.7%
6 1493
12.1%
7 1337
10.9%
2 993
8.1%
5 901
7.3%
4 749
 
6.1%
3 670
 
5.4%
9 461
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10256
83.4%
Dash Punctuation 2044
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1627
15.9%
1 1568
15.3%
6 1493
14.6%
7 1337
13.0%
2 993
9.7%
5 901
8.8%
4 749
7.3%
3 670
6.5%
9 461
 
4.5%
8 457
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 2044
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12300
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2044
16.6%
0 1627
13.2%
1 1568
12.7%
6 1493
12.1%
7 1337
10.9%
2 993
8.1%
5 901
7.3%
4 749
 
6.1%
3 670
 
5.4%
9 461
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12300
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2044
16.6%
0 1627
13.2%
1 1568
12.7%
6 1493
12.1%
7 1337
10.9%
2 993
8.1%
5 901
7.3%
4 749
 
6.1%
3 670
 
5.4%
9 461
 
3.7%

기초지자체
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct11
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
순천시
2344 
전라남도
251 
<NA>
 
4
음성군
 
2
서울시
 
2
Other values (6)
 
6

Length

Max length5
Median length3
Mean length3.0992718
Min length3

Unique

Unique6 ?
Unique (%)0.2%

Sample

1st row순천시
2nd row순천시
3rd row순천시
4th row순천시
5th row순천시

Common Values

ValueCountFrequency (%)
순천시 2344
89.8%
전라남도 251
 
9.6%
<NA> 4
 
0.2%
음성군 2
 
0.1%
서울시 2
 
0.1%
나주시 1
 
< 0.1%
광주특별시 1
 
< 0.1%
광양시 1
 
< 0.1%
대전시 1
 
< 0.1%
제주시 1
 
< 0.1%

Length

2023-12-12T09:20:44.803128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
순천시 2344
89.8%
전라남도 251
 
9.6%
na 4
 
0.2%
음성군 2
 
0.1%
서울시 2
 
0.1%
나주시 1
 
< 0.1%
광주특별시 1
 
< 0.1%
광양시 1
 
< 0.1%
대전시 1
 
< 0.1%
제주시 1
 
< 0.1%

시군구
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct10
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
순천시
2595 
<NA>
 
4
음성군
 
2
서울시
 
2
나주시
 
1
Other values (5)
 
5

Length

Max length5
Median length3
Mean length3.0019164
Min length2

Unique

Unique6 ?
Unique (%)0.2%

Sample

1st row순천시
2nd row순천시
3rd row순천시
4th row순천시
5th row순천시

Common Values

ValueCountFrequency (%)
순천시 2595
99.5%
<NA> 4
 
0.2%
음성군 2
 
0.1%
서울시 2
 
0.1%
나주시 1
 
< 0.1%
북구 1
 
< 0.1%
광양시 1
 
< 0.1%
대전시 1
 
< 0.1%
제주시 1
 
< 0.1%
광주광역시 1
 
< 0.1%

Length

2023-12-12T09:20:44.949542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:20:45.065313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
순천시 2595
99.5%
na 4
 
0.2%
음성군 2
 
0.1%
서울시 2
 
0.1%
나주시 1
 
< 0.1%
북구 1
 
< 0.1%
광양시 1
 
< 0.1%
대전시 1
 
< 0.1%
제주시 1
 
< 0.1%
광주광역시 1
 
< 0.1%
Distinct142
Distinct (%)5.5%
Missing6
Missing (%)0.2%
Memory size20.5 KiB
2023-12-12T09:20:45.315272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9504418
Min length2

Characters and Unicode

Total characters7680
Distinct characters116
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)1.7%

Sample

1st row봉림리
2nd row광천리
3rd row풍덕동
4th row남가리
5th row조례동
ValueCountFrequency (%)
연향동 332
 
12.8%
해룡면 264
 
10.1%
조례동 249
 
9.6%
가곡동 152
 
5.8%
서면 123
 
4.7%
풍덕동 108
 
4.1%
장천동 90
 
3.5%
왕지동 86
 
3.3%
별량면 55
 
2.1%
신대리 53
 
2.0%
Other values (132) 1091
41.9%
2023-12-12T09:20:45.716865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1569
20.4%
545
 
7.1%
535
 
7.0%
334
 
4.3%
332
 
4.3%
286
 
3.7%
284
 
3.7%
268
 
3.5%
259
 
3.4%
249
 
3.2%
Other values (106) 3019
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7677
> 99.9%
Decimal Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1569
20.4%
545
 
7.1%
535
 
7.0%
334
 
4.4%
332
 
4.3%
286
 
3.7%
284
 
3.7%
268
 
3.5%
259
 
3.4%
249
 
3.2%
Other values (104) 3016
39.3%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7677
> 99.9%
Common 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1569
20.4%
545
 
7.1%
535
 
7.0%
334
 
4.4%
332
 
4.3%
286
 
3.7%
284
 
3.7%
268
 
3.5%
259
 
3.4%
249
 
3.2%
Other values (104) 3016
39.3%
Common
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7677
> 99.9%
ASCII 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1569
20.4%
545
 
7.1%
535
 
7.0%
334
 
4.4%
332
 
4.3%
286
 
3.7%
284
 
3.7%
268
 
3.5%
259
 
3.4%
249
 
3.2%
Other values (104) 3016
39.3%
ASCII
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%

행정동
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
해룡면
555 
덕연동
391 
서면
250 
삼산동
220 
왕조1동
193 
Other values (41)
1000 

Length

Max length5
Median length3
Mean length3
Min length2

Unique

Unique17 ?
Unique (%)0.7%

Sample

1st row별량면
2nd row주암면
3rd row풍덕동
4th row해룡면
5th row왕조1동

Common Values

ValueCountFrequency (%)
해룡면 555
21.3%
덕연동 391
15.0%
서면 250
9.6%
삼산동 220
 
8.4%
왕조1동 193
 
7.4%
도사동 126
 
4.8%
별량면 108
 
4.1%
풍덕동 107
 
4.1%
장천동 89
 
3.4%
중앙동 69
 
2.6%
Other values (36) 501
19.2%

Length

2023-12-12T09:20:45.875254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해룡면 555
21.3%
덕연동 391
15.0%
서면 250
9.6%
삼산동 220
 
8.4%
왕조1동 193
 
7.4%
도사동 126
 
4.8%
별량면 108
 
4.1%
풍덕동 107
 
4.1%
장천동 89
 
3.4%
중앙동 69
 
2.6%
Other values (36) 503
19.3%

우편번호
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct134
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57906.456
Minimum2465
Maximum58035
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.1 KiB
2023-12-12T09:20:46.004825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2465
5-th percentile57904
Q157931
median57965
Q358005
95-th percentile58033
Maximum58035
Range55570
Interquartile range (IQR)74

Descriptive statistics

Standard deviation1671.4203
Coefficient of variation (CV)0.028864145
Kurtosis881.57125
Mean57906.456
Median Absolute Deviation (MAD)38
Skewness-29.003718
Sum1.5107794 × 108
Variance2793645.9
MonotonicityNot monotonic
2023-12-12T09:20:46.145618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
58005 99
 
3.8%
57956 83
 
3.2%
57924 80
 
3.1%
58034 77
 
3.0%
57903 76
 
2.9%
57923 66
 
2.5%
58024 66
 
2.5%
57932 64
 
2.5%
57942 54
 
2.1%
58023 51
 
2.0%
Other values (124) 1893
72.6%
ValueCountFrequency (%)
2465 1
 
< 0.1%
5836 1
 
< 0.1%
27644 1
 
< 0.1%
34025 1
 
< 0.1%
57717 1
 
< 0.1%
57900 3
 
0.1%
57901 20
 
0.8%
57902 14
 
0.5%
57903 76
2.9%
57904 27
 
1.0%
ValueCountFrequency (%)
58035 28
 
1.1%
58034 77
3.0%
58033 31
1.2%
58032 17
 
0.7%
58031 23
 
0.9%
58030 23
 
0.9%
58029 13
 
0.5%
58028 31
1.2%
58027 25
 
1.0%
58026 25
 
1.0%
Distinct2213
Distinct (%)84.8%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
2023-12-12T09:20:46.526971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length44
Mean length18.782675
Min length1

Characters and Unicode

Total characters49004
Distinct characters287
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1928 ?
Unique (%)73.9%

Sample

1st row전남 순천시 별량면 녹색로 964
2nd row전남 순천시 주암면 동주로 2041
3rd row전남 순천시 팔마로 57
4th row전남 순천시 해룡면 서가길 15
5th row전남 순천시 신월길 103
ValueCountFrequency (%)
전남 2603
20.5%
순천시 2601
20.5%
해룡면 561
 
4.4%
서면 249
 
2.0%
2층 221
 
1.7%
1층 138
 
1.1%
별량면 108
 
0.9%
중앙로 96
 
0.8%
3층 68
 
0.5%
주암면 67
 
0.5%
Other values (1584) 5956
47.0%
2023-12-12T09:20:47.354211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11333
23.1%
2721
 
5.6%
2719
 
5.5%
2700
 
5.5%
2678
 
5.5%
2618
 
5.3%
1 2164
 
4.4%
2 1620
 
3.3%
1574
 
3.2%
1076
 
2.2%
Other values (277) 17801
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26866
54.8%
Space Separator 11333
23.1%
Decimal Number 9316
 
19.0%
Other Punctuation 875
 
1.8%
Dash Punctuation 579
 
1.2%
Open Punctuation 13
 
< 0.1%
Close Punctuation 12
 
< 0.1%
Uppercase Letter 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2721
 
10.1%
2719
 
10.1%
2700
 
10.0%
2678
 
10.0%
2618
 
9.7%
1574
 
5.9%
1076
 
4.0%
1041
 
3.9%
707
 
2.6%
688
 
2.6%
Other values (256) 8344
31.1%
Decimal Number
ValueCountFrequency (%)
1 2164
23.2%
2 1620
17.4%
3 1076
11.6%
4 884
9.5%
0 858
 
9.2%
5 707
 
7.6%
6 600
 
6.4%
7 488
 
5.2%
9 475
 
5.1%
8 444
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
A 4
40.0%
B 4
40.0%
E 1
 
10.0%
C 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 873
99.8%
. 1
 
0.1%
/ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
11333
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 579
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26866
54.8%
Common 22128
45.2%
Latin 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2721
 
10.1%
2719
 
10.1%
2700
 
10.0%
2678
 
10.0%
2618
 
9.7%
1574
 
5.9%
1076
 
4.0%
1041
 
3.9%
707
 
2.6%
688
 
2.6%
Other values (256) 8344
31.1%
Common
ValueCountFrequency (%)
11333
51.2%
1 2164
 
9.8%
2 1620
 
7.3%
3 1076
 
4.9%
4 884
 
4.0%
, 873
 
3.9%
0 858
 
3.9%
5 707
 
3.2%
6 600
 
2.7%
- 579
 
2.6%
Other values (7) 1434
 
6.5%
Latin
ValueCountFrequency (%)
A 4
40.0%
B 4
40.0%
E 1
 
10.0%
C 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26866
54.8%
ASCII 22138
45.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11333
51.2%
1 2164
 
9.8%
2 1620
 
7.3%
3 1076
 
4.9%
4 884
 
4.0%
, 873
 
3.9%
0 858
 
3.9%
5 707
 
3.2%
6 600
 
2.7%
- 579
 
2.6%
Other values (11) 1444
 
6.5%
Hangul
ValueCountFrequency (%)
2721
 
10.1%
2719
 
10.1%
2700
 
10.0%
2678
 
10.0%
2618
 
9.7%
1574
 
5.9%
1076
 
4.0%
1041
 
3.9%
707
 
2.6%
688
 
2.6%
Other values (256) 8344
31.1%

설립일자
Text

MISSING 

Distinct2042
Distinct (%)81.1%
Missing90
Missing (%)3.4%
Memory size20.5 KiB
2023-12-12T09:20:47.624500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters25190
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1659 ?
Unique (%)65.9%

Sample

1st row1991-08-01
2nd row1988-08-01
3rd row1995-07-01
4th row2018-08-16
5th row2015-11-16
ValueCountFrequency (%)
2012-03-02 6
 
0.2%
2009-07-01 5
 
0.2%
1996-10-01 4
 
0.2%
2014-11-26 4
 
0.2%
2013-08-09 4
 
0.2%
2017-06-08 4
 
0.2%
2017-04-25 4
 
0.2%
2017-02-21 4
 
0.2%
2005-09-01 4
 
0.2%
2014-05-08 4
 
0.2%
Other values (2032) 2476
98.3%
2023-12-12T09:20:48.088063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6265
24.9%
- 5038
20.0%
1 4202
16.7%
2 4051
16.1%
9 1260
 
5.0%
7 765
 
3.0%
8 754
 
3.0%
3 747
 
3.0%
4 740
 
2.9%
5 703
 
2.8%
Other values (2) 665
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20151
80.0%
Dash Punctuation 5038
 
20.0%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6265
31.1%
1 4202
20.9%
2 4051
20.1%
9 1260
 
6.3%
7 765
 
3.8%
8 754
 
3.7%
3 747
 
3.7%
4 740
 
3.7%
5 703
 
3.5%
6 664
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 5038
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25190
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6265
24.9%
- 5038
20.0%
1 4202
16.7%
2 4051
16.1%
9 1260
 
5.0%
7 765
 
3.0%
8 754
 
3.0%
3 747
 
3.0%
4 740
 
2.9%
5 703
 
2.8%
Other values (2) 665
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25190
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6265
24.9%
- 5038
20.0%
1 4202
16.7%
2 4051
16.1%
9 1260
 
5.0%
7 765
 
3.0%
8 754
 
3.0%
3 747
 
3.0%
4 740
 
2.9%
5 703
 
2.8%
Other values (2) 665
 
2.6%

본사여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
True
2522 
False
 
87
ValueCountFrequency (%)
True 2522
96.7%
False 87
 
3.3%
2023-12-12T09:20:48.209432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T09:20:40.219180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:20:39.994888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:20:40.324132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:20:40.098684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:20:48.266074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자번호기초지자체시군구행정동우편번호본사여부
사업자번호1.0000.1420.0990.0940.1160.036
기초지자체0.1421.0001.0000.9960.9700.429
시군구0.0991.0001.0001.0000.9460.330
행정동0.0940.9961.0001.0001.0000.416
우편번호0.1160.9700.9461.0001.0000.272
본사여부0.0360.4290.3300.4160.2721.000
2023-12-12T09:20:48.378801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기초지자체본사여부행정동시군구
기초지자체1.0000.3290.9391.000
본사여부0.3291.0000.3450.329
행정동0.9390.3451.0000.993
시군구1.0000.3290.9931.000
2023-12-12T09:20:48.489268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자번호우편번호기초지자체시군구행정동본사여부
사업자번호1.0000.0340.0440.0450.0330.027
우편번호0.0341.0000.9110.9120.9920.208
기초지자체0.0440.9111.0001.0000.9390.329
시군구0.0450.9121.0001.0000.9930.329
행정동0.0330.9920.9390.9931.0000.345
본사여부0.0270.2080.3290.3290.3451.000

Missing values

2023-12-12T09:20:40.496473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:20:40.743709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:20:40.968148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

관리번호업체명사업자번호대표자명전화번호팩스번호기초지자체시군구법정동행정동우편번호도로명주소설립일자본사여부
0REQ-003-01-00001(명)뉴삼우관광전세4168104909박금수061-742-5200061-742-6662순천시순천시봉림리별량면58033전남 순천시 별량면 녹색로 9641991-08-01Y
1REQ-003-01-00002(명)중앙택시4168102099이연심061-754-4500<NA>순천시순천시광천리주암면57910전남 순천시 주암면 동주로 20411988-08-01Y
2REQ-003-01-00003(명)한일전기4168112151유상준061-745-0909<NA>순천시순천시풍덕동풍덕동57960전남 순천시 팔마로 571995-07-01Y
3REQ-003-01-00004(유)119농산3228701157황아름061-723-1192<NA>순천시순천시남가리해룡면58005전남 순천시 해룡면 서가길 152018-08-16Y
4REQ-003-01-00005(유)1급조은정비공업사3488100328김무현061-723-5123<NA>순천시순천시조례동왕조1동57946전남 순천시 신월길 1032015-11-16Y
5REQ-003-01-00006(유)가나투어4168157051오성기061-745-2600061-741-1199순천시순천시연향동덕연동57979전남 순천시 연향번영2길 122007-07-24Y
6REQ-003-01-00007(유)가야산업개발4168190763김영헌061-725-2790061-746-0103순천시순천시연향동덕연동57987전남 순천시 하대석길 16, 2층2013-04-17Y
7REQ-003-01-00008(유)강남산업4168149473박종만061-752-8922061-724-6262순천시순천시궁각리주암면57910전남 순천시 주암면 주석로 290-552005-07-07Y
8REQ-003-01-00009(유)강남석재산업4168130218정영식061-742-3727061-724-6262순천시순천시모전리황전면57901전남 순천시 황전면 황전중앙길 2242000-12-26Y
9REQ-003-01-00010(유)강욱8698701205허세범<NA><NA>순천시순천시석현동삼산동57922전남 순천시 석현길 70, 101동 1103호2019-02-22Y
관리번호업체명사업자번호대표자명전화번호팩스번호기초지자체시군구법정동행정동우편번호도로명주소설립일자본사여부
2599REQ-003-01-02600호성산업(주)4168146573유종완061-753-0071061-751-0033전라남도순천시서면서면57927전남 순천시 서면 산단4길 562005-01-01Y
2600REQ-003-01-02601호텔라움(주)5558600877김진곤/이삼열061-921-1004<NA>전라남도순천시풍덕동풍덕동57993전남 순천시 하풍동길 1-12018-01-22Y
2601REQ-003-01-02602홍익건설(주)8328601619조현익061-727-0226<NA>전라남도순천시연향동덕연동57979전남 순천시 연향번영3길 8-15, 2층2020-02-24Y
2602REQ-003-01-02603화성전력(주)4168126903류옥숙061-745-3400061-742-3023전라남도순천시남정동남제동57958전남 순천시 남산4길 232000-03-02Y
2603REQ-003-01-02604화성정보통신(주)4168136885류옥숙061-745-3400061-742-3023전라남도순천시남정동남제동57958전남 순천시 남산4길 232002-06-12Y
2604REQ-003-01-02605화신상사4168127353강은심061-746-4567<NA>전라남도순천시조례동왕조동57970전남 순천시 왕궁길 60, 상가 108호2000-05-01Y
2605REQ-003-01-02606회명환경기술(주)4168177488최추열061-724-6776<NA>전라남도순천시해룡면해룡면58022전남 순천시 해룡면 해룡산단5로 652011-04-28Y
2606REQ-003-01-02607효향(주)2588700139정회남061-744-8855<NA>전라남도순천시별량면별량면58031전남 순천시 별량면 죽산길 182015-04-27Y
2607REQ-003-01-02608희망복지센터4168164168강관형061-741-8889<NA>전라남도순천시장천동장천동57956전남 순천시 중앙1길 8, 2층<NA>Y
2608REQ-003-01-02609힐링파머스(주)4358800015김은영061-755-9495061-755-8764전라남도순천시주암면주암면57910전남 순천시 주암면 주석로 276-112015-02-16Y