Overview

Dataset statistics

Number of variables6
Number of observations1829
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory87.6 KiB
Average record size in memory49.1 B

Variable types

Text3
Numeric1
Categorical1
DateTime1

Dataset

Description순천시 지역 창업 생태계 데이터를 구축하여 지역기반 스타트업을 활성화하고자 순천시 기업 고용 관련 데이터를 제공합니다.
Author전라남도 순천시
URLhttps://www.data.go.kr/data/15111423/fileData.do

Alerts

관리번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:31:14.752691
Analysis finished2023-12-12 09:31:15.694780
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리번호
Text

UNIQUE 

Distinct1829
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
2023-12-12T18:31:15.935529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length16
Min length16

Characters and Unicode

Total characters29264
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1829 ?
Unique (%)100.0%

Sample

1st rowREQ-003-03-00001
2nd rowREQ-003-03-00002
3rd rowREQ-003-03-00003
4th rowREQ-003-03-00004
5th rowREQ-003-03-00005
ValueCountFrequency (%)
req-003-03-00001 1
 
0.1%
req-003-03-01229 1
 
0.1%
req-003-03-01227 1
 
0.1%
req-003-03-01226 1
 
0.1%
req-003-03-01225 1
 
0.1%
req-003-03-01224 1
 
0.1%
req-003-03-01223 1
 
0.1%
req-003-03-01222 1
 
0.1%
req-003-03-01221 1
 
0.1%
req-003-03-01220 1
 
0.1%
Other values (1819) 1819
99.5%
2023-12-12T18:31:16.387053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8885
30.4%
- 5487
18.8%
3 4221
14.4%
R 1829
 
6.2%
E 1829
 
6.2%
Q 1829
 
6.2%
1 1403
 
4.8%
2 573
 
2.0%
4 563
 
1.9%
5 563
 
1.9%
Other values (4) 2082
 
7.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18290
62.5%
Dash Punctuation 5487
 
18.8%
Uppercase Letter 5487
 
18.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8885
48.6%
3 4221
23.1%
1 1403
 
7.7%
2 573
 
3.1%
4 563
 
3.1%
5 563
 
3.1%
6 563
 
3.1%
7 563
 
3.1%
8 493
 
2.7%
9 463
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
R 1829
33.3%
E 1829
33.3%
Q 1829
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 5487
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 23777
81.2%
Latin 5487
 
18.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8885
37.4%
- 5487
23.1%
3 4221
17.8%
1 1403
 
5.9%
2 573
 
2.4%
4 563
 
2.4%
5 563
 
2.4%
6 563
 
2.4%
7 563
 
2.4%
8 493
 
2.1%
Latin
ValueCountFrequency (%)
R 1829
33.3%
E 1829
33.3%
Q 1829
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29264
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8885
30.4%
- 5487
18.8%
3 4221
14.4%
R 1829
 
6.2%
E 1829
 
6.2%
Q 1829
 
6.2%
1 1403
 
4.8%
2 573
 
2.0%
4 563
 
1.9%
5 563
 
1.9%
Other values (4) 2082
 
7.1%
Distinct1818
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
2023-12-12T18:31:16.674095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length7.9442318
Min length2

Characters and Unicode

Total characters14530
Distinct characters498
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1813 ?
Unique (%)99.1%

Sample

1st row(명)뉴삼우관광전세
2nd row(명)중앙택시
3rd row(유)119농산
4th row(유)1급조은정비공업사
5th row(유)가나투어
ValueCountFrequency (%)
농협은행(주 6
 
0.3%
주)광주은행 4
 
0.2%
주)대경 2
 
0.1%
농업회사법인 2
 
0.1%
대성시스템(주 2
 
0.1%
킴스체인 2
 
0.1%
주)해광 2
 
0.1%
주)한필 1
 
0.1%
주)한창기업 1
 
0.1%
주)한중 1
 
0.1%
Other values (1815) 1815
98.7%
2023-12-12T18:31:17.138568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1764
 
12.1%
) 1764
 
12.1%
1581
 
10.9%
338
 
2.3%
271
 
1.9%
251
 
1.7%
240
 
1.7%
216
 
1.5%
210
 
1.4%
205
 
1.4%
Other values (488) 7690
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10958
75.4%
Open Punctuation 1764
 
12.1%
Close Punctuation 1764
 
12.1%
Space Separator 23
 
0.2%
Decimal Number 13
 
0.1%
Other Punctuation 5
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1581
 
14.4%
338
 
3.1%
271
 
2.5%
251
 
2.3%
240
 
2.2%
216
 
2.0%
210
 
1.9%
205
 
1.9%
191
 
1.7%
159
 
1.5%
Other values (474) 7296
66.6%
Decimal Number
ValueCountFrequency (%)
1 7
53.8%
5 1
 
7.7%
0 1
 
7.7%
9 1
 
7.7%
8 1
 
7.7%
6 1
 
7.7%
2 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
S 1
33.3%
R 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 1764
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1764
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10958
75.4%
Common 3569
 
24.6%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1581
 
14.4%
338
 
3.1%
271
 
2.5%
251
 
2.3%
240
 
2.2%
216
 
2.0%
210
 
1.9%
205
 
1.9%
191
 
1.7%
159
 
1.5%
Other values (474) 7296
66.6%
Common
ValueCountFrequency (%)
( 1764
49.4%
) 1764
49.4%
23
 
0.6%
1 7
 
0.2%
. 5
 
0.1%
5 1
 
< 0.1%
0 1
 
< 0.1%
9 1
 
< 0.1%
8 1
 
< 0.1%
6 1
 
< 0.1%
Latin
ValueCountFrequency (%)
D 1
33.3%
S 1
33.3%
R 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10958
75.4%
ASCII 3572
 
24.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1764
49.4%
) 1764
49.4%
23
 
0.6%
1 7
 
0.2%
. 5
 
0.1%
D 1
 
< 0.1%
S 1
 
< 0.1%
R 1
 
< 0.1%
5 1
 
< 0.1%
0 1
 
< 0.1%
Other values (4) 4
 
0.1%
Hangul
ValueCountFrequency (%)
1581
 
14.4%
338
 
3.1%
271
 
2.5%
251
 
2.3%
240
 
2.2%
216
 
2.0%
210
 
1.9%
205
 
1.9%
191
 
1.7%
159
 
1.5%
Other values (474) 7296
66.6%

사업자번호
Real number (ℝ)

Distinct1823
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4168669 × 109
Minimum1.0186694 × 109
Maximum8.9987014 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.2 KiB
2023-12-12T18:31:17.321313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0186694 × 109
5-th percentile1.9303405 × 109
Q14.1681081 × 109
median4.1681593 × 109
Q34.1686012 × 109
95-th percentile7.8546412 × 109
Maximum8.9987014 × 109
Range7.980032 × 109
Interquartile range (IQR)493104

Descriptive statistics

Standard deviation1.4643506 × 109
Coefficient of variation (CV)0.33153604
Kurtosis1.9682164
Mean4.4168669 × 109
Median Absolute Deviation (MAD)360162
Skewness0.8887099
Sum8.0784495 × 1012
Variance2.1443226 × 1018
MonotonicityNot monotonic
2023-12-12T18:31:17.539474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4168100937 2
 
0.1%
4098187824 2
 
0.1%
4168174332 2
 
0.1%
8078700630 2
 
0.1%
4388100878 2
 
0.1%
8348702120 2
 
0.1%
4168154753 1
 
0.1%
6138165768 1
 
0.1%
8868601426 1
 
0.1%
5048701822 1
 
0.1%
Other values (1813) 1813
99.1%
ValueCountFrequency (%)
1018669422 1
0.1%
1028132035 1
0.1%
1048700604 1
0.1%
1058190048 1
0.1%
1058610001 1
0.1%
1078135022 1
0.1%
1078160405 1
0.1%
1098655617 1
0.1%
1118131705 1
0.1%
1118133697 1
0.1%
ValueCountFrequency (%)
8998701394 1
0.1%
8958802122 1
0.1%
8958700146 1
0.1%
8948701180 1
0.1%
8948501129 1
0.1%
8928800334 1
0.1%
8908602443 1
0.1%
8898800134 1
0.1%
8898701266 1
0.1%
8888801009 1
0.1%
Distinct1691
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
2023-12-12T18:31:18.221431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length3
Mean length3.1930016
Min length2

Characters and Unicode

Total characters5840
Distinct characters247
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1569 ?
Unique (%)85.8%

Sample

1st row박금수
2nd row이연심
3rd row황아름
4th row김무현
5th row오성기
ValueCountFrequency (%)
권준학 6
 
0.3%
송종욱 4
 
0.2%
정대식 3
 
0.2%
박화수 3
 
0.2%
김정숙 3
 
0.2%
임은미 3
 
0.2%
김영미 3
 
0.2%
김완우 3
 
0.2%
김태균 3
 
0.2%
김도일 3
 
0.2%
Other values (1681) 1795
98.1%
2023-12-12T18:31:18.870882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
429
 
7.3%
249
 
4.3%
238
 
4.1%
173
 
3.0%
134
 
2.3%
116
 
2.0%
107
 
1.8%
87
 
1.5%
85
 
1.5%
84
 
1.4%
Other values (237) 4138
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5710
97.8%
Other Punctuation 67
 
1.1%
Space Separator 63
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
429
 
7.5%
249
 
4.4%
238
 
4.2%
173
 
3.0%
134
 
2.3%
116
 
2.0%
107
 
1.9%
87
 
1.5%
85
 
1.5%
84
 
1.5%
Other values (235) 4008
70.2%
Other Punctuation
ValueCountFrequency (%)
/ 67
100.0%
Space Separator
ValueCountFrequency (%)
63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5710
97.8%
Common 130
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
429
 
7.5%
249
 
4.4%
238
 
4.2%
173
 
3.0%
134
 
2.3%
116
 
2.0%
107
 
1.9%
87
 
1.5%
85
 
1.5%
84
 
1.5%
Other values (235) 4008
70.2%
Common
ValueCountFrequency (%)
/ 67
51.5%
63
48.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5710
97.8%
ASCII 130
 
2.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
429
 
7.5%
249
 
4.4%
238
 
4.2%
173
 
3.0%
134
 
2.3%
116
 
2.0%
107
 
1.9%
87
 
1.5%
85
 
1.5%
84
 
1.5%
Other values (235) 4008
70.2%
ASCII
ValueCountFrequency (%)
/ 67
51.5%
63
48.5%

종업원수
Categorical

Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
5명 이하
978 
15명 이하
524 
16명 이상
276 
10명 이하
 
46
15명이하
 
3

Length

Max length6
Median length5
Mean length5.4636413
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row16명 이상
2nd row15명 이하
3rd row15명 이하
4th row15명 이하
5th row16명 이상

Common Values

ValueCountFrequency (%)
5명 이하 978
53.5%
15명 이하 524
28.6%
16명 이상 276
 
15.1%
10명 이하 46
 
2.5%
15명이하 3
 
0.2%
16명 이하 2
 
0.1%

Length

2023-12-12T18:31:19.088588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:31:19.286035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
이하 1550
42.4%
5명 978
26.8%
15명 524
 
14.3%
16명 278
 
7.6%
이상 276
 
7.6%
10명 46
 
1.3%
15명이하 3
 
0.1%
Distinct456
Distinct (%)24.9%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
Minimum2000-10-01 00:00:00
Maximum2022-08-26 00:00:00
2023-12-12T18:31:19.553547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:31:19.820643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T18:31:15.293314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:31:19.965361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자번호종업원수
사업자번호1.0000.075
종업원수0.0751.000
2023-12-12T18:31:20.126685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자번호종업원수
사업자번호1.0000.039
종업원수0.0391.000

Missing values

2023-12-12T18:31:15.509206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:31:15.638779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관리번호업체명사업자번호대표자명종업원수종업원수기준일자
0REQ-003-03-00001(명)뉴삼우관광전세4168104909박금수16명 이상2014-12-31
1REQ-003-03-00002(명)중앙택시4168102099이연심15명 이하2021-09-06
2REQ-003-03-00003(유)119농산3228701157황아름15명 이하2021-05-04
3REQ-003-03-00004(유)1급조은정비공업사3488100328김무현15명 이하2021-05-10
4REQ-003-03-00005(유)가나투어4168157051오성기16명 이상2021-08-06
5REQ-003-03-00006(유)가야산업개발4168190763김영헌5명 이하2022-06-08
6REQ-003-03-00007(유)강남산업4168149473박종만5명 이하2011-12-01
7REQ-003-03-00008(유)강남석재산업4168130218정영식5명 이하2021-04-19
8REQ-003-03-00009(유)건원건설4028133647이병조15명 이하2022-05-16
9REQ-003-03-00010(유)경강5408701100윤희정5명 이하2022-04-11
관리번호업체명사업자번호대표자명종업원수종업원수기준일자
1819REQ-003-03-01820호남남서부김활성처리제사업협동조합8188100147김병석5명 이하2022-06-28
1820REQ-003-03-01821호남산업(주)4168115217김상헌16명 이상2021-12-13
1821REQ-003-03-01822호남연마(주)4168119800이수형5명 이하2016-01-20
1822REQ-003-03-01823호남전기안전(주)4168135867임동석15명 이하2022-05-23
1823REQ-003-03-01824호성산업(주)4168146573유종완15명 이하2017-12-31
1824REQ-003-03-01825호텔라움(주)5558600877김진곤/이삼열15명 이하2022-05-20
1825REQ-003-03-01826화성전력(주)4168126903류옥숙15명 이하2000-12-31
1826REQ-003-03-01827화성정보통신(주)4168136885류옥숙5명 이하2018-12-31
1827REQ-003-03-01828회명환경기술(주)4168177488최추열5명 이하2022-08-08
1828REQ-003-03-01829힐링파머스(주)4358800015김은영15명 이하2022-05-11