Overview

Dataset statistics

Number of variables8
Number of observations237
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.4 KiB
Average record size in memory66.6 B

Variable types

Numeric1
Categorical4
DateTime1
Text2

Dataset

Description장애인고용을 위하여 고용컨설팅 서비스를 신청한 사업체에 대한 제공정보(순번, 사업장명, 접수번호, 접수일자 등)
URLhttps://www.data.go.kr/data/15014777/fileData.do

Alerts

접수번호 has constant value ""Constant
관할지역본부 및 지사 is highly overall correlated with 관할지역본부 및 지사 대표번호High correlation
관할지역본부 및 지사 대표번호 is highly overall correlated with 관할지역본부 및 지사High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:17:58.091025
Analysis finished2023-12-12 06:17:58.922307
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct237
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119
Minimum1
Maximum237
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T15:17:58.994706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.8
Q160
median119
Q3178
95-th percentile225.2
Maximum237
Range236
Interquartile range (IQR)118

Descriptive statistics

Standard deviation68.560193
Coefficient of variation (CV)0.57613607
Kurtosis-1.2
Mean119
Median Absolute Deviation (MAD)59
Skewness0
Sum28203
Variance4700.5
MonotonicityStrictly increasing
2023-12-12T15:17:59.147057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
164 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
Other values (227) 227
95.8%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
237 1
0.4%
236 1
0.4%
235 1
0.4%
234 1
0.4%
233 1
0.4%
232 1
0.4%
231 1
0.4%
230 1
0.4%
229 1
0.4%
228 1
0.4%

접수번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
202200000000
237 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202200000000
2nd row202200000000
3rd row202200000000
4th row202200000000
5th row202200000000

Common Values

ValueCountFrequency (%)
202200000000 237
100.0%

Length

2023-12-12T15:17:59.318760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:17:59.423792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202200000000 237
100.0%
Distinct61
Distinct (%)25.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2022-08-17 00:00:00
Maximum2022-11-18 00:00:00
2023-12-12T15:17:59.556654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:17:59.740330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct235
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T15:18:00.024145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length8.5738397
Min length3

Characters and Unicode

Total characters2032
Distinct characters307
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)98.3%

Sample

1st row주식회사 케이씨씨글라스
2nd row이앤밸류(주)
3rd row(재)충청남도청소년진흥원
4th row조광페인트
5th row장금상선(주)
ValueCountFrequency (%)
주식회사 14
 
5.0%
재단법인 4
 
1.4%
산학협력단 4
 
1.4%
코리아 3
 
1.1%
의료법인 2
 
0.7%
유한회사 2
 
0.7%
2
 
0.7%
영진약품(주 2
 
0.7%
학교법인 2
 
0.7%
국방부 2
 
0.7%
Other values (241) 241
86.7%
2023-12-12T15:18:00.466556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
137
 
6.7%
( 116
 
5.7%
) 116
 
5.7%
51
 
2.5%
44
 
2.2%
42
 
2.1%
42
 
2.1%
35
 
1.7%
33
 
1.6%
29
 
1.4%
Other values (297) 1387
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1730
85.1%
Open Punctuation 116
 
5.7%
Close Punctuation 116
 
5.7%
Space Separator 42
 
2.1%
Uppercase Letter 28
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
7.9%
51
 
2.9%
44
 
2.5%
42
 
2.4%
35
 
2.0%
33
 
1.9%
29
 
1.7%
28
 
1.6%
28
 
1.6%
27
 
1.6%
Other values (280) 1276
73.8%
Uppercase Letter
ValueCountFrequency (%)
C 6
21.4%
E 3
10.7%
D 3
10.7%
A 2
 
7.1%
H 2
 
7.1%
G 2
 
7.1%
P 2
 
7.1%
B 2
 
7.1%
S 1
 
3.6%
J 1
 
3.6%
Other values (4) 4
14.3%
Open Punctuation
ValueCountFrequency (%)
( 116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 116
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1730
85.1%
Common 274
 
13.5%
Latin 28
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
7.9%
51
 
2.9%
44
 
2.5%
42
 
2.4%
35
 
2.0%
33
 
1.9%
29
 
1.7%
28
 
1.6%
28
 
1.6%
27
 
1.6%
Other values (280) 1276
73.8%
Latin
ValueCountFrequency (%)
C 6
21.4%
E 3
10.7%
D 3
10.7%
A 2
 
7.1%
H 2
 
7.1%
G 2
 
7.1%
P 2
 
7.1%
B 2
 
7.1%
S 1
 
3.6%
J 1
 
3.6%
Other values (4) 4
14.3%
Common
ValueCountFrequency (%)
( 116
42.3%
) 116
42.3%
42
 
15.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1730
85.1%
ASCII 302
 
14.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
137
 
7.9%
51
 
2.9%
44
 
2.5%
42
 
2.4%
35
 
2.0%
33
 
1.9%
29
 
1.7%
28
 
1.6%
28
 
1.6%
27
 
1.6%
Other values (280) 1276
73.8%
ASCII
ValueCountFrequency (%)
( 116
38.4%
) 116
38.4%
42
 
13.9%
C 6
 
2.0%
E 3
 
1.0%
D 3
 
1.0%
A 2
 
0.7%
H 2
 
0.7%
G 2
 
0.7%
P 2
 
0.7%
Other values (7) 8
 
2.6%
Distinct93
Distinct (%)39.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T15:18:00.815352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.9535865
Min length5

Characters and Unicode

Total characters2122
Distinct characters95
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)24.5%

Sample

1st row서울특별시 서초구
2nd row서울특별시 강남구
3rd row충청남도 천안시 서북구
4th row부산광역시 사상구
5th row서울특별시 중구
ValueCountFrequency (%)
서울특별시 122
24.1%
강남구 33
 
6.5%
경기도 31
 
6.1%
중구 20
 
4.0%
경상북도 20
 
4.0%
서초구 14
 
2.8%
성남시 14
 
2.8%
분당구 12
 
2.4%
종로구 12
 
2.4%
강원도 11
 
2.2%
Other values (101) 217
42.9%
2023-12-12T15:18:01.280978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
269
 
12.7%
221
 
10.4%
185
 
8.7%
145
 
6.8%
126
 
5.9%
126
 
5.9%
124
 
5.8%
93
 
4.4%
82
 
3.9%
60
 
2.8%
Other values (85) 691
32.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1853
87.3%
Space Separator 269
 
12.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
221
 
11.9%
185
 
10.0%
145
 
7.8%
126
 
6.8%
126
 
6.8%
124
 
6.7%
93
 
5.0%
82
 
4.4%
60
 
3.2%
50
 
2.7%
Other values (84) 641
34.6%
Space Separator
ValueCountFrequency (%)
269
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1853
87.3%
Common 269
 
12.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
221
 
11.9%
185
 
10.0%
145
 
7.8%
126
 
6.8%
126
 
6.8%
124
 
6.7%
93
 
5.0%
82
 
4.4%
60
 
3.2%
50
 
2.7%
Other values (84) 641
34.6%
Common
ValueCountFrequency (%)
269
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1853
87.3%
ASCII 269
 
12.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
269
100.0%
Hangul
ValueCountFrequency (%)
221
 
11.9%
185
 
10.0%
145
 
7.8%
126
 
6.8%
126
 
6.8%
124
 
6.7%
93
 
5.0%
82
 
4.4%
60
 
3.2%
50
 
2.7%
Other values (84) 641
34.6%

업체구분
Categorical

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
민간기업
174 
국가 및 지방자치단체
33 
공공기관
30 

Length

Max length11
Median length4
Mean length4.9746835
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row민간기업
2nd row민간기업
3rd row공공기관
4th row민간기업
5th row민간기업

Common Values

ValueCountFrequency (%)
민간기업 174
73.4%
국가 및 지방자치단체 33
 
13.9%
공공기관 30
 
12.7%

Length

2023-12-12T15:18:01.445679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:18:01.565332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간기업 174
57.4%
국가 33
 
10.9%
33
 
10.9%
지방자치단체 33
 
10.9%
공공기관 30
 
9.9%

관할지역본부 및 지사
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
서울동부지사
54 
서울지역본부
47 
경북지사
18 
서울남부지사
17 
경기동부지사
17 
Other values (17)
84 

Length

Max length6
Median length6
Mean length5.4092827
Min length2

Unique

Unique2 ?
Unique (%)0.8%

Sample

1st row서울동부지사
2nd row서울동부지사
3rd row충남지사
4th row부산지역본부
5th row서울지역본부

Common Values

ValueCountFrequency (%)
서울동부지사 54
22.8%
서울지역본부 47
19.8%
경북지사 18
 
7.6%
서울남부지사 17
 
7.2%
경기동부지사 17
 
7.2%
충남지사 11
 
4.6%
강원지사 10
 
4.2%
부산지역본부 9
 
3.8%
경남지사 8
 
3.4%
대구지역본부 6
 
2.5%
Other values (12) 40
16.9%

Length

2023-12-12T15:18:01.727284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울동부지사 54
22.8%
서울지역본부 47
19.8%
경북지사 18
 
7.6%
서울남부지사 17
 
7.2%
경기동부지사 17
 
7.2%
충남지사 11
 
4.6%
강원지사 10
 
4.2%
부산지역본부 9
 
3.8%
경남지사 8
 
3.4%
인천지사 6
 
2.5%
Other values (12) 40
16.9%

관할지역본부 및 지사 대표번호
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
02-2146-3500
54 
02-6320-7000
47 
054-450-3000
18 
02-6004-1005
17 
031-600-0209
17 
Other values (17)
84 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique2 ?
Unique (%)0.8%

Sample

1st row02-2146-3500
2nd row02-2146-3500
3rd row041-629-6000
4th row051-640-9800
5th row02-6320-7000

Common Values

ValueCountFrequency (%)
02-2146-3500 54
22.8%
02-6320-7000 47
19.8%
054-450-3000 18
 
7.6%
02-6004-1005 17
 
7.2%
031-600-0209 17
 
7.2%
041-629-6000 11
 
4.6%
033-737-6620 10
 
4.2%
051-640-9800 9
 
3.8%
055-225-8006 8
 
3.4%
053-288-1500 6
 
2.5%
Other values (12) 40
16.9%

Length

2023-12-12T15:18:01.892091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
02-2146-3500 54
22.8%
02-6320-7000 47
19.8%
054-450-3000 18
 
7.6%
02-6004-1005 17
 
7.2%
031-600-0209 17
 
7.2%
041-629-6000 11
 
4.6%
033-737-6620 10
 
4.2%
051-640-9800 9
 
3.8%
055-225-8006 8
 
3.4%
032-242-1004 6
 
2.5%
Other values (12) 40
16.9%

Interactions

2023-12-12T15:17:58.580556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:18:02.267683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번접수일자소재지업체구분관할지역본부 및 지사관할지역본부 및 지사 대표번호
순번1.0000.9920.6730.2960.5970.597
접수일자0.9921.0000.0000.2840.8160.816
소재지0.6730.0001.0000.8960.9990.999
업체구분0.2960.2840.8961.0000.6510.651
관할지역본부 및 지사0.5970.8160.9990.6511.0001.000
관할지역본부 및 지사 대표번호0.5970.8160.9990.6511.0001.000
2023-12-12T15:18:02.374136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할지역본부 및 지사업체구분관할지역본부 및 지사 대표번호
관할지역본부 및 지사1.0000.4201.000
업체구분0.4201.0000.420
관할지역본부 및 지사 대표번호1.0000.4201.000
2023-12-12T15:18:02.490623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번업체구분관할지역본부 및 지사관할지역본부 및 지사 대표번호
순번1.0000.1840.2620.262
업체구분0.1841.0000.4200.420
관할지역본부 및 지사0.2620.4201.0001.000
관할지역본부 및 지사 대표번호0.2620.4201.0001.000

Missing values

2023-12-12T15:17:58.738898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:17:58.871336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번접수번호접수일자사업체명소재지업체구분관할지역본부 및 지사관할지역본부 및 지사 대표번호
012022000000002022-11-18주식회사 케이씨씨글라스서울특별시 서초구민간기업서울동부지사02-2146-3500
122022000000002022-11-18이앤밸류(주)서울특별시 강남구민간기업서울동부지사02-2146-3500
232022000000002022-11-17(재)충청남도청소년진흥원충청남도 천안시 서북구공공기관충남지사041-629-6000
342022000000002022-11-17조광페인트부산광역시 사상구민간기업부산지역본부051-640-9800
452022000000002022-11-16장금상선(주)서울특별시 중구민간기업서울지역본부02-6320-7000
562022000000002022-11-15한국자산관리공사부산광역시 남구공공기관부산지역본부051-640-9800
672022000000002022-11-11신한카드(주)서울특별시 중구민간기업서울지역본부02-6320-7000
782022000000002022-11-11HD한국조선해양(주)경기도 성남시 분당구민간기업서울지역본부02-6320-7000
892022000000002022-11-11(주)에이치에스애드서울특별시 마포구민간기업서울지역본부02-6320-7000
9102022000000002022-11-11에이치디씨현대산업개발(주)서울특별시 용산구민간기업서울지역본부02-6320-7000
순번접수번호접수일자사업체명소재지업체구분관할지역본부 및 지사관할지역본부 및 지사 대표번호
2272282022000000002022-08-30대구광역시교육청대구광역시 수성구국가 및 지방자치단체대구지역본부053-288-1500
2282292022000000002022-08-30(주)신성엔지니어링경상북도 청도군민간기업대구지역본부053-288-1500
2292302022000000002022-08-26콘티넨탈 오토모티브 코리아경기도 성남시 분당구민간기업경기동부지사031-600-0209
2302312022000000002022-08-24(주)컴투스서울특별시 금천구민간기업서울남부지사02-6004-1005
2312322022000000002022-08-24피앤비솔루션(주)부산광역시 해운대구민간기업부산지역본부051-640-9800
2322332022000000002022-08-23(주)삼화양행서울특별시 금천구민간기업서울남부지사02-6004-1005
2332342022000000002022-08-22진우산전(주)서울특별시 강남구민간기업서울동부지사02-2146-3500
2342352022000000002022-08-17사회복지법인 어린이재단서울특별시 중구민간기업서울지역본부02-6320-7000
2352362022000000002022-08-17동우건설 주식회사부산광역시 북구민간기업부산지역본부051-640-9800
2362372022000000002022-08-17국회사무처서울특별시 영등포구국가 및 지방자치단체서울남부지사02-6004-1005