Overview

Dataset statistics

Number of variables6
Number of observations36
Missing cells5
Missing cells (%)2.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory52.7 B

Variable types

Numeric1
Text4
Categorical1

Dataset

Description2014년에 조사한 충청북도 청주시 청원구 관내 소재 직업소개소 현황(36개소)에 대한 데이터입니다. 소재지, 대표자명, 전화번호 등이 기재되어 있습니다.
Author충청북도 청주시
URLhttps://www.data.go.kr/data/15051170/fileData.do

Alerts

순번 is highly overall correlated with 유무료High correlation
유무료 is highly overall correlated with 순번High correlation
유무료 is highly imbalanced (58.6%)Imbalance
전화번호 has 5 (13.9%) missing valuesMissing
순번 has unique valuesUnique
직업소개소 명칭 has unique valuesUnique
소재지 has unique valuesUnique
대표자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:14:40.476541
Analysis finished2023-12-12 21:14:41.101817
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.5
Minimum1
Maximum36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-13T06:14:41.174962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.75
Q19.75
median18.5
Q327.25
95-th percentile34.25
Maximum36
Range35
Interquartile range (IQR)17.5

Descriptive statistics

Standard deviation10.535654
Coefficient of variation (CV)0.5694948
Kurtosis-1.2
Mean18.5
Median Absolute Deviation (MAD)9
Skewness0
Sum666
Variance111
MonotonicityStrictly increasing
2023-12-13T06:14:41.350336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
1 1
 
2.8%
20 1
 
2.8%
22 1
 
2.8%
23 1
 
2.8%
24 1
 
2.8%
25 1
 
2.8%
26 1
 
2.8%
27 1
 
2.8%
28 1
 
2.8%
29 1
 
2.8%
Other values (26) 26
72.2%
ValueCountFrequency (%)
1 1
2.8%
2 1
2.8%
3 1
2.8%
4 1
2.8%
5 1
2.8%
6 1
2.8%
7 1
2.8%
8 1
2.8%
9 1
2.8%
10 1
2.8%
ValueCountFrequency (%)
36 1
2.8%
35 1
2.8%
34 1
2.8%
33 1
2.8%
32 1
2.8%
31 1
2.8%
30 1
2.8%
29 1
2.8%
28 1
2.8%
27 1
2.8%
Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-13T06:14:41.899026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length9.2777778
Min length5

Characters and Unicode

Total characters334
Distinct characters105
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row충북청원지역자활센터
2nd row충북여성새로일하기본부
3rd row사단법인 중소기업기술혁신협회 충북지회
4th row대원인력개발직업소개소
5th row대율직업소개소
ValueCountFrequency (%)
충북청원지역자활센터 1
 
2.6%
한국에코로s직업소개소 1
 
2.6%
한국건설인력공사직업소개소 1
 
2.6%
청주산업인력공사직업소개소 1
 
2.6%
협성인력직업소개소 1
 
2.6%
믿음인력직업소개소 1
 
2.6%
목수직업소개소 1
 
2.6%
진성인력직업소개소 1
 
2.6%
ak리크루팅직업소개소 1
 
2.6%
미소직업인력소개소 1
 
2.6%
Other values (28) 28
73.7%
2023-12-13T06:14:42.279441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
16.2%
31
 
9.3%
30
 
9.0%
26
 
7.8%
19
 
5.7%
18
 
5.4%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (95) 139
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 326
97.6%
Uppercase Letter 3
 
0.9%
Space Separator 2
 
0.6%
Other Symbol 1
 
0.3%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
16.6%
31
 
9.5%
30
 
9.2%
26
 
8.0%
19
 
5.8%
18
 
5.5%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (88) 131
40.2%
Uppercase Letter
ValueCountFrequency (%)
K 1
33.3%
A 1
33.3%
S 1
33.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 327
97.9%
Common 4
 
1.2%
Latin 3
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
16.5%
31
 
9.5%
30
 
9.2%
26
 
8.0%
19
 
5.8%
18
 
5.5%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (89) 132
40.4%
Common
ValueCountFrequency (%)
2
50.0%
( 1
25.0%
) 1
25.0%
Latin
ValueCountFrequency (%)
K 1
33.3%
A 1
33.3%
S 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 326
97.6%
ASCII 7
 
2.1%
None 1
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
16.6%
31
 
9.5%
30
 
9.2%
26
 
8.0%
19
 
5.8%
18
 
5.5%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (88) 131
40.2%
ASCII
ValueCountFrequency (%)
2
28.6%
K 1
14.3%
A 1
14.3%
S 1
14.3%
( 1
14.3%
) 1
14.3%
None
ValueCountFrequency (%)
1
100.0%

소재지
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-13T06:14:42.478273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length14.138889
Min length12

Characters and Unicode

Total characters509
Distinct characters49
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row청원군 오창읍 양청리799-6
2nd row청원군 오창읍 각리 641-1
3rd row청원군 오창읍 양청리 803-1
4th row청원구 내덕동 175-1
5th row청원구 내덕동 173-80
ValueCountFrequency (%)
청원구 28
24.6%
우암동 11
 
9.6%
내덕동 11
 
9.6%
청원군 8
 
7.0%
오창읍 6
 
5.3%
율량동 3
 
2.6%
양청리 2
 
1.8%
남일면 1
 
0.9%
308-1 1
 
0.9%
123-35 1
 
0.9%
Other values (42) 42
36.8%
2023-12-13T06:14:42.800275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
15.5%
39
 
7.7%
36
 
7.1%
1 34
 
6.7%
- 32
 
6.3%
28
 
5.5%
26
 
5.1%
2 21
 
4.1%
3 18
 
3.5%
9 14
 
2.8%
Other values (39) 182
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 243
47.7%
Decimal Number 155
30.5%
Space Separator 79
 
15.5%
Dash Punctuation 32
 
6.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
16.0%
36
14.8%
28
11.5%
26
10.7%
13
 
5.3%
12
 
4.9%
11
 
4.5%
11
 
4.5%
8
 
3.3%
7
 
2.9%
Other values (27) 52
21.4%
Decimal Number
ValueCountFrequency (%)
1 34
21.9%
2 21
13.5%
3 18
11.6%
9 14
9.0%
4 14
9.0%
7 13
 
8.4%
0 11
 
7.1%
5 11
 
7.1%
8 10
 
6.5%
6 9
 
5.8%
Space Separator
ValueCountFrequency (%)
79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 266
52.3%
Hangul 243
47.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
16.0%
36
14.8%
28
11.5%
26
10.7%
13
 
5.3%
12
 
4.9%
11
 
4.5%
11
 
4.5%
8
 
3.3%
7
 
2.9%
Other values (27) 52
21.4%
Common
ValueCountFrequency (%)
79
29.7%
1 34
12.8%
- 32
12.0%
2 21
 
7.9%
3 18
 
6.8%
9 14
 
5.3%
4 14
 
5.3%
7 13
 
4.9%
0 11
 
4.1%
5 11
 
4.1%
Other values (2) 19
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 266
52.3%
Hangul 243
47.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
79
29.7%
1 34
12.8%
- 32
12.0%
2 21
 
7.9%
3 18
 
6.8%
9 14
 
5.3%
4 14
 
5.3%
7 13
 
4.9%
0 11
 
4.1%
5 11
 
4.1%
Other values (2) 19
 
7.1%
Hangul
ValueCountFrequency (%)
39
16.0%
36
14.8%
28
11.5%
26
10.7%
13
 
5.3%
12
 
4.9%
11
 
4.5%
11
 
4.5%
8
 
3.3%
7
 
2.9%
Other values (27) 52
21.4%

대표자
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-13T06:14:43.024942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.0277778
Min length3

Characters and Unicode

Total characters109
Distinct characters66
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row김경호
2nd row오경숙
3rd row장현봉
4th row김대관
5th row장영부
ValueCountFrequency (%)
김경호 1
 
2.7%
윤돈영 1
 
2.7%
장택수 1
 
2.7%
정진석 1
 
2.7%
김노회 1
 
2.7%
성낙화 1
 
2.7%
강병희 1
 
2.7%
김영용 1
 
2.7%
정지천 1
 
2.7%
권영배 1
 
2.7%
Other values (27) 27
73.0%
2023-12-13T06:14:43.377159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
9.2%
5
 
4.6%
5
 
4.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (56) 69
63.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 107
98.2%
Space Separator 2
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
9.3%
5
 
4.7%
5
 
4.7%
4
 
3.7%
4
 
3.7%
4
 
3.7%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (55) 67
62.6%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 107
98.2%
Common 2
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
9.3%
5
 
4.7%
5
 
4.7%
4
 
3.7%
4
 
3.7%
4
 
3.7%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (55) 67
62.6%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 107
98.2%
ASCII 2
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
9.3%
5
 
4.7%
5
 
4.7%
4
 
3.7%
4
 
3.7%
4
 
3.7%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (55) 67
62.6%
ASCII
ValueCountFrequency (%)
2
100.0%

전화번호
Text

MISSING 

Distinct31
Distinct (%)100.0%
Missing5
Missing (%)13.9%
Memory size420.0 B
2023-12-13T06:14:43.612805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters248
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st row269-5720
2nd row217-9195
3rd row288-1432
4th row257-1226
5th row255-6353
ValueCountFrequency (%)
269-5720 1
 
3.2%
253-3637 1
 
3.2%
217-9195 1
 
3.2%
288-0491 1
 
3.2%
213-1982 1
 
3.2%
217-5916 1
 
3.2%
231-6880 1
 
3.2%
215-1604 1
 
3.2%
212-6668 1
 
3.2%
211-3633 1
 
3.2%
Other values (21) 21
67.7%
2023-12-13T06:14:43.968966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 54
21.8%
1 33
13.3%
- 31
12.5%
5 24
9.7%
6 19
 
7.7%
3 18
 
7.3%
9 16
 
6.5%
8 16
 
6.5%
0 15
 
6.0%
7 12
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 217
87.5%
Dash Punctuation 31
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 54
24.9%
1 33
15.2%
5 24
11.1%
6 19
 
8.8%
3 18
 
8.3%
9 16
 
7.4%
8 16
 
7.4%
0 15
 
6.9%
7 12
 
5.5%
4 10
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 248
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 54
21.8%
1 33
13.3%
- 31
12.5%
5 24
9.7%
6 19
 
7.7%
3 18
 
7.3%
9 16
 
6.5%
8 16
 
6.5%
0 15
 
6.0%
7 12
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 248
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 54
21.8%
1 33
13.3%
- 31
12.5%
5 24
9.7%
6 19
 
7.7%
3 18
 
7.3%
9 16
 
6.5%
8 16
 
6.5%
0 15
 
6.0%
7 12
 
4.8%

유무료
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size420.0 B
유료
33 
무료
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row무료
2nd row무료
3rd row무료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 33
91.7%
무료 3
 
8.3%

Length

2023-12-13T06:14:44.112156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:14:44.209326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 33
91.7%
무료 3
 
8.3%

Interactions

2023-12-13T06:14:40.787848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:14:44.276799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번직업소개소 명칭소재지대표자전화번호유무료
순번1.0001.0001.0001.0001.0000.943
직업소개소 명칭1.0001.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.000
대표자1.0001.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.0001.000
유무료0.9431.0001.0001.0001.0001.000
2023-12-13T06:14:44.384822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료
순번1.0000.696
유무료0.6961.000

Missing values

2023-12-13T06:14:40.932863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:14:41.053118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번직업소개소 명칭소재지대표자전화번호유무료
01충북청원지역자활센터청원군 오창읍 양청리799-6김경호269-5720무료
12충북여성새로일하기본부청원군 오창읍 각리 641-1오경숙217-9195무료
23사단법인 중소기업기술혁신협회 충북지회청원군 오창읍 양청리 803-1장현봉288-1432무료
34대원인력개발직업소개소청원구 내덕동 175-1김대관257-1226유료
45대율직업소개소청원구 내덕동 173-80장영부255-6353유료
56그린인력직업소개소청원구 내덕동 178-1김두영215-1259유료
67내덕개발직업소개소청원구 내덕동 290-6김순회256-1879유료
78청림인력직업소개소청원구 내덕동 305-6문태주256-0482유료
89선구산업직업소개소청원구 내덕동 381-2김재길222-2365유료
910대창인력직업소개소청원구 내덕동 493-2이원복212-5858유료
순번직업소개소 명칭소재지대표자전화번호유무료
2627청주산업인력공사직업소개소청원구 우암동 381-3최명자215-1604유료
2728미소직업인력소개소청원구 율량동 895정지천<NA>유료
2829㈜비엔싸아이직업소개소청원구 우암동 326-1최환이231-6880유료
2930좋은파출인력직업소개소청원구 공항로150번길42송현옥<NA>유료
3031하누리산업인력개발청원구 상당로 290-2김기웅<NA>유료
3132대한국제교류청원군 오창읍 792-4오창온천 2층김종팔217-5916유료
3233내수인력직업소개소청원군 내수읍 마산리 159-10민 수213-1982유료
3334도반전기촌청원군 남일면 효촌리 196-4장은경288-0491유료
3435크로바산업인력청원군 오창읍 장대리 308-1권영배211-9797유료
3536장송컨설팅청원군 오창읍 양청리 792-1이강열<NA>유료