Overview

Dataset statistics

Number of variables6
Number of observations29
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory53.6 B

Variable types

Numeric1
Text2
Categorical3

Dataset

Description충청북도 증평군 직업소개소현황(연번,사업체명,소재지도로명주소,이용료구분,전화번호,데이터기준일) 입니다.
URLhttps://www.data.go.kr/data/15118645/fileData.do

Alerts

데이터기준일 has constant value ""Constant
이용료구분 is highly overall correlated with 전화번호High correlation
전화번호 is highly overall correlated with 이용료구분High correlation
이용료구분 is highly imbalanced (63.8%)Imbalance
연번 has unique valuesUnique
사업체명 has unique valuesUnique
소재지도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:34:34.060717
Analysis finished2023-12-12 10:34:34.827463
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-12T19:34:34.918950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.4
Q18
median15
Q322
95-th percentile27.6
Maximum29
Range28
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.5146932
Coefficient of variation (CV)0.56764621
Kurtosis-1.2
Mean15
Median Absolute Deviation (MAD)7
Skewness0
Sum435
Variance72.5
MonotonicityStrictly increasing
2023-12-12T19:34:35.096135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 1
 
3.4%
2 1
 
3.4%
29 1
 
3.4%
28 1
 
3.4%
27 1
 
3.4%
26 1
 
3.4%
25 1
 
3.4%
24 1
 
3.4%
23 1
 
3.4%
22 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
1 1
3.4%
2 1
3.4%
3 1
3.4%
4 1
3.4%
5 1
3.4%
6 1
3.4%
7 1
3.4%
8 1
3.4%
9 1
3.4%
10 1
3.4%
ValueCountFrequency (%)
29 1
3.4%
28 1
3.4%
27 1
3.4%
26 1
3.4%
25 1
3.4%
24 1
3.4%
23 1
3.4%
22 1
3.4%
21 1
3.4%
20 1
3.4%

사업체명
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-12T19:34:35.407474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length7
Mean length6.7586207
Min length4

Characters and Unicode

Total characters196
Distinct characters74
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row에이플러스(A+)
2nd row증평인력
3rd row탑인력사무소
4th row개미인력
5th row태산인력
ValueCountFrequency (%)
에이플러스(a 1
 
3.2%
주식회사 1
 
3.2%
인력 1
 
3.2%
365일 1
 
3.2%
해냄인력 1
 
3.2%
삼일건설인력소개소 1
 
3.2%
바이오종합인력 1
 
3.2%
증평군일자리종합지원센터 1
 
3.2%
다은건설인력 1
 
3.2%
마스터인력 1
 
3.2%
Other values (21) 21
67.7%
2023-12-12T19:34:35.942988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
13.3%
25
 
12.8%
17
 
8.7%
9
 
4.6%
8
 
4.1%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.0%
4
 
2.0%
Other values (64) 88
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 187
95.4%
Decimal Number 3
 
1.5%
Space Separator 2
 
1.0%
Open Punctuation 1
 
0.5%
Uppercase Letter 1
 
0.5%
Math Symbol 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
13.9%
25
 
13.4%
17
 
9.1%
9
 
4.8%
8
 
4.3%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.1%
4
 
2.1%
Other values (56) 79
42.2%
Decimal Number
ValueCountFrequency (%)
5 1
33.3%
6 1
33.3%
3 1
33.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 187
95.4%
Common 8
 
4.1%
Latin 1
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
13.9%
25
 
13.4%
17
 
9.1%
9
 
4.8%
8
 
4.3%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.1%
4
 
2.1%
Other values (56) 79
42.2%
Common
ValueCountFrequency (%)
2
25.0%
5 1
12.5%
6 1
12.5%
3 1
12.5%
( 1
12.5%
+ 1
12.5%
) 1
12.5%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 187
95.4%
ASCII 9
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
13.9%
25
 
13.4%
17
 
9.1%
9
 
4.8%
8
 
4.3%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.1%
4
 
2.1%
Other values (56) 79
42.2%
ASCII
ValueCountFrequency (%)
2
22.2%
5 1
11.1%
6 1
11.1%
3 1
11.1%
( 1
11.1%
A 1
11.1%
+ 1
11.1%
) 1
11.1%
Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-12T19:34:36.271934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length27
Mean length23.965517
Min length19

Characters and Unicode

Total characters695
Distinct characters57
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row충청북도 증평군 증평읍 중앙로8길 22
2nd row충청북도 증평군 증평읍 충청대로 1756, 1001동 105호 (예다인)
3rd row충청북도 증평군 증평읍 초중10길 12
4th row충청북도 증평군 증평읍 초중6길 47
5th row충청북도 증평군 증평읍 초중로 37, 201호
ValueCountFrequency (%)
충청북도 29
17.5%
증평읍 29
17.5%
증평군 29
17.5%
중앙로 7
 
4.2%
2층 7
 
4.2%
1층 6
 
3.6%
창신로 3
 
1.8%
역전로 3
 
1.8%
초중로 2
 
1.2%
중앙로5길 2
 
1.2%
Other values (48) 49
29.5%
2023-12-12T19:34:36.768777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
137
19.7%
60
 
8.6%
60
 
8.6%
1 31
 
4.5%
30
 
4.3%
30
 
4.3%
29
 
4.2%
29
 
4.2%
29
 
4.2%
29
 
4.2%
Other values (47) 231
33.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 419
60.3%
Space Separator 137
 
19.7%
Decimal Number 114
 
16.4%
Other Punctuation 16
 
2.3%
Dash Punctuation 7
 
1.0%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
14.3%
60
14.3%
30
 
7.2%
30
 
7.2%
29
 
6.9%
29
 
6.9%
29
 
6.9%
29
 
6.9%
26
 
6.2%
16
 
3.8%
Other values (32) 81
19.3%
Decimal Number
ValueCountFrequency (%)
1 31
27.2%
2 27
23.7%
3 10
 
8.8%
0 10
 
8.8%
7 8
 
7.0%
5 8
 
7.0%
4 8
 
7.0%
9 5
 
4.4%
6 4
 
3.5%
8 3
 
2.6%
Space Separator
ValueCountFrequency (%)
137
100.0%
Other Punctuation
ValueCountFrequency (%)
16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 419
60.3%
Common 276
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
14.3%
60
14.3%
30
 
7.2%
30
 
7.2%
29
 
6.9%
29
 
6.9%
29
 
6.9%
29
 
6.9%
26
 
6.2%
16
 
3.8%
Other values (32) 81
19.3%
Common
ValueCountFrequency (%)
137
49.6%
1 31
 
11.2%
2 27
 
9.8%
16
 
5.8%
3 10
 
3.6%
0 10
 
3.6%
7 8
 
2.9%
5 8
 
2.9%
4 8
 
2.9%
- 7
 
2.5%
Other values (5) 14
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 419
60.3%
ASCII 260
37.4%
None 16
 
2.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
137
52.7%
1 31
 
11.9%
2 27
 
10.4%
3 10
 
3.8%
0 10
 
3.8%
7 8
 
3.1%
5 8
 
3.1%
4 8
 
3.1%
- 7
 
2.7%
9 5
 
1.9%
Other values (4) 9
 
3.5%
Hangul
ValueCountFrequency (%)
60
14.3%
60
14.3%
30
 
7.2%
30
 
7.2%
29
 
6.9%
29
 
6.9%
29
 
6.9%
29
 
6.9%
26
 
6.2%
16
 
3.8%
Other values (32) 81
19.3%
None
ValueCountFrequency (%)
16
100.0%

이용료구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
유료
27 
무료
 
2

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유료
2nd row유료
3rd row유료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 27
93.1%
무료 2
 
6.9%

Length

2023-12-12T19:34:36.980402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:34:37.119842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 27
93.1%
무료 2
 
6.9%

전화번호
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)44.8%
Missing0
Missing (%)0.0%
Memory size364.0 B
개인번호 포함
17 
043-838-0636
 
1
043-836-2233
 
1
043-836-0801
 
1
043-838-1079
 
1
Other values (8)

Length

Max length12
Median length7
Mean length9.0689655
Min length7

Unique

Unique12 ?
Unique (%)41.4%

Sample

1st row개인번호 포함
2nd row개인번호 포함
3rd row개인번호 포함
4th row개인번호 포함
5th row개인번호 포함

Common Values

ValueCountFrequency (%)
개인번호 포함 17
58.6%
043-838-0636 1
 
3.4%
043-836-2233 1
 
3.4%
043-836-0801 1
 
3.4%
043-838-1079 1
 
3.4%
043-838-1906 1
 
3.4%
043-836-1332 1
 
3.4%
043-838-0410 1
 
3.4%
043-838-4194 1
 
3.4%
043-838-4700 1
 
3.4%
Other values (3) 3
 
10.3%

Length

2023-12-12T19:34:37.283822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
개인번호 17
37.0%
포함 17
37.0%
043-838-0636 1
 
2.2%
043-836-2233 1
 
2.2%
043-836-0801 1
 
2.2%
043-838-1079 1
 
2.2%
043-838-1906 1
 
2.2%
043-836-1332 1
 
2.2%
043-838-0410 1
 
2.2%
043-838-4194 1
 
2.2%
Other values (4) 4
 
8.7%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-08-14
29 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-14
2nd row2023-08-14
3rd row2023-08-14
4th row2023-08-14
5th row2023-08-14

Common Values

ValueCountFrequency (%)
2023-08-14 29
100.0%

Length

2023-12-12T19:34:37.433951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:34:37.581158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-14 29
100.0%

Interactions

2023-12-12T19:34:34.404257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:34:37.673368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업체명소재지도로명주소이용료구분전화번호
연번1.0001.0001.0000.0000.447
사업체명1.0001.0001.0001.0001.000
소재지도로명주소1.0001.0001.0001.0001.000
이용료구분0.0001.0001.0001.0001.000
전화번호0.4471.0001.0001.0001.000
2023-12-12T19:34:37.822422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전화번호이용료구분
전화번호1.0000.770
이용료구분0.7701.000
2023-12-12T19:34:37.969233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번이용료구분전화번호
연번1.0000.0000.214
이용료구분0.0001.0000.770
전화번호0.2140.7701.000

Missing values

2023-12-12T19:34:34.582397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:34:34.759938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업체명소재지도로명주소이용료구분전화번호데이터기준일
01에이플러스(A+)충청북도 증평군 증평읍 중앙로8길 22유료개인번호 포함2023-08-14
12증평인력충청북도 증평군 증평읍 충청대로 1756, 1001동 105호 (예다인)유료개인번호 포함2023-08-14
23탑인력사무소충청북도 증평군 증평읍 초중10길 12유료개인번호 포함2023-08-14
34개미인력충청북도 증평군 증평읍 초중6길 47유료개인번호 포함2023-08-14
45태산인력충청북도 증평군 증평읍 초중로 37, 201호유료개인번호 포함2023-08-14
56대성인력소개사업소충청북도 증평군 증평읍 중앙로5길 3-2유료개인번호 포함2023-08-14
67다인인력충청북도 증평군 증평읍 역전로 42, 2층유료개인번호 포함2023-08-14
78증평삼보인력충청북도 증평군 증평읍 창신로 44, 2층유료개인번호 포함2023-08-14
89나눔인력충청북도 증평군 증평읍 중앙로6길 31-1유료개인번호 포함2023-08-14
910서창인력사무소충청북도 증평군 증평읍 초중2길 25-17, 1층유료개인번호 포함2023-08-14
연번사업체명소재지도로명주소이용료구분전화번호데이터기준일
1920증평건설인력충청북도 증평군 증평읍 광장로 207, 1층유료043-836-13322023-08-14
2021힘찬종합인력사무소충청북도 증평군 증평읍 역전로 20, 1층유료개인번호 포함2023-08-14
2122마스터인력충청북도 증평군 증평읍 중앙로 144, 2층유료043-838-04102023-08-14
2223다은건설인력충청북도 증평군 증평읍 중앙로 110, 2층유료개인번호 포함2023-08-14
2324증평군일자리종합지원센터충청북도 증평군 증평읍 인삼로 29무료043-838-41942023-08-14
2425바이오종합인력충청북도 증평군 증평읍 광장로 89, 증평시외버스터미널 2층유료043-838-47002023-08-14
2526삼일건설인력소개소충청북도 증평군 증평읍 역전로 34유료043-836-00312023-08-14
2627해냄인력충청북도 증평군 증평읍 중앙로 183유료043-838-84262023-08-14
2728365일 인력충청북도 증평군 증평읍 중앙로 201-3, 3층유료개인번호 포함2023-08-14
2829중앙인력직업소개소충청북도 증평군 증평읍 중앙로 251-2유료043-838-77452023-08-14