Overview

Dataset statistics

Number of variables5
Number of observations77
Missing cells22
Missing cells (%)5.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory42.7 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description세종특별자치시 직업소개소 현황을 제공합니다.데이터는 상호, 주소, 전화번호로 구성되어 있습니다.세종믿음인력,강남인력개발등의 데이터가 있습니다.
Author세종특별자치시
URLhttps://www.data.go.kr/data/15028251/fileData.do

Alerts

등록기준일 has constant value ""Constant
사업소전화번호 has 22 (28.6%) missing valuesMissing
순번 has unique valuesUnique
사업소명 has unique valuesUnique

Reproduction

Analysis started2024-03-15 02:24:38.619570
Analysis finished2024-03-15 02:24:40.406979
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct77
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39
Minimum1
Maximum77
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size821.0 B
2024-03-15T11:24:40.586168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.8
Q120
median39
Q358
95-th percentile73.2
Maximum77
Range76
Interquartile range (IQR)38

Descriptive statistics

Standard deviation22.371857
Coefficient of variation (CV)0.57363737
Kurtosis-1.2
Mean39
Median Absolute Deviation (MAD)19
Skewness0
Sum3003
Variance500.5
MonotonicityStrictly increasing
2024-03-15T11:24:41.154447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.3%
50 1
 
1.3%
57 1
 
1.3%
56 1
 
1.3%
55 1
 
1.3%
54 1
 
1.3%
53 1
 
1.3%
52 1
 
1.3%
51 1
 
1.3%
49 1
 
1.3%
Other values (67) 67
87.0%
ValueCountFrequency (%)
1 1
1.3%
2 1
1.3%
3 1
1.3%
4 1
1.3%
5 1
1.3%
6 1
1.3%
7 1
1.3%
8 1
1.3%
9 1
1.3%
10 1
1.3%
ValueCountFrequency (%)
77 1
1.3%
76 1
1.3%
75 1
1.3%
74 1
1.3%
73 1
1.3%
72 1
1.3%
71 1
1.3%
70 1
1.3%
69 1
1.3%
68 1
1.3%

사업소명
Text

UNIQUE 

Distinct77
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size744.0 B
2024-03-15T11:24:42.108626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length7.5194805
Min length2

Characters and Unicode

Total characters579
Distinct characters136
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)100.0%

Sample

1st row세종믿음인력
2nd row금호인력
3rd row이건인력
4th row진흥인력투자개발
5th row(주)대부종합개발
ValueCountFrequency (%)
주식회사 4
 
4.2%
사단법인 3
 
3.2%
직업소개소 2
 
2.1%
세종인력 2
 
2.1%
뿌리건설조경 1
 
1.1%
케이인력공사 1
 
1.1%
주)대덕인테크 1
 
1.1%
호성직업소개소 1
 
1.1%
사회서비스원 1
 
1.1%
세종특별자치시 1
 
1.1%
Other values (78) 78
82.1%
2024-03-15T11:24:43.383185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
8.8%
41
 
7.1%
27
 
4.7%
26
 
4.5%
23
 
4.0%
22
 
3.8%
19
 
3.3%
18
 
3.1%
15
 
2.6%
14
 
2.4%
Other values (126) 323
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 544
94.0%
Space Separator 18
 
3.1%
Open Punctuation 7
 
1.2%
Close Punctuation 7
 
1.2%
Decimal Number 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
9.4%
41
 
7.5%
27
 
5.0%
26
 
4.8%
23
 
4.2%
22
 
4.0%
19
 
3.5%
15
 
2.8%
14
 
2.6%
14
 
2.6%
Other values (121) 292
53.7%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
9 1
33.3%
Space Separator
ValueCountFrequency (%)
18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 544
94.0%
Common 35
 
6.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
9.4%
41
 
7.5%
27
 
5.0%
26
 
4.8%
23
 
4.2%
22
 
4.0%
19
 
3.5%
15
 
2.8%
14
 
2.6%
14
 
2.6%
Other values (121) 292
53.7%
Common
ValueCountFrequency (%)
18
51.4%
( 7
 
20.0%
) 7
 
20.0%
1 2
 
5.7%
9 1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 544
94.0%
ASCII 35
 
6.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
51
 
9.4%
41
 
7.5%
27
 
5.0%
26
 
4.8%
23
 
4.2%
22
 
4.0%
19
 
3.5%
15
 
2.8%
14
 
2.6%
14
 
2.6%
Other values (121) 292
53.7%
ASCII
ValueCountFrequency (%)
18
51.4%
( 7
 
20.0%
) 7
 
20.0%
1 2
 
5.7%
9 1
 
2.9%
Distinct75
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size744.0 B
2024-03-15T11:24:44.423691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length19.428571
Min length14

Characters and Unicode

Total characters1496
Distinct characters84
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)94.8%

Sample

1st row세종특별자치시 어진동 671번지
2nd row세종특별자치시 금남면 용포리 128-5
3rd row세종특별자치시 연동면 예양리 111-1
4th row세종특별자치시 조치원읍 죽림리 379번지
5th row세종특별자치시 조치원읍 정리 102-16번지
ValueCountFrequency (%)
세종특별자치시 77
26.7%
조치원읍 32
 
11.1%
금남면 10
 
3.5%
남리 9
 
3.1%
용포리 9
 
3.1%
나성동 6
 
2.1%
죽림리 5
 
1.7%
전의면 4
 
1.4%
읍내리 4
 
1.4%
연서면 4
 
1.4%
Other values (106) 128
44.4%
2024-03-15T11:24:45.815234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
211
 
14.1%
109
 
7.3%
78
 
5.2%
77
 
5.1%
77
 
5.1%
77
 
5.1%
77
 
5.1%
77
 
5.1%
54
 
3.6%
1 43
 
2.9%
Other values (74) 616
41.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1020
68.2%
Decimal Number 239
 
16.0%
Space Separator 211
 
14.1%
Dash Punctuation 26
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
 
10.7%
78
 
7.6%
77
 
7.5%
77
 
7.5%
77
 
7.5%
77
 
7.5%
77
 
7.5%
54
 
5.3%
36
 
3.5%
33
 
3.2%
Other values (62) 325
31.9%
Decimal Number
ValueCountFrequency (%)
1 43
18.0%
2 35
14.6%
7 30
12.6%
5 28
11.7%
4 19
7.9%
9 19
7.9%
6 18
7.5%
3 17
 
7.1%
0 17
 
7.1%
8 13
 
5.4%
Space Separator
ValueCountFrequency (%)
211
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1020
68.2%
Common 476
31.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
 
10.7%
78
 
7.6%
77
 
7.5%
77
 
7.5%
77
 
7.5%
77
 
7.5%
77
 
7.5%
54
 
5.3%
36
 
3.5%
33
 
3.2%
Other values (62) 325
31.9%
Common
ValueCountFrequency (%)
211
44.3%
1 43
 
9.0%
2 35
 
7.4%
7 30
 
6.3%
5 28
 
5.9%
- 26
 
5.5%
4 19
 
4.0%
9 19
 
4.0%
6 18
 
3.8%
3 17
 
3.6%
Other values (2) 30
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1020
68.2%
ASCII 476
31.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
211
44.3%
1 43
 
9.0%
2 35
 
7.4%
7 30
 
6.3%
5 28
 
5.9%
- 26
 
5.5%
4 19
 
4.0%
9 19
 
4.0%
6 18
 
3.8%
3 17
 
3.6%
Other values (2) 30
 
6.3%
Hangul
ValueCountFrequency (%)
109
 
10.7%
78
 
7.6%
77
 
7.5%
77
 
7.5%
77
 
7.5%
77
 
7.5%
77
 
7.5%
54
 
5.3%
36
 
3.5%
33
 
3.2%
Other values (62) 325
31.9%

사업소전화번호
Text

MISSING 

Distinct54
Distinct (%)98.2%
Missing22
Missing (%)28.6%
Memory size744.0 B
2024-03-15T11:24:46.627521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.981818
Min length9

Characters and Unicode

Total characters659
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)96.4%

Sample

1st row044-866-5613
2nd row044-868-4585
3rd row042-822-5964
4th row044-867-9565
5th row041-867-8900
ValueCountFrequency (%)
044-863-2211 2
 
3.5%
044-868-8009 1
 
1.8%
044-866-5613 1
 
1.8%
044-863-3080 1
 
1.8%
044-867-6661 1
 
1.8%
044-866-2824 1
 
1.8%
044-862-9987 1
 
1.8%
044-863-2379 1
 
1.8%
044-865-2801 1
 
1.8%
044-862-1677 1
 
1.8%
Other values (46) 46
80.7%
2024-03-15T11:24:47.733640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 130
19.7%
- 107
16.2%
0 81
12.3%
6 80
12.1%
8 75
11.4%
1 40
 
6.1%
2 38
 
5.8%
5 31
 
4.7%
3 28
 
4.2%
9 24
 
3.6%
Other values (2) 25
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 550
83.5%
Dash Punctuation 107
 
16.2%
Space Separator 2
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 130
23.6%
0 81
14.7%
6 80
14.5%
8 75
13.6%
1 40
 
7.3%
2 38
 
6.9%
5 31
 
5.6%
3 28
 
5.1%
9 24
 
4.4%
7 23
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 107
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 659
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 130
19.7%
- 107
16.2%
0 81
12.3%
6 80
12.1%
8 75
11.4%
1 40
 
6.1%
2 38
 
5.8%
5 31
 
4.7%
3 28
 
4.2%
9 24
 
3.6%
Other values (2) 25
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 659
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 130
19.7%
- 107
16.2%
0 81
12.3%
6 80
12.1%
8 75
11.4%
1 40
 
6.1%
2 38
 
5.8%
5 31
 
4.7%
3 28
 
4.2%
9 24
 
3.6%
Other values (2) 25
 
3.8%

등록기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size744.0 B
2024-01-31
77 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-01-31
2nd row2024-01-31
3rd row2024-01-31
4th row2024-01-31
5th row2024-01-31

Common Values

ValueCountFrequency (%)
2024-01-31 77
100.0%

Length

2024-03-15T11:24:48.068647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:24:48.235706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-01-31 77
100.0%

Interactions

2024-03-15T11:24:39.473824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T11:24:48.344813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업소명사업소주소사업소전화번호
순번1.0001.0001.0000.930
사업소명1.0001.0001.0001.000
사업소주소1.0001.0001.0000.997
사업소전화번호0.9301.0000.9971.000

Missing values

2024-03-15T11:24:39.950878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T11:24:40.283722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업소명사업소주소사업소전화번호등록기준일
01세종믿음인력세종특별자치시 어진동 671번지044-866-56132024-01-31
12금호인력세종특별자치시 금남면 용포리 128-5044-868-45852024-01-31
23이건인력세종특별자치시 연동면 예양리 111-1042-822-59642024-01-31
34진흥인력투자개발세종특별자치시 조치원읍 죽림리 379번지044-867-95652024-01-31
45(주)대부종합개발세종특별자치시 조치원읍 정리 102-16번지041-867-89002024-01-31
56공사일구직업소개소세종특별자치시 조치원읍 원리 1번지044-862-04192024-01-31
67장수인력개발세종특별자치시 조치원읍 남리 55-2044-866-20012024-01-31
78금남직업소개소세종특별자치시 금남면 용포리 80번지044-866-95502024-01-31
89대신인력직업소개소세종특별자치시 조치원읍 죽림리 254-1044-868-04042024-01-31
910부강인력세종특별자치시 부강면 노호리 442-11044-272-11172024-01-31
순번사업소명사업소주소사업소전화번호등록기준일
6768지오씨세종특별자치시 나성동 758<NA>2024-01-31
6869한우리세종특별자치시 나성동 730<NA>2024-01-31
6970(사)세종벤처기업협회세종특별자치시 조치원읍 군청로 93044-862-22072024-01-31
7071인건축용역세종특별자치시 조치원읍 침산리 256-5<NA>2024-01-31
7172전국구 직업소개소세종특별자치시 연서면 월하리 651-2<NA>2024-01-31
7273디케이인력세종특별자치시 조치원읍 정리 35-2<NA>2024-01-31
7374에이스 직업소개소세종특별자치시 연서면 월하리 651-2<NA>2024-01-31
7475주식회사 도움세종특별자치시 어진동 670<NA>2024-01-31
7576대한간호간병인협회세종특별자치시 나성동 790<NA>2024-01-31
7677미래보장직업소개소세종특별자치시 조치원읍 신안리 244-1<NA>2024-01-31