Overview

Dataset statistics

Number of variables7
Number of observations29
Missing cells22
Missing cells (%)10.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory61.6 B

Variable types

Numeric1
Text4
Categorical1
DateTime1

Dataset

Description이 데이터는 2023년 4월 19일 기준으로 전라북도 남원시 소재의 직업소개소 등록 현황에 대하여 업체명, 주소, 전화번호, 팩스번호, 유료 및 무료 정보 등에 대한 데이터입니다.
URLhttps://www.data.go.kr/data/15081476/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
연번 is highly overall correlated with 유료 및 무료구분High correlation
유료 및 무료구분 is highly overall correlated with 연번High correlation
유료 및 무료구분 is highly imbalanced (63.8%)Imbalance
전화번호 has 14 (48.3%) missing valuesMissing
팩스번호 has 8 (27.6%) missing valuesMissing
연번 has unique valuesUnique
업체명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:04:03.976582
Analysis finished2023-12-12 19:04:04.743285
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T04:04:04.830431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.4
Q18
median15
Q322
95-th percentile27.6
Maximum29
Range28
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.5146932
Coefficient of variation (CV)0.56764621
Kurtosis-1.2
Mean15
Median Absolute Deviation (MAD)7
Skewness0
Sum435
Variance72.5
MonotonicityStrictly increasing
2023-12-13T04:04:05.004670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 1
 
3.4%
2 1
 
3.4%
29 1
 
3.4%
28 1
 
3.4%
27 1
 
3.4%
26 1
 
3.4%
25 1
 
3.4%
24 1
 
3.4%
23 1
 
3.4%
22 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
1 1
3.4%
2 1
3.4%
3 1
3.4%
4 1
3.4%
5 1
3.4%
6 1
3.4%
7 1
3.4%
8 1
3.4%
9 1
3.4%
10 1
3.4%
ValueCountFrequency (%)
29 1
3.4%
28 1
3.4%
27 1
3.4%
26 1
3.4%
25 1
3.4%
24 1
3.4%
23 1
3.4%
22 1
3.4%
21 1
3.4%
20 1
3.4%

업체명
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-13T04:04:05.319598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length8.8275862
Min length3

Characters and Unicode

Total characters256
Distinct characters73
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row새남원인력 유료직업소개소
2nd row내발로인력 유료직업소개소
3rd row향교인력 유료직업소개소
4th row서남인력 유료직업소개소
5th row남원 개미인력 유료직업소개소
ValueCountFrequency (%)
유료직업소개소 13
30.2%
새남원인력 1
 
2.3%
팜워크 1
 
2.3%
남원시장애인종합복지관직업재활팀 1
 
2.3%
대박인력 1
 
2.3%
만인인력 1
 
2.3%
남원시스템 1
 
2.3%
황소인력 1
 
2.3%
건우인력 1
 
2.3%
보람인력 1
 
2.3%
Other values (21) 21
48.8%
2023-12-13T04:04:05.752026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
10.5%
26
 
10.2%
24
 
9.4%
14
 
5.5%
14
 
5.5%
14
 
5.5%
14
 
5.5%
13
 
5.1%
13
 
5.1%
10
 
3.9%
Other values (63) 87
34.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 242
94.5%
Space Separator 14
 
5.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
11.2%
26
 
10.7%
24
 
9.9%
14
 
5.8%
14
 
5.8%
14
 
5.8%
13
 
5.4%
13
 
5.4%
10
 
4.1%
10
 
4.1%
Other values (62) 77
31.8%
Space Separator
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 242
94.5%
Common 14
 
5.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
11.2%
26
 
10.7%
24
 
9.9%
14
 
5.8%
14
 
5.8%
14
 
5.8%
13
 
5.4%
13
 
5.4%
10
 
4.1%
10
 
4.1%
Other values (62) 77
31.8%
Common
ValueCountFrequency (%)
14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 242
94.5%
ASCII 14
 
5.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
 
11.2%
26
 
10.7%
24
 
9.9%
14
 
5.8%
14
 
5.8%
14
 
5.8%
13
 
5.4%
13
 
5.4%
10
 
4.1%
10
 
4.1%
Other values (62) 77
31.8%
ASCII
ValueCountFrequency (%)
14
100.0%

주소
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-13T04:04:05.998657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length17.137931
Min length15

Characters and Unicode

Total characters497
Distinct characters59
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row전라북도 남원시 의총로 51 공설시장 7동
2nd row전라북도 남원시 충정로 138
3rd row전라북도 남원시 향단로 106
4th row전라북도 남원시 동림로 18
5th row전라북도 남원시 의총로 111
ValueCountFrequency (%)
전라북도 29
24.0%
남원시 29
24.0%
의총로 5
 
4.1%
동림로 4
 
3.3%
광한북로 4
 
3.3%
충정로 3
 
2.5%
138 2
 
1.7%
운봉읍 2
 
1.7%
운봉로 2
 
1.7%
동문로 2
 
1.7%
Other values (39) 39
32.2%
2023-12-13T04:04:06.395985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98
19.7%
33
 
6.6%
30
 
6.0%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
1 28
 
5.6%
21
 
4.2%
Other values (49) 142
28.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 314
63.2%
Space Separator 98
 
19.7%
Decimal Number 79
 
15.9%
Dash Punctuation 6
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
10.5%
30
9.6%
29
9.2%
29
9.2%
29
9.2%
29
9.2%
29
9.2%
21
 
6.7%
8
 
2.5%
7
 
2.2%
Other values (37) 70
22.3%
Decimal Number
ValueCountFrequency (%)
1 28
35.4%
3 12
15.2%
8 7
 
8.9%
6 6
 
7.6%
4 6
 
7.6%
7 5
 
6.3%
9 4
 
5.1%
2 4
 
5.1%
5 4
 
5.1%
0 3
 
3.8%
Space Separator
ValueCountFrequency (%)
98
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 314
63.2%
Common 183
36.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
10.5%
30
9.6%
29
9.2%
29
9.2%
29
9.2%
29
9.2%
29
9.2%
21
 
6.7%
8
 
2.5%
7
 
2.2%
Other values (37) 70
22.3%
Common
ValueCountFrequency (%)
98
53.6%
1 28
 
15.3%
3 12
 
6.6%
8 7
 
3.8%
- 6
 
3.3%
6 6
 
3.3%
4 6
 
3.3%
7 5
 
2.7%
9 4
 
2.2%
2 4
 
2.2%
Other values (2) 7
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 314
63.2%
ASCII 183
36.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
98
53.6%
1 28
 
15.3%
3 12
 
6.6%
8 7
 
3.8%
- 6
 
3.3%
6 6
 
3.3%
4 6
 
3.3%
7 5
 
2.7%
9 4
 
2.2%
2 4
 
2.2%
Other values (2) 7
 
3.8%
Hangul
ValueCountFrequency (%)
33
10.5%
30
9.6%
29
9.2%
29
9.2%
29
9.2%
29
9.2%
29
9.2%
21
 
6.7%
8
 
2.5%
7
 
2.2%
Other values (37) 70
22.3%

전화번호
Text

MISSING 

Distinct15
Distinct (%)100.0%
Missing14
Missing (%)48.3%
Memory size364.0 B
2023-12-13T04:04:06.599765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)100.0%

Sample

1st row063-636-8680
2nd row063-636-0555
3rd row063-636-8848
4th row063-625-0900
5th row063-636-9680
ValueCountFrequency (%)
063-636-8680 1
 
6.7%
063-636-0555 1
 
6.7%
063-636-8848 1
 
6.7%
063-625-0900 1
 
6.7%
063-636-9680 1
 
6.7%
063-633-1010 1
 
6.7%
063-635-0712 1
 
6.7%
063-634-5959 1
 
6.7%
063-625-2300 1
 
6.7%
063-636-1253 1
 
6.7%
Other values (5) 5
33.3%
2023-12-13T04:04:06.918888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 39
21.7%
3 32
17.8%
- 30
16.7%
0 29
16.1%
5 12
 
6.7%
1 9
 
5.0%
2 8
 
4.4%
4 7
 
3.9%
9 7
 
3.9%
8 6
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 150
83.3%
Dash Punctuation 30
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 39
26.0%
3 32
21.3%
0 29
19.3%
5 12
 
8.0%
1 9
 
6.0%
2 8
 
5.3%
4 7
 
4.7%
9 7
 
4.7%
8 6
 
4.0%
7 1
 
0.7%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 39
21.7%
3 32
17.8%
- 30
16.7%
0 29
16.1%
5 12
 
6.7%
1 9
 
5.0%
2 8
 
4.4%
4 7
 
3.9%
9 7
 
3.9%
8 6
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 39
21.7%
3 32
17.8%
- 30
16.7%
0 29
16.1%
5 12
 
6.7%
1 9
 
5.0%
2 8
 
4.4%
4 7
 
3.9%
9 7
 
3.9%
8 6
 
3.3%

팩스번호
Text

MISSING 

Distinct21
Distinct (%)100.0%
Missing8
Missing (%)27.6%
Memory size364.0 B
2023-12-13T04:04:07.125356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.047619
Min length12

Characters and Unicode

Total characters253
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row063-625-7575
2nd row063-636-9681
3rd row063-636-9135
4th row063-625-7228
5th row063-625-1296
ValueCountFrequency (%)
063-625-7575 1
 
4.8%
063-636-1605 1
 
4.8%
063-636-9681 1
 
4.8%
063-636-1992 1
 
4.8%
063-632-1252 1
 
4.8%
063-625-2300 1
 
4.8%
063-634-5959 1
 
4.8%
063-633-1033 1
 
4.8%
063-625-5557 1
 
4.8%
050-4316-1428 1
 
4.8%
Other values (11) 11
52.4%
2023-12-13T04:04:07.491416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 54
21.3%
3 44
17.4%
- 42
16.6%
0 29
11.5%
2 23
9.1%
5 23
9.1%
1 11
 
4.3%
4 10
 
4.0%
9 8
 
3.2%
8 5
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 211
83.4%
Dash Punctuation 42
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 54
25.6%
3 44
20.9%
0 29
13.7%
2 23
10.9%
5 23
10.9%
1 11
 
5.2%
4 10
 
4.7%
9 8
 
3.8%
8 5
 
2.4%
7 4
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 253
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 54
21.3%
3 44
17.4%
- 42
16.6%
0 29
11.5%
2 23
9.1%
5 23
9.1%
1 11
 
4.3%
4 10
 
4.0%
9 8
 
3.2%
8 5
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 253
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 54
21.3%
3 44
17.4%
- 42
16.6%
0 29
11.5%
2 23
9.1%
5 23
9.1%
1 11
 
4.3%
4 10
 
4.0%
9 8
 
3.2%
8 5
 
2.0%

유료 및 무료구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
유료
27 
무료
 
2

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유료
2nd row유료
3rd row유료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 27
93.1%
무료 2
 
6.9%

Length

2023-12-13T04:04:07.638869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:04:07.735254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 27
93.1%
무료 2
 
6.9%

데이터 기준일자
Date

CONSTANT 

Distinct1
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size364.0 B
Minimum2023-04-19 00:00:00
Maximum2023-04-19 00:00:00
2023-12-13T04:04:07.815353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:07.931749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T04:04:04.306613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:04:08.024111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명주소전화번호팩스번호유료 및 무료구분
연번1.0001.0001.0001.0001.0000.860
업체명1.0001.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.0001.000
팩스번호1.0001.0001.0001.0001.000NaN
유료 및 무료구분0.8601.0001.0001.000NaN1.000
2023-12-13T04:04:08.142451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번유료 및 무료구분
연번1.0000.577
유료 및 무료구분0.5771.000

Missing values

2023-12-13T04:04:04.443598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:04:04.569585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T04:04:04.677137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업체명주소전화번호팩스번호유료 및 무료구분데이터 기준일자
01새남원인력 유료직업소개소전라북도 남원시 의총로 51 공설시장 7동<NA>063-625-7575유료2023-04-19
12내발로인력 유료직업소개소전라북도 남원시 충정로 138063-636-8680063-636-9681유료2023-04-19
23향교인력 유료직업소개소전라북도 남원시 향단로 106<NA>063-636-9135유료2023-04-19
34서남인력 유료직업소개소전라북도 남원시 동림로 18<NA>063-625-7228유료2023-04-19
45남원 개미인력 유료직업소개소전라북도 남원시 의총로 111<NA>063-625-1296유료2023-04-19
56파랑새인력 유료직업소개소전라북도 남원시 의총로 19<NA>063-632-2323유료2023-04-19
67그린인력 유료직업소개소전라북도 남원시 충정로 116063-636-0555063-636-0556유료2023-04-19
78대우인력 유료직업소개소전라북도 남원시 동문로 36<NA>063-625-2465유료2023-04-19
89남원종합인력 유료직업소개소전라북도 남원시 동림로 14063-636-8848063-634-8831유료2023-04-19
910해보자인력 유료직업소개소전라북도 남원시 동림로 33063-625-0900063-626-0449유료2023-04-19
연번업체명주소전화번호팩스번호유료 및 무료구분데이터 기준일자
1920행복일자리센터전라북도 남원시 운봉읍 운봉로 719063-634-5959063-634-5959유료2023-04-19
2021남원인력시장전라북도 남원시 네마실1길 22-7063-625-2300063-625-2300유료2023-04-19
2122보람인력전라북도 남원시 광한북로 85063-636-1253063-632-1252유료2023-04-19
2223건우인력전라북도 남원시 운봉읍 운봉로 681-1063-634-0135<NA>유료2023-04-19
2324황소인력전라북도 남원시 광한북로 103063-636-1991063-636-1992유료2023-04-19
2425남원시스템전라북도 남원시 밤티재길 14-3<NA><NA>유료2023-04-19
2526만인인력전라북도 남원시 산성길 30063-632-2024063-635-2024유료2023-04-19
2627대박인력전라북도 남원시 네마실3길 1<NA><NA>유료2023-04-19
2728남원시장애인종합복지관직업재활팀전라북도 남원시 이백면 닭뫼안길 87063-635-1544<NA>무료2023-04-19
2829남원시니어클럽전라북도 남원시 광한북로 94-13063-631-6049<NA>무료2023-04-19