Overview

Dataset statistics

Number of variables3
Number of observations88
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory25.5 B

Variable types

Categorical2
Text1

Dataset

Description사립학교교직원연금공단 제휴복지 서비스 현황과 관련된 데이터로 카테고리별 세부항목, 제휴업체 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15045873/fileData.do

Alerts

카테고리 is highly overall correlated with 세부항목High correlation
세부항목 is highly overall correlated with 카테고리High correlation
제휴업체 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:03:19.784682
Analysis finished2023-12-12 10:03:20.102866
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

카테고리
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size836.0 B
호텔 및 리조트
41 
의료
16 
법률상담
복지몰
 
4
여행 및 레저
 
4
Other values (6)
17 

Length

Max length8
Median length7
Mean length5.3863636
Min length2

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row복지몰
2nd row복지몰
3rd row복지몰
4th row복지몰
5th row호텔 및 리조트

Common Values

ValueCountFrequency (%)
호텔 및 리조트 41
46.6%
의료 16
 
18.2%
법률상담 6
 
6.8%
복지몰 4
 
4.5%
여행 및 레저 4
 
4.5%
항공, 여행 4
 
4.5%
장례 4
 
4.5%
생활 4
 
4.5%
상조 2
 
2.3%
금융 2
 
2.3%

Length

2023-12-12T19:03:20.195776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
45
24.7%
호텔 41
22.5%
리조트 41
22.5%
의료 16
 
8.8%
여행 8
 
4.4%
법률상담 6
 
3.3%
복지몰 4
 
2.2%
레저 4
 
2.2%
항공 4
 
2.2%
장례 4
 
2.2%
Other values (4) 9
 
4.9%

세부항목
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size836.0 B
호텔
27 
리조트
14 
일반진료(건강검진)
법률상담
생활서비스
Other values (13)
29 

Length

Max length10
Median length5
Mean length3.4090909
Min length2

Unique

Unique5 ?
Unique (%)5.7%

Sample

1st row종합복지몰
2nd row여행복지몰
3rd row교육복지몰
4th row의료복지몰
5th row호텔

Common Values

ValueCountFrequency (%)
호텔 27
30.7%
리조트 14
15.9%
일반진료(건강검진) 7
 
8.0%
법률상담 6
 
6.8%
생활서비스 5
 
5.7%
치과 5
 
5.7%
안과 4
 
4.5%
장례식장 4
 
4.5%
레저 3
 
3.4%
상조 2
 
2.3%
Other values (8) 11
12.5%

Length

2023-12-12T19:03:20.354596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
호텔 27
30.7%
리조트 14
15.9%
일반진료(건강검진 7
 
8.0%
법률상담 6
 
6.8%
생활서비스 5
 
5.7%
치과 5
 
5.7%
안과 4
 
4.5%
장례식장 4
 
4.5%
레저 3
 
3.4%
렌터카 2
 
2.3%
Other values (8) 11
12.5%

제휴업체
Text

UNIQUE 

Distinct88
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size836.0 B
2023-12-12T19:03:20.652703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length15.5
Mean length8.625
Min length2

Characters and Unicode

Total characters759
Distinct characters224
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)100.0%

Sample

1st row㈜현대이지웰
2nd row㈜플러스앤
3rd row㈜에듀윌
4th row세이프닥
5th row오색그린야드 호텔
ValueCountFrequency (%)
호텔 6
 
4.5%
라마다 3
 
2.2%
부산 3
 
2.2%
장례식장 3
 
2.2%
제주 2
 
1.5%
코모도 2
 
1.5%
리조트 2
 
1.5%
엔케이세종병원 1
 
0.7%
이크루즈 1
 
0.7%
쏘카 1
 
0.7%
Other values (110) 110
82.1%
2023-12-12T19:03:21.090167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46
 
6.1%
32
 
4.2%
25
 
3.3%
( 20
 
2.6%
) 20
 
2.6%
18
 
2.4%
17
 
2.2%
17
 
2.2%
15
 
2.0%
13
 
1.7%
Other values (214) 536
70.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 626
82.5%
Space Separator 46
 
6.1%
Uppercase Letter 23
 
3.0%
Open Punctuation 20
 
2.6%
Close Punctuation 20
 
2.6%
Other Punctuation 12
 
1.6%
Other Symbol 4
 
0.5%
Lowercase Letter 4
 
0.5%
Decimal Number 2
 
0.3%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.1%
25
 
4.0%
18
 
2.9%
17
 
2.7%
17
 
2.7%
15
 
2.4%
13
 
2.1%
13
 
2.1%
12
 
1.9%
12
 
1.9%
Other values (188) 452
72.2%
Uppercase Letter
ValueCountFrequency (%)
K 4
17.4%
T 3
13.0%
E 2
8.7%
S 2
8.7%
F 2
8.7%
I 2
8.7%
A 2
8.7%
H 1
 
4.3%
N 1
 
4.3%
C 1
 
4.3%
Other values (3) 3
13.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
50.0%
s 1
25.0%
h 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 10
83.3%
& 2
 
16.7%
Decimal Number
ValueCountFrequency (%)
4 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 630
83.0%
Common 102
 
13.4%
Latin 27
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.1%
25
 
4.0%
18
 
2.9%
17
 
2.7%
17
 
2.7%
15
 
2.4%
13
 
2.1%
13
 
2.1%
12
 
1.9%
12
 
1.9%
Other values (189) 456
72.4%
Latin
ValueCountFrequency (%)
K 4
14.8%
T 3
11.1%
E 2
 
7.4%
S 2
 
7.4%
F 2
 
7.4%
I 2
 
7.4%
A 2
 
7.4%
e 2
 
7.4%
H 1
 
3.7%
N 1
 
3.7%
Other values (6) 6
22.2%
Common
ValueCountFrequency (%)
46
45.1%
( 20
19.6%
) 20
19.6%
, 10
 
9.8%
& 2
 
2.0%
+ 1
 
1.0%
4 1
 
1.0%
2 1
 
1.0%
- 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 626
82.5%
ASCII 129
 
17.0%
None 4
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46
35.7%
( 20
15.5%
) 20
15.5%
, 10
 
7.8%
K 4
 
3.1%
T 3
 
2.3%
E 2
 
1.6%
& 2
 
1.6%
S 2
 
1.6%
F 2
 
1.6%
Other values (15) 18
 
14.0%
Hangul
ValueCountFrequency (%)
32
 
5.1%
25
 
4.0%
18
 
2.9%
17
 
2.7%
17
 
2.7%
15
 
2.4%
13
 
2.1%
13
 
2.1%
12
 
1.9%
12
 
1.9%
Other values (188) 452
72.2%
None
ValueCountFrequency (%)
4
100.0%

Correlations

2023-12-12T19:03:21.200012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리세부항목제휴업체
카테고리1.0000.9881.000
세부항목0.9881.0001.000
제휴업체1.0001.0001.000
2023-12-12T19:03:21.293793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세부항목카테고리
세부항목1.0000.892
카테고리0.8921.000
2023-12-12T19:03:21.374271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리세부항목
카테고리1.0000.892
세부항목0.8921.000

Missing values

2023-12-12T19:03:19.983807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:03:20.065467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

카테고리세부항목제휴업체
0복지몰종합복지몰㈜현대이지웰
1복지몰여행복지몰㈜플러스앤
2복지몰교육복지몰㈜에듀윌
3복지몰의료복지몰세이프닥
4호텔 및 리조트호텔오색그린야드 호텔
5호텔 및 리조트호텔삼성거제호텔
6호텔 및 리조트호텔그랜드조선호텔
7호텔 및 리조트호텔부산웨스틴조선호텔
8호텔 및 리조트호텔신라스테이 서부산
9호텔 및 리조트호텔코모도 호텔(부산)
카테고리세부항목제휴업체
78생활생활서비스넥센타이어
79생활생활서비스나비트(KT)
80생활생활서비스나텔레콤(SKT)
81생활생활서비스가연
82법률상담법률상담최민령 변호사(서울)
83법률상담법률상담최중섭 변호사(경인,강원)
84법률상담법률상담김형배 변호사(대전,충청)
85법률상담법률상담김정현 변호사(광주,호남,제주)
86법률상담법률상담김섭 변호사(대구,경북)
87법률상담법률상담조성제 변호사(부산,울산,경남)