Overview

Dataset statistics

Number of variables5
Number of observations2968
Missing cells104
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory121.9 KiB
Average record size in memory42.0 B

Variable types

Numeric2
Categorical1
Text2

Dataset

Description2017년부터 2022년까지 공무원연금콜센터의 상담내역을 포함한 정보입니다. 연도별 상담유형, 상담분류, 상담세분류, 상담건수 등의 데이터가 나열되어 있습니다.
URLhttps://www.data.go.kr/data/15034003/fileData.do

Alerts

상담세분류 has 102 (3.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:17:27.043362
Analysis finished2023-12-12 07:17:28.279344
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.4845
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.2 KiB
2023-12-12T16:17:28.342887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2019
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7042233
Coefficient of variation (CV)0.00084389028
Kurtosis-1.2627628
Mean2019.4845
Median Absolute Deviation (MAD)1
Skewness0.01343565
Sum5993830
Variance2.9043772
MonotonicityIncreasing
2023-12-12T16:17:28.487062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2019 503
16.9%
2018 499
16.8%
2017 497
16.7%
2020 493
16.6%
2021 490
16.5%
2022 486
16.4%
ValueCountFrequency (%)
2017 497
16.7%
2018 499
16.8%
2019 503
16.9%
2020 493
16.6%
2021 490
16.5%
2022 486
16.4%
ValueCountFrequency (%)
2022 486
16.4%
2021 490
16.5%
2020 493
16.6%
2019 503
16.9%
2018 499
16.8%
2017 497
16.7%

상담유형
Categorical

Distinct27
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size23.3 KiB
재직관리
301 
대여학자금
274 
재해보상급여
247 
퇴직급여
225 
주택사업
222 
Other values (22)
1699 

Length

Max length8
Median length7
Mean length5.0363881
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row재직관리
2nd row금융기관알선대출
3rd row민원서류 발급
4th row연금대출
5th row연금수급자 관리

Common Values

ValueCountFrequency (%)
재직관리 301
 
10.1%
대여학자금 274
 
9.2%
재해보상급여 247
 
8.3%
퇴직급여 225
 
7.6%
주택사업 222
 
7.5%
맞춤형복지 182
 
6.1%
연금대출 178
 
6.0%
연금수급자 관리 172
 
5.8%
금융기관알선대출 163
 
5.5%
공적연금연계 124
 
4.2%
Other values (17) 880
29.6%

Length

2023-12-12T16:17:28.638859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
재직관리 301
 
9.1%
대여학자금 274
 
8.2%
재해보상급여 247
 
7.4%
퇴직급여 225
 
6.8%
주택사업 222
 
6.7%
맞춤형복지 182
 
5.5%
연금대출 178
 
5.4%
연금수급자 172
 
5.2%
관리 172
 
5.2%
금융기관알선대출 163
 
4.9%
Other values (19) 1188
35.7%
Distinct121
Distinct (%)4.1%
Missing2
Missing (%)0.1%
Memory size23.3 KiB
2023-12-12T16:17:28.980934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length6.7781524
Min length2

Characters and Unicode

Total characters20104
Distinct characters208
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.2%

Sample

1st row기여금
2nd row융자추천발급확인서
3rd row발급신청
4th row연금대출신청
5th row연금소득 과세제도
ValueCountFrequency (%)
학자금 238
 
5.6%
재직기간 138
 
3.2%
임대주택 120
 
2.8%
상환업무 119
 
2.8%
발급신청 109
 
2.6%
사용자시스템 104
 
2.4%
연금대출신청 94
 
2.2%
대부 90
 
2.1%
인터넷 90
 
2.1%
합산 90
 
2.1%
Other values (154) 3082
72.1%
2023-12-12T16:17:29.536301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1308
 
6.5%
1077
 
5.4%
698
 
3.5%
652
 
3.2%
610
 
3.0%
576
 
2.9%
536
 
2.7%
495
 
2.5%
473
 
2.4%
436
 
2.2%
Other values (198) 13243
65.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18604
92.5%
Space Separator 1308
 
6.5%
Open Punctuation 45
 
0.2%
Close Punctuation 45
 
0.2%
Other Punctuation 36
 
0.2%
Dash Punctuation 30
 
0.1%
Lowercase Letter 18
 
0.1%
Uppercase Letter 12
 
0.1%
Decimal Number 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1077
 
5.8%
698
 
3.8%
652
 
3.5%
610
 
3.3%
576
 
3.1%
536
 
2.9%
495
 
2.7%
473
 
2.5%
436
 
2.3%
385
 
2.1%
Other values (186) 12666
68.1%
Uppercase Letter
ValueCountFrequency (%)
V 4
33.3%
O 4
33.3%
C 4
33.3%
Other Punctuation
ValueCountFrequency (%)
, 26
72.2%
/ 10
 
27.8%
Decimal Number
ValueCountFrequency (%)
1 3
50.0%
2 3
50.0%
Space Separator
ValueCountFrequency (%)
1308
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Lowercase Letter
ValueCountFrequency (%)
g 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18604
92.5%
Common 1470
 
7.3%
Latin 30
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1077
 
5.8%
698
 
3.8%
652
 
3.5%
610
 
3.3%
576
 
3.1%
536
 
2.9%
495
 
2.7%
473
 
2.5%
436
 
2.3%
385
 
2.1%
Other values (186) 12666
68.1%
Common
ValueCountFrequency (%)
1308
89.0%
( 45
 
3.1%
) 45
 
3.1%
- 30
 
2.0%
, 26
 
1.8%
/ 10
 
0.7%
1 3
 
0.2%
2 3
 
0.2%
Latin
ValueCountFrequency (%)
g 18
60.0%
V 4
 
13.3%
O 4
 
13.3%
C 4
 
13.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18604
92.5%
ASCII 1500
 
7.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1308
87.2%
( 45
 
3.0%
) 45
 
3.0%
- 30
 
2.0%
, 26
 
1.7%
g 18
 
1.2%
/ 10
 
0.7%
V 4
 
0.3%
O 4
 
0.3%
C 4
 
0.3%
Other values (2) 6
 
0.4%
Hangul
ValueCountFrequency (%)
1077
 
5.8%
698
 
3.8%
652
 
3.5%
610
 
3.3%
576
 
3.1%
536
 
2.9%
495
 
2.7%
473
 
2.5%
436
 
2.3%
385
 
2.1%
Other values (186) 12666
68.1%

상담세분류
Text

MISSING 

Distinct450
Distinct (%)15.7%
Missing102
Missing (%)3.4%
Memory size23.3 KiB
2023-12-12T16:17:29.893478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length8.7083043
Min length2

Characters and Unicode

Total characters24958
Distinct characters294
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.3%

Sample

1st row소급기여금 일시납부
2nd row유선발급
3rd row급여지급사실확인서
4th row연금대출 가능금액
5th row연금소득 연말정산
ValueCountFrequency (%)
267
 
4.9%
기타 191
 
3.5%
확인 110
 
2.0%
문의 99
 
1.8%
절차 78
 
1.4%
신청절차 58
 
1.1%
대출 55
 
1.0%
구비서류 55
 
1.0%
신청 54
 
1.0%
연금대출 53
 
1.0%
Other values (573) 4441
81.3%
2023-12-12T16:17:30.355200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2595
 
10.4%
845
 
3.4%
587
 
2.4%
488
 
2.0%
488
 
2.0%
467
 
1.9%
448
 
1.8%
447
 
1.8%
444
 
1.8%
438
 
1.8%
Other values (284) 17711
71.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21215
85.0%
Space Separator 2595
 
10.4%
Close Punctuation 410
 
1.6%
Open Punctuation 410
 
1.6%
Other Punctuation 253
 
1.0%
Decimal Number 36
 
0.1%
Uppercase Letter 20
 
0.1%
Dash Punctuation 13
 
0.1%
Lowercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
845
 
4.0%
587
 
2.8%
488
 
2.3%
488
 
2.3%
467
 
2.2%
448
 
2.1%
447
 
2.1%
444
 
2.1%
438
 
2.1%
369
 
1.7%
Other values (267) 16194
76.3%
Decimal Number
ValueCountFrequency (%)
0 11
30.6%
1 10
27.8%
9 6
16.7%
2 4
 
11.1%
3 3
 
8.3%
6 2
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 190
75.1%
/ 46
 
18.2%
& 10
 
4.0%
. 7
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
Q 10
50.0%
A 10
50.0%
Space Separator
ValueCountFrequency (%)
2595
100.0%
Close Punctuation
ValueCountFrequency (%)
) 410
100.0%
Open Punctuation
ValueCountFrequency (%)
( 410
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21215
85.0%
Common 3717
 
14.9%
Latin 26
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
845
 
4.0%
587
 
2.8%
488
 
2.3%
488
 
2.3%
467
 
2.2%
448
 
2.1%
447
 
2.1%
444
 
2.1%
438
 
2.1%
369
 
1.7%
Other values (267) 16194
76.3%
Common
ValueCountFrequency (%)
2595
69.8%
) 410
 
11.0%
( 410
 
11.0%
, 190
 
5.1%
/ 46
 
1.2%
- 13
 
0.3%
0 11
 
0.3%
& 10
 
0.3%
1 10
 
0.3%
. 7
 
0.2%
Other values (4) 15
 
0.4%
Latin
ValueCountFrequency (%)
Q 10
38.5%
A 10
38.5%
e 6
23.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21215
85.0%
ASCII 3743
 
15.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2595
69.3%
) 410
 
11.0%
( 410
 
11.0%
, 190
 
5.1%
/ 46
 
1.2%
- 13
 
0.3%
0 11
 
0.3%
Q 10
 
0.3%
& 10
 
0.3%
A 10
 
0.3%
Other values (7) 38
 
1.0%
Hangul
ValueCountFrequency (%)
845
 
4.0%
587
 
2.8%
488
 
2.3%
488
 
2.3%
467
 
2.2%
448
 
2.1%
447
 
2.1%
444
 
2.1%
438
 
2.1%
369
 
1.7%
Other values (267) 16194
76.3%

상담건수
Real number (ℝ)

Distinct1555
Distinct (%)52.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1722.8935
Minimum1
Maximum40067
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.2 KiB
2023-12-12T16:17:30.519576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q149
median343.5
Q31644.5
95-th percentile8134.45
Maximum40067
Range40066
Interquartile range (IQR)1595.5

Descriptive statistics

Standard deviation3638.3744
Coefficient of variation (CV)2.1117813
Kurtosis27.818389
Mean1722.8935
Median Absolute Deviation (MAD)335.5
Skewness4.5231524
Sum5113548
Variance13237768
MonotonicityNot monotonic
2023-12-12T16:17:30.664119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 108
 
3.6%
2 48
 
1.6%
4 34
 
1.1%
3 32
 
1.1%
5 31
 
1.0%
6 22
 
0.7%
9 20
 
0.7%
7 19
 
0.6%
11 18
 
0.6%
16 17
 
0.6%
Other values (1545) 2619
88.2%
ValueCountFrequency (%)
1 108
3.6%
2 48
1.6%
3 32
 
1.1%
4 34
 
1.1%
5 31
 
1.0%
6 22
 
0.7%
7 19
 
0.6%
8 17
 
0.6%
9 20
 
0.7%
10 15
 
0.5%
ValueCountFrequency (%)
40067 1
< 0.1%
34973 1
< 0.1%
34692 1
< 0.1%
34274 1
< 0.1%
32359 1
< 0.1%
32166 1
< 0.1%
31604 1
< 0.1%
31238 1
< 0.1%
29693 1
< 0.1%
28229 1
< 0.1%

Interactions

2023-12-12T16:17:27.718359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:17:27.460326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:17:27.877781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:17:27.561884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:17:30.761023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도상담유형상담건수
연도1.0000.1090.000
상담유형0.1091.0000.272
상담건수0.0000.2721.000
2023-12-12T16:17:30.851180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도상담건수상담유형
연도1.000-0.0420.044
상담건수-0.0421.0000.101
상담유형0.0440.1011.000

Missing values

2023-12-12T16:17:28.009619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:17:28.114352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T16:17:28.221876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연도상담유형상담분류상담세분류상담건수
02017재직관리기여금소급기여금 일시납부32359
12017금융기관알선대출융자추천발급확인서유선발급26426
22017민원서류 발급발급신청급여지급사실확인서24897
32017연금대출연금대출신청연금대출 가능금액23654
42017연금수급자 관리연금소득 과세제도연금소득 연말정산17644
52017퇴직급여예상퇴직금조회 및 조회방법 문의15819
62017연금수급자 관리일부정지일부정지대상15545
72017대여학자금학자금 상환업무중복수혜자 상환절차13931
82017재직관리기여금휴직중납부13217
92017교육퇴직예정자교육교육과정 및 교육신청방법12835
연도상담유형상담분류상담세분류상담건수
29582022공적연금연계연계퇴직유족연금연계퇴직유족연금산정방법1
29592022금융기관알선대출단기재직자 신용대출단기재직자 대출 이율1
29602022금융기관알선대출생활안정자금대출대출신청 구비서류1
29612022금융기관알선대출연금수급자 신용대출장애연금수급자 대출 여부1
29622022각종민원사이버등록민원업무상 Q&A 관련문의1
29632022재해보상급여순직유족보상금헬프데스크1
29642022재해보상급여요양일시금신청절차 및 구비서류1
29652022주택사업방별임대주택방별임대 임대조건1
29662022주택사업분양주택분양계약 해지1
29672022기타기타인사처 착신문의1