Overview

Dataset statistics

Number of variables4
Number of observations120
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory34.1 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description하남시에서 운영중인 민원상담 챗봇 사용자 현황입니다. 하남챗봇은 시민들이 모바일을 통해 민원 신청 및 행정정보 열람을 가능하게 하여 편의성 향상 민원, 행정, 경제, 문화, 등 총 120개 시나리오로 구성되어 있음
URLhttps://www.data.go.kr/data/15106129/fileData.do

Alerts

연번 is highly overall correlated with 분야High correlation
분야 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:00:53.814508
Analysis finished2023-12-12 16:00:54.296299
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct120
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60.5
Minimum1
Maximum120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T01:00:54.355376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.95
Q130.75
median60.5
Q390.25
95-th percentile114.05
Maximum120
Range119
Interquartile range (IQR)59.5

Descriptive statistics

Standard deviation34.785054
Coefficient of variation (CV)0.57495957
Kurtosis-1.2
Mean60.5
Median Absolute Deviation (MAD)30
Skewness0
Sum7260
Variance1210
MonotonicityStrictly increasing
2023-12-13T01:00:54.463280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
62 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
86 1
 
0.8%
85 1
 
0.8%
84 1
 
0.8%
83 1
 
0.8%
Other values (110) 110
91.7%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
120 1
0.8%
119 1
0.8%
118 1
0.8%
117 1
0.8%
116 1
0.8%
115 1
0.8%
114 1
0.8%
113 1
0.8%
112 1
0.8%
111 1
0.8%

분야
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
복지
27 
행정
22 
교통
12 
코로나19
12 
경제
10 
Other values (7)
37 

Length

Max length5
Median length2
Mean length2.45
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row민원
2nd row민원
3rd row민원
4th row행정
5th row행정

Common Values

ValueCountFrequency (%)
복지 27
22.5%
행정 22
18.3%
교통 12
10.0%
코로나19 12
10.0%
경제 10
 
8.3%
문화 7
 
5.8%
안전 6
 
5.0%
교육 6
 
5.0%
주택·건축 6
 
5.0%
보건 5
 
4.2%
Other values (2) 7
 
5.8%

Length

2023-12-13T01:00:54.588208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
복지 27
22.5%
행정 22
18.3%
교통 12
10.0%
코로나19 12
10.0%
경제 10
 
8.3%
문화 7
 
5.8%
안전 6
 
5.0%
교육 6
 
5.0%
주택·건축 6
 
5.0%
보건 5
 
4.2%
Other values (2) 7
 
5.8%
Distinct66
Distinct (%)55.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T01:00:54.822624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length6.3
Min length2

Characters and Unicode

Total characters756
Distinct characters170
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)39.2%

Sample

1st row민원신청
2nd row시장에게 바란다
3rd row하남시 콜센터
4th row행정부서 안내
5th row증명서류
ValueCountFrequency (%)
안내 16
 
8.2%
차량등록 8
 
4.1%
코로나 8
 
4.1%
임산부 7
 
3.6%
증명서류 7
 
3.6%
행정신고 6
 
3.1%
장애인 6
 
3.1%
영유아 5
 
2.6%
하남시 5
 
2.6%
서류 5
 
2.6%
Other values (89) 121
62.4%
2023-12-13T01:00:55.173284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
74
 
9.8%
20
 
2.6%
19
 
2.5%
17
 
2.2%
17
 
2.2%
16
 
2.1%
16
 
2.1%
13
 
1.7%
12
 
1.6%
12
 
1.6%
Other values (160) 540
71.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 676
89.4%
Space Separator 74
 
9.8%
Other Punctuation 6
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
3.0%
19
 
2.8%
17
 
2.5%
17
 
2.5%
16
 
2.4%
16
 
2.4%
13
 
1.9%
12
 
1.8%
12
 
1.8%
11
 
1.6%
Other values (157) 523
77.4%
Other Punctuation
ValueCountFrequency (%)
/ 5
83.3%
· 1
 
16.7%
Space Separator
ValueCountFrequency (%)
74
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 676
89.4%
Common 80
 
10.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
3.0%
19
 
2.8%
17
 
2.5%
17
 
2.5%
16
 
2.4%
16
 
2.4%
13
 
1.9%
12
 
1.8%
12
 
1.8%
11
 
1.6%
Other values (157) 523
77.4%
Common
ValueCountFrequency (%)
74
92.5%
/ 5
 
6.2%
· 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 676
89.4%
ASCII 79
 
10.4%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
74
93.7%
/ 5
 
6.3%
Hangul
ValueCountFrequency (%)
20
 
3.0%
19
 
2.8%
17
 
2.5%
17
 
2.5%
16
 
2.4%
16
 
2.4%
13
 
1.9%
12
 
1.8%
12
 
1.8%
11
 
1.6%
Other values (157) 523
77.4%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct118
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T01:00:55.424410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length14
Mean length9.1916667
Min length2

Characters and Unicode

Total characters1103
Distinct characters236
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique116 ?
Unique (%)96.7%

Sample

1st row민원신청, 나의 민원보기
2nd row열린시장실
3rd row전화연결
4th row부서안내/조직도
5th row주민등록등초본
ValueCountFrequency (%)
안내 25
 
9.8%
하남시 12
 
4.7%
지원 5
 
2.0%
신고 3
 
1.2%
불법주정차 3
 
1.2%
임산부 3
 
1.2%
예약 3
 
1.2%
현황 3
 
1.2%
2
 
0.8%
지방세 2
 
0.8%
Other values (180) 193
76.0%
2023-12-13T01:00:55.813854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
138
 
12.5%
32
 
2.9%
29
 
2.6%
25
 
2.3%
22
 
2.0%
21
 
1.9%
21
 
1.9%
20
 
1.8%
19
 
1.7%
16
 
1.5%
Other values (226) 760
68.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 948
85.9%
Space Separator 138
 
12.5%
Other Punctuation 8
 
0.7%
Uppercase Letter 5
 
0.5%
Dash Punctuation 2
 
0.2%
Decimal Number 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
3.4%
29
 
3.1%
25
 
2.6%
22
 
2.3%
21
 
2.2%
21
 
2.2%
20
 
2.1%
19
 
2.0%
16
 
1.7%
14
 
1.5%
Other values (215) 729
76.9%
Uppercase Letter
ValueCountFrequency (%)
E 2
40.0%
K 1
20.0%
G 1
20.0%
S 1
20.0%
Other Punctuation
ValueCountFrequency (%)
· 4
50.0%
, 3
37.5%
/ 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
9 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
138
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 948
85.9%
Common 150
 
13.6%
Latin 5
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
3.4%
29
 
3.1%
25
 
2.6%
22
 
2.3%
21
 
2.2%
21
 
2.2%
20
 
2.1%
19
 
2.0%
16
 
1.7%
14
 
1.5%
Other values (215) 729
76.9%
Common
ValueCountFrequency (%)
138
92.0%
· 4
 
2.7%
, 3
 
2.0%
- 2
 
1.3%
/ 1
 
0.7%
9 1
 
0.7%
1 1
 
0.7%
Latin
ValueCountFrequency (%)
E 2
40.0%
K 1
20.0%
G 1
20.0%
S 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 948
85.9%
ASCII 151
 
13.7%
None 4
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
138
91.4%
, 3
 
2.0%
E 2
 
1.3%
- 2
 
1.3%
K 1
 
0.7%
G 1
 
0.7%
S 1
 
0.7%
/ 1
 
0.7%
9 1
 
0.7%
1 1
 
0.7%
Hangul
ValueCountFrequency (%)
32
 
3.4%
29
 
3.1%
25
 
2.6%
22
 
2.3%
21
 
2.2%
21
 
2.2%
20
 
2.1%
19
 
2.0%
16
 
1.7%
14
 
1.5%
Other values (215) 729
76.9%
None
ValueCountFrequency (%)
· 4
100.0%

Interactions

2023-12-13T01:00:54.094477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:00:55.906756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야대메뉴
연번1.0000.9310.993
분야0.9311.0001.000
대메뉴0.9931.0001.000
2023-12-13T01:00:55.986314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야
연번1.0000.745
분야0.7451.000

Missing values

2023-12-13T01:00:54.204376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:00:54.271411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분야대메뉴세부메뉴
01민원민원신청민원신청, 나의 민원보기
12민원시장에게 바란다열린시장실
23민원하남시 콜센터전화연결
34행정행정부서 안내부서안내/조직도
45행정증명서류주민등록등초본
56행정증명서류가족관계증명서
67행정증명서류인감증명서
78행정증명서류본인서명사실확인서
89행정증명서류무인민원발급기 안내
910행정증명서류건강진단결과서
연번분야대메뉴세부메뉴
110111복지노인노인복지관
111112복지노인노인복지시설 현황
112113복지노인경로당 현황
113114복지장애인장애종류 및 등록
114115복지장애인장애판정절차
115116복지장애인장애 복지제도
116117복지장애인재활관련 사이트
117118복지장애인장애인 취업정보
118119복지경기공유서비스경기공유서비스 바로가기
119120복지행복 차 공유이용방법 안내