Overview

Dataset statistics

Number of variables13
Number of observations49
Missing cells68
Missing cells (%)10.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory106.7 B

Variable types

Categorical8
Text1
DateTime4

Dataset

Description대구광역시 달서구_주민 정보화교육 현황_20230519
Author대구광역시 달서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3045971&dataSetDetailId=30459711a36610912d1d&provdMethod=FILE

Alerts

문의전화 has constant value ""Constant
담당부서 has constant value ""Constant
기준일자 has constant value ""Constant
난이도 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
구분 is highly overall correlated with 과정명 and 3 other fieldsHigh correlation
과정명 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
접수방법 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
교육시간 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
교육장소 is highly imbalanced (75.4%)Imbalance
인터넷접수 시작일 has 34 (69.4%) missing valuesMissing
인터넷접수 종료일 has 34 (69.4%) missing valuesMissing

Reproduction

Analysis started2024-04-19 05:45:18.842771
Analysis finished2024-04-19 05:45:19.653191
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
컴퓨터
23 
스마트폰
16 
키오스크
10 

Length

Max length4
Median length4
Mean length3.5306122
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row컴퓨터
2nd row컴퓨터
3rd row컴퓨터
4th row컴퓨터
5th row컴퓨터

Common Values

ValueCountFrequency (%)
컴퓨터 23
46.9%
스마트폰 16
32.7%
키오스크 10
20.4%

Length

2024-04-19T14:45:19.715253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:19.810334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
컴퓨터 23
46.9%
스마트폰 16
32.7%
키오스크 10
20.4%

과정명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)22.4%
Missing0
Missing (%)0.0%
Memory size524.0 B
키오스크 체험 교육
10 
스마트폰 기초
스마트폰 SNS 활용
컴퓨터 기초
인터넷 기초
Other values (6)
15 

Length

Max length11
Median length8
Mean length7.8367347
Min length5

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row컴퓨터 기초
2nd row인터넷 기초
3rd row문서편집 활용
4th row파워포인트 활용
5th row엑셀 활용

Common Values

ValueCountFrequency (%)
키오스크 체험 교육 10
20.4%
스마트폰 기초 8
16.3%
스마트폰 SNS 활용 8
16.3%
컴퓨터 기초 4
 
8.2%
인터넷 기초 4
 
8.2%
문서편집 활용 4
 
8.2%
엑셀 활용 4
 
8.2%
파워포인트 활용 2
 
4.1%
엑셀 고급 2
 
4.1%
사진 편집 2
 
4.1%

Length

2024-04-19T14:45:19.917620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
활용 18
15.5%
스마트폰 16
13.8%
기초 16
13.8%
키오스크 10
8.6%
체험 10
8.6%
교육 10
8.6%
sns 8
6.9%
엑셀 6
 
5.2%
컴퓨터 4
 
3.4%
인터넷 4
 
3.4%
Other values (5) 14
12.1%

난이도
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
18 
12 
초급
중급

Length

Max length2
Median length1
Mean length1.3265306
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
18
36.7%
12
24.5%
초급 8
16.3%
중급 8
16.3%
3
 
6.1%

Length

2024-04-19T14:45:20.040079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:20.143241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
18
36.7%
12
24.5%
초급 8
16.3%
중급 8
16.3%
3
 
6.1%
Distinct45
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-04-19T14:45:20.343363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length16
Min length16

Characters and Unicode

Total characters784
Distinct characters15
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)83.7%

Sample

1st row01-09~01-20(10일)
2nd row01-25~02-03(08일)
3rd row02-06~02-17(10일)
4th row02-20~03-03(09일)
5th row03-06~03-17(10일)
ValueCountFrequency (%)
07-17~07-28(10일 2
 
4.1%
05-08~05-19(10일 2
 
4.1%
10-16~10-27(10일 2
 
4.1%
04-17~04-28(10일 2
 
4.1%
03-06~03-10(05일 1
 
2.0%
10-10~10-13(04일 1
 
2.0%
04-10~04-14(05일 1
 
2.0%
07-03~07-04(02일 1
 
2.0%
07-06~07-07(02일 1
 
2.0%
09-25~09-26(02일 1
 
2.0%
Other values (35) 35
71.4%
2024-04-19T14:45:20.706443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 181
23.1%
1 107
13.6%
- 98
12.5%
~ 49
 
6.2%
( 49
 
6.2%
49
 
6.2%
) 49
 
6.2%
2 48
 
6.1%
4 28
 
3.6%
7 25
 
3.2%
Other values (5) 101
12.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 490
62.5%
Dash Punctuation 98
 
12.5%
Math Symbol 49
 
6.2%
Open Punctuation 49
 
6.2%
Other Letter 49
 
6.2%
Close Punctuation 49
 
6.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 181
36.9%
1 107
21.8%
2 48
 
9.8%
4 28
 
5.7%
7 25
 
5.1%
3 24
 
4.9%
5 22
 
4.5%
6 21
 
4.3%
8 17
 
3.5%
9 17
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 98
100.0%
Math Symbol
ValueCountFrequency (%)
~ 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Other Letter
ValueCountFrequency (%)
49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 735
93.8%
Hangul 49
 
6.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 181
24.6%
1 107
14.6%
- 98
13.3%
~ 49
 
6.7%
( 49
 
6.7%
) 49
 
6.7%
2 48
 
6.5%
4 28
 
3.8%
7 25
 
3.4%
3 24
 
3.3%
Other values (4) 77
10.5%
Hangul
ValueCountFrequency (%)
49
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 735
93.8%
Hangul 49
 
6.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 181
24.6%
1 107
14.6%
- 98
13.3%
~ 49
 
6.7%
( 49
 
6.7%
) 49
 
6.7%
2 48
 
6.5%
4 28
 
3.8%
7 25
 
3.4%
3 24
 
3.3%
Other values (4) 77
10.5%
Hangul
ValueCountFrequency (%)
49
100.0%

교육시간
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
14~16시
27 
16~18시
22 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row14~16시
2nd row14~16시
3rd row14~16시
4th row14~16시
5th row14~16시

Common Values

ValueCountFrequency (%)
14~16시 27
55.1%
16~18시 22
44.9%

Length

2024-04-19T14:45:20.868211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:20.976004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
14~16시 27
55.1%
16~18시 22
44.9%

접수방법
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
전화(100%)
34 
인터넷(50%)+전화(50%)
15 

Length

Max length16
Median length8
Mean length10.44898
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전화(100%)
2nd row전화(100%)
3rd row인터넷(50%)+전화(50%)
4th row인터넷(50%)+전화(50%)
5th row인터넷(50%)+전화(50%)

Common Values

ValueCountFrequency (%)
전화(100%) 34
69.4%
인터넷(50%)+전화(50%) 15
30.6%

Length

2024-04-19T14:45:21.088450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:21.193900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전화(100 34
69.4%
인터넷(50%)+전화(50 15
30.6%
Distinct4
Distinct (%)26.7%
Missing34
Missing (%)69.4%
Memory size524.0 B
Minimum2022-12-06 00:00:00
Maximum2023-09-06 00:00:00
2024-04-19T14:45:21.270757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:45:21.388976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=4)
Distinct4
Distinct (%)26.7%
Missing34
Missing (%)69.4%
Memory size524.0 B
Minimum2022-12-08 00:00:00
Maximum2023-09-08 00:00:00
2024-04-19T14:45:21.504526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:45:21.631111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=4)
Distinct4
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2022-12-12 00:00:00
Maximum2023-09-11 00:00:00
2024-04-19T14:45:21.748102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:45:21.867644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=4)

교육장소
Categorical

IMBALANCE 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
달서아트센터 3층 컴퓨터실
47 
정부대구지방 합동청사 업무B동 1층 전산교육장
 
2

Length

Max length25
Median length14
Mean length14.44898
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row달서아트센터 3층 컴퓨터실
2nd row달서아트센터 3층 컴퓨터실
3rd row달서아트센터 3층 컴퓨터실
4th row달서아트센터 3층 컴퓨터실
5th row달서아트센터 3층 컴퓨터실

Common Values

ValueCountFrequency (%)
달서아트센터 3층 컴퓨터실 47
95.9%
정부대구지방 합동청사 업무B동 1층 전산교육장 2
 
4.1%

Length

2024-04-19T14:45:21.990288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:22.081297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
달서아트센터 47
31.1%
3층 47
31.1%
컴퓨터실 47
31.1%
정부대구지방 2
 
1.3%
합동청사 2
 
1.3%
업무b동 2
 
1.3%
1층 2
 
1.3%
전산교육장 2
 
1.3%

문의전화
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
053-667-2451
49 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row053-667-2451
2nd row053-667-2451
3rd row053-667-2451
4th row053-667-2451
5th row053-667-2451

Common Values

ValueCountFrequency (%)
053-667-2451 49
100.0%

Length

2024-04-19T14:45:22.172525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:22.254596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
053-667-2451 49
100.0%

담당부서
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
대구광역시 달서구 홍보전산과
49 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시 달서구 홍보전산과
2nd row대구광역시 달서구 홍보전산과
3rd row대구광역시 달서구 홍보전산과
4th row대구광역시 달서구 홍보전산과
5th row대구광역시 달서구 홍보전산과

Common Values

ValueCountFrequency (%)
대구광역시 달서구 홍보전산과 49
100.0%

Length

2024-04-19T14:45:22.338483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:45:22.419494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 49
33.3%
달서구 49
33.3%
홍보전산과 49
33.3%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2023-05-19 00:00:00
Maximum2023-05-19 00:00:00
2024-04-19T14:45:22.495214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:45:22.614166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2024-04-19T14:45:22.980246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분과정명난이도교육기간_일수교육시간접수방법인터넷접수 시작일인터넷접수 종료일전화접수일교육장소
구분1.0001.0000.7910.0000.6240.445NaNNaN0.0000.134
과정명1.0001.0001.0000.8740.8621.0000.0000.0000.0000.139
난이도0.7911.0001.0000.5690.6691.0000.0000.0000.0000.316
교육기간_일수0.0000.8740.5691.0000.0000.8311.0001.0001.0001.000
교육시간0.6240.8620.6690.0001.0000.752NaNNaN0.0000.000
접수방법0.4451.0001.0000.8310.7521.000NaNNaN0.0000.000
인터넷접수 시작일NaN0.0000.0001.000NaNNaN1.0001.0001.000NaN
인터넷접수 종료일NaN0.0000.0001.000NaNNaN1.0001.0001.000NaN
전화접수일0.0000.0000.0001.0000.0000.0001.0001.0001.0000.000
교육장소0.1340.1390.3161.0000.0000.000NaNNaN0.0001.000
2024-04-19T14:45:23.160324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
난이도구분과정명교육장소교육시간접수방법
난이도1.0000.7900.9290.3710.7740.968
구분0.7901.0000.9090.2170.8810.683
과정명0.9290.9091.0000.1000.7790.899
교육장소0.3710.2170.1001.0000.0000.000
교육시간0.7740.8810.7790.0001.0000.542
접수방법0.9680.6830.8990.0000.5421.000
2024-04-19T14:45:23.257004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분과정명난이도교육시간접수방법교육장소
구분1.0000.9090.7900.8810.6830.217
과정명0.9091.0000.9290.7790.8990.100
난이도0.7900.9291.0000.7740.9680.371
교육시간0.8810.7790.7741.0000.5420.000
접수방법0.6830.8990.9680.5421.0000.000
교육장소0.2170.1000.3710.0000.0001.000

Missing values

2024-04-19T14:45:19.290011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:45:19.497446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-19T14:45:19.608411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분과정명난이도교육기간_일수교육시간접수방법인터넷접수 시작일인터넷접수 종료일전화접수일교육장소문의전화담당부서기준일자
0컴퓨터컴퓨터 기초01-09~01-20(10일)14~16시전화(100%)<NA><NA>2022-12-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
1컴퓨터인터넷 기초01-25~02-03(08일)14~16시전화(100%)<NA><NA>2022-12-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
2컴퓨터문서편집 활용02-06~02-17(10일)14~16시인터넷(50%)+전화(50%)2022-12-062022-12-082022-12-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
3컴퓨터파워포인트 활용02-20~03-03(09일)14~16시인터넷(50%)+전화(50%)2022-12-062022-12-082022-12-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
4컴퓨터엑셀 활용03-06~03-17(10일)14~16시인터넷(50%)+전화(50%)2022-12-062022-12-082022-12-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
5컴퓨터엑셀 고급03-20~03-31(10일)14~16시인터넷(50%)+전화(50%)2022-12-062022-12-082022-12-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
6컴퓨터컴퓨터 기초04-03~04-14(10일)14~16시전화(100%)<NA><NA>2023-03-10달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
7컴퓨터인터넷 기초04-17~04-28(10일)14~16시전화(100%)<NA><NA>2023-03-10달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
8컴퓨터문서편집 활용05-08~05-19(10일)14~16시인터넷(50%)+전화(50%)2023-03-062023-03-082023-03-10달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
9컴퓨터엑셀 활용05-22~06-02(10일)14~16시인터넷(50%)+전화(50%)2023-03-062023-03-082023-03-10달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
구분과정명난이도교육기간_일수교육시간접수방법인터넷접수 시작일인터넷접수 종료일전화접수일교육장소문의전화담당부서기준일자
39스마트폰스마트폰 기초초급06-05~06-09(04일)16~18시전화(100%)<NA><NA>2023-03-10달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
40스마트폰스마트폰 SNS 활용중급06-12~06-23(10일)16~18시전화(100%)<NA><NA>2023-03-10정부대구지방 합동청사 업무B동 1층 전산교육장053-667-2451대구광역시 달서구 홍보전산과2023-05-19
41스마트폰스마트폰 기초초급07-10~07-14(05일)16~18시전화(100%)<NA><NA>2023-06-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
42스마트폰스마트폰 SNS 활용중급07-17~07-28(10일)16~18시전화(100%)<NA><NA>2023-06-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
43스마트폰스마트폰 기초초급09-04~09-08(05일)16~18시전화(100%)<NA><NA>2023-06-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
44스마트폰스마트폰 SNS 활용중급09-11~09-22(10일)16~18시전화(100%)<NA><NA>2023-06-12달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
45스마트폰스마트폰 기초초급10-10~10-13(04일)16~18시전화(100%)<NA><NA>2023-09-11달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
46스마트폰스마트폰 SNS 활용중급10-16~10-27(10일)16~18시전화(100%)<NA><NA>2023-09-11달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
47스마트폰스마트폰 기초초급11-06~11-10(05일)16~18시전화(100%)<NA><NA>2023-09-11달서아트센터 3층 컴퓨터실053-667-2451대구광역시 달서구 홍보전산과2023-05-19
48스마트폰스마트폰 SNS 활용중급11-13~11-24(10일)16~18시전화(100%)<NA><NA>2023-09-11정부대구지방 합동청사 업무B동 1층 전산교육장053-667-2451대구광역시 달서구 홍보전산과2023-05-19