Overview

Dataset statistics

Number of variables19
Number of observations25
Missing cells228
Missing cells (%)48.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory165.3 B

Variable types

Numeric2
Categorical5
Text7
Unsupported5

Dataset

Description한국 교통안전공단 철도 자격관리 시스템의 철도자격시험 응시에 필요한 소모품에 대한 데이터 정보로 테이블 명, 제품명 등을 제공 하고 있습니다.
Author한국교통안전공단
URLhttps://www.data.go.kr/data/15064516/fileData.do

Alerts

대분류 has constant value ""Constant
영문 테이블명 has constant value ""Constant
한글 테이블명 has constant value ""Constant
6월 has constant value ""Constant
7월 has constant value ""Constant
순번 is highly overall correlated with 년도 and 1 other fieldsHigh correlation
년도 is highly overall correlated with 순번High correlation
4월 is highly overall correlated with 순번High correlation
1월 has 10 (40.0%) missing valuesMissing
2월 has 15 (60.0%) missing valuesMissing
3월 has 15 (60.0%) missing valuesMissing
5월 has 15 (60.0%) missing valuesMissing
6월 has 24 (96.0%) missing valuesMissing
7월 has 24 (96.0%) missing valuesMissing
8월 has 25 (100.0%) missing valuesMissing
9월 has 25 (100.0%) missing valuesMissing
10월 has 25 (100.0%) missing valuesMissing
11월 has 25 (100.0%) missing valuesMissing
12월 has 25 (100.0%) missing valuesMissing
순번 has unique valuesUnique
8월 is an unsupported type, check if it needs cleaning or further analysisUnsupported
9월 is an unsupported type, check if it needs cleaning or further analysisUnsupported
10월 is an unsupported type, check if it needs cleaning or further analysisUnsupported
11월 is an unsupported type, check if it needs cleaning or further analysisUnsupported
12월 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 10:34:58.876130
Analysis finished2023-12-12 10:35:00.451133
Duration1.58 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13
Minimum1
Maximum25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T19:35:00.535680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.2
Q17
median13
Q319
95-th percentile23.8
Maximum25
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.3598007
Coefficient of variation (CV)0.56613852
Kurtosis-1.2
Mean13
Median Absolute Deviation (MAD)6
Skewness0
Sum325
Variance54.166667
MonotonicityStrictly increasing
2023-12-12T19:35:00.683921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
1 1
 
4.0%
2 1
 
4.0%
25 1
 
4.0%
24 1
 
4.0%
23 1
 
4.0%
22 1
 
4.0%
21 1
 
4.0%
20 1
 
4.0%
19 1
 
4.0%
18 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
1 1
4.0%
2 1
4.0%
3 1
4.0%
4 1
4.0%
5 1
4.0%
6 1
4.0%
7 1
4.0%
8 1
4.0%
9 1
4.0%
10 1
4.0%
ValueCountFrequency (%)
25 1
4.0%
24 1
4.0%
23 1
4.0%
22 1
4.0%
21 1
4.0%
20 1
4.0%
19 1
4.0%
18 1
4.0%
17 1
4.0%
16 1
4.0%

대분류
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
소모품 관리 정보
25 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소모품 관리 정보
2nd row소모품 관리 정보
3rd row소모품 관리 정보
4th row소모품 관리 정보
5th row소모품 관리 정보

Common Values

ValueCountFrequency (%)
소모품 관리 정보 25
100.0%

Length

2023-12-12T19:35:00.835892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:35:00.956149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소모품 25
33.3%
관리 25
33.3%
정보 25
33.3%

영문 테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
TB_LM1118
25 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTB_LM1118
2nd rowTB_LM1118
3rd rowTB_LM1118
4th rowTB_LM1118
5th rowTB_LM1118

Common Values

ValueCountFrequency (%)
TB_LM1118 25
100.0%

Length

2023-12-12T19:35:01.086188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:35:01.209915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tb_lm1118 25
100.0%

한글 테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
소모품 관리
25 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소모품 관리
2nd row소모품 관리
3rd row소모품 관리
4th row소모품 관리
5th row소모품 관리

Common Values

ValueCountFrequency (%)
소모품 관리 25
100.0%

Length

2023-12-12T19:35:01.336848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:35:01.451408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소모품 25
50.0%
관리 25
50.0%

년도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2018
2019
2017
2020

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row2018
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2018 9
36.0%
2019 9
36.0%
2017 6
24.0%
2020 1
 
4.0%

Length

2023-12-12T19:35:01.577001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:35:01.704900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 9
36.0%
2019 9
36.0%
2017 6
24.0%
2020 1
 
4.0%

일련번호
Real number (ℝ)

Distinct9
Distinct (%)36.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.48
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T19:35:01.823738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile8.8
Maximum9
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.5839247
Coefficient of variation (CV)0.5767689
Kurtosis-1.0695643
Mean4.48
Median Absolute Deviation (MAD)2
Skewness0.23755275
Sum112
Variance6.6766667
MonotonicityNot monotonic
2023-12-12T19:35:01.966795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 4
16.0%
2 3
12.0%
3 3
12.0%
4 3
12.0%
5 3
12.0%
6 3
12.0%
9 2
8.0%
7 2
8.0%
8 2
8.0%
ValueCountFrequency (%)
1 4
16.0%
2 3
12.0%
3 3
12.0%
4 3
12.0%
5 3
12.0%
6 3
12.0%
7 2
8.0%
8 2
8.0%
9 2
8.0%
ValueCountFrequency (%)
9 2
8.0%
8 2
8.0%
7 2
8.0%
6 3
12.0%
5 3
12.0%
4 3
12.0%
3 3
12.0%
2 3
12.0%
1 4
16.0%
Distinct20
Distinct (%)80.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T19:35:02.189688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length10
Mean length9
Min length2

Characters and Unicode

Total characters225
Distinct characters58
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)60.0%

Sample

1st row홍보물품
2nd row철도차량 운전면허증
3rd row철도교통 관제자격증명서
4th row발급기 소모품
5th row연금동 등사기 잉크
ValueCountFrequency (%)
연금동 11
22.0%
등사기 4
 
8.0%
마스터지 3
 
6.0%
홍보물품 2
 
4.0%
소모품 2
 
4.0%
box 2
 
4.0%
b4 2
 
4.0%
a4 2
 
4.0%
중질지 2
 
4.0%
발급기 2
 
4.0%
Other values (15) 18
36.0%
2023-12-12T19:35:02.887545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
12.0%
12
 
5.3%
12
 
5.3%
12
 
5.3%
( 8
 
3.6%
) 8
 
3.6%
8
 
3.6%
8
 
3.6%
B 5
 
2.2%
4 5
 
2.2%
Other values (48) 120
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 164
72.9%
Space Separator 27
 
12.0%
Uppercase Letter 11
 
4.9%
Open Punctuation 8
 
3.6%
Close Punctuation 8
 
3.6%
Decimal Number 5
 
2.2%
Other Punctuation 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
7.3%
12
 
7.3%
12
 
7.3%
8
 
4.9%
8
 
4.9%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (39) 92
56.1%
Uppercase Letter
ValueCountFrequency (%)
B 5
45.5%
O 2
 
18.2%
A 2
 
18.2%
X 2
 
18.2%
Space Separator
ValueCountFrequency (%)
27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Decimal Number
ValueCountFrequency (%)
4 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 164
72.9%
Common 50
 
22.2%
Latin 11
 
4.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
7.3%
12
 
7.3%
12
 
7.3%
8
 
4.9%
8
 
4.9%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (39) 92
56.1%
Common
ValueCountFrequency (%)
27
54.0%
( 8
 
16.0%
) 8
 
16.0%
4 5
 
10.0%
, 2
 
4.0%
Latin
ValueCountFrequency (%)
B 5
45.5%
O 2
 
18.2%
A 2
 
18.2%
X 2
 
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 164
72.9%
ASCII 61
 
27.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27
44.3%
( 8
 
13.1%
) 8
 
13.1%
B 5
 
8.2%
4 5
 
8.2%
, 2
 
3.3%
O 2
 
3.3%
A 2
 
3.3%
X 2
 
3.3%
Hangul
ValueCountFrequency (%)
12
 
7.3%
12
 
7.3%
12
 
7.3%
8
 
4.9%
8
 
4.9%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (39) 92
56.1%

1월
Text

MISSING 

Distinct15
Distinct (%)100.0%
Missing10
Missing (%)40.0%
Memory size332.0 B
2023-12-12T19:35:03.069140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length81
Median length18
Mean length14.533333
Min length1

Characters and Unicode

Total characters218
Distinct characters57
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)100.0%

Sample

1st row7,000개 - 250개(수원)
2nd row10개
3rd row9롤
4th row3set : 검정(3개), 빨강(3개), 파랑(3개)
5th row28Box
ValueCountFrequency (%)
12
24.0%
3
 
6.0%
5개 2
 
4.0%
검정(3개 2
 
4.0%
box 2
 
4.0%
17 2
 
4.0%
7,000개 1
 
2.0%
1400개 1
 
2.0%
1500 1
 
2.0%
리본잉크 1
 
2.0%
Other values (23) 23
46.0%
2023-12-12T19:35:03.421685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
16.1%
0 24
 
11.0%
15
 
6.9%
( 12
 
5.5%
) 12
 
5.5%
3 10
 
4.6%
- 7
 
3.2%
5 7
 
3.2%
, 7
 
3.2%
1 7
 
3.2%
Other values (47) 82
37.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 61
28.0%
Other Letter 60
27.5%
Space Separator 35
16.1%
Lowercase Letter 16
 
7.3%
Open Punctuation 12
 
5.5%
Close Punctuation 12
 
5.5%
Other Punctuation 12
 
5.5%
Dash Punctuation 7
 
3.2%
Uppercase Letter 2
 
0.9%
Math Symbol 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
25.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (24) 26
43.3%
Decimal Number
ValueCountFrequency (%)
0 24
39.3%
3 10
16.4%
5 7
 
11.5%
1 7
 
11.5%
2 6
 
9.8%
7 3
 
4.9%
4 2
 
3.3%
9 1
 
1.6%
8 1
 
1.6%
Lowercase Letter
ValueCountFrequency (%)
o 4
25.0%
x 4
25.0%
s 2
12.5%
e 2
12.5%
b 2
12.5%
t 2
12.5%
Other Punctuation
ValueCountFrequency (%)
, 7
58.3%
: 5
41.7%
Space Separator
ValueCountFrequency (%)
35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 140
64.2%
Hangul 60
27.5%
Latin 18
 
8.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
25.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (24) 26
43.3%
Common
ValueCountFrequency (%)
35
25.0%
0 24
17.1%
( 12
 
8.6%
) 12
 
8.6%
3 10
 
7.1%
- 7
 
5.0%
5 7
 
5.0%
, 7
 
5.0%
1 7
 
5.0%
2 6
 
4.3%
Other values (6) 13
 
9.3%
Latin
ValueCountFrequency (%)
o 4
22.2%
x 4
22.2%
s 2
11.1%
B 2
11.1%
e 2
11.1%
b 2
11.1%
t 2
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 158
72.5%
Hangul 60
 
27.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35
22.2%
0 24
15.2%
( 12
 
7.6%
) 12
 
7.6%
3 10
 
6.3%
- 7
 
4.4%
5 7
 
4.4%
, 7
 
4.4%
1 7
 
4.4%
2 6
 
3.8%
Other values (13) 31
19.6%
Hangul
ValueCountFrequency (%)
15
25.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (24) 26
43.3%

2월
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing15
Missing (%)60.0%
Memory size332.0 B
2023-12-12T19:35:03.603768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length17
Mean length9.6
Min length1

Characters and Unicode

Total characters96
Distinct characters42
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row회색 : 26개 남색 : 29개
2nd row2 box
3rd row= 830개
4th row1280개
5th row리본잉크 : 5개, 필름 : 5개
ValueCountFrequency (%)
6
22.2%
5개 2
 
7.4%
box 2
 
7.4%
필름 1
 
3.7%
29 1
 
3.7%
컬러(파랑,노랑,빨강:3개씩 1
 
3.7%
검정(3개 1
 
3.7%
set 1
 
3.7%
3 1
 
3.7%
10롤 1
 
3.7%
Other values (10) 10
37.0%
2023-12-12T19:35:03.895629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
17.7%
9
 
9.4%
: 6
 
6.2%
2 5
 
5.2%
3 4
 
4.2%
0 4
 
4.2%
, 4
 
4.2%
5 3
 
3.1%
9 2
 
2.1%
1 2
 
2.1%
Other values (32) 40
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31
32.3%
Decimal Number 24
25.0%
Space Separator 17
17.7%
Other Punctuation 10
 
10.4%
Lowercase Letter 9
 
9.4%
Open Punctuation 2
 
2.1%
Close Punctuation 2
 
2.1%
Math Symbol 1
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
29.0%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (11) 11
35.5%
Decimal Number
ValueCountFrequency (%)
2 5
20.8%
3 4
16.7%
0 4
16.7%
5 3
12.5%
9 2
 
8.3%
1 2
 
8.3%
8 2
 
8.3%
6 1
 
4.2%
4 1
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
b 2
22.2%
x 2
22.2%
o 2
22.2%
t 1
11.1%
e 1
11.1%
s 1
11.1%
Other Punctuation
ValueCountFrequency (%)
: 6
60.0%
, 4
40.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 56
58.3%
Hangul 31
32.3%
Latin 9
 
9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
29.0%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (11) 11
35.5%
Common
ValueCountFrequency (%)
17
30.4%
: 6
 
10.7%
2 5
 
8.9%
3 4
 
7.1%
0 4
 
7.1%
, 4
 
7.1%
5 3
 
5.4%
9 2
 
3.6%
1 2
 
3.6%
( 2
 
3.6%
Other values (5) 7
12.5%
Latin
ValueCountFrequency (%)
b 2
22.2%
x 2
22.2%
o 2
22.2%
t 1
11.1%
e 1
11.1%
s 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 65
67.7%
Hangul 31
32.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
26.2%
: 6
 
9.2%
2 5
 
7.7%
3 4
 
6.2%
0 4
 
6.2%
, 4
 
6.2%
5 3
 
4.6%
9 2
 
3.1%
1 2
 
3.1%
( 2
 
3.1%
Other values (11) 16
24.6%
Hangul
ValueCountFrequency (%)
9
29.0%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (11) 11
35.5%

3월
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing15
Missing (%)60.0%
Memory size332.0 B
2023-12-12T19:35:04.123294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length11.5
Mean length9.3
Min length1

Characters and Unicode

Total characters93
Distinct characters42
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row회색 : 26개 남색 : 29개
2nd row0box
3rd row=750
4th row1240개
5th row리본잉크 : 5개, 필름 : 5개
ValueCountFrequency (%)
5
20.0%
5개 2
 
8.0%
회색 1
 
4.0%
50개 1
 
4.0%
box 1
 
4.0%
29 1
 
4.0%
컬러(파랑,노랑,빨강:3개씩 1
 
4.0%
검정(3개 1
 
4.0%
set 1
 
4.0%
3 1
 
4.0%
Other values (10) 10
40.0%
2023-12-12T19:35:04.492432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
16.1%
8
 
8.6%
: 6
 
6.5%
0 5
 
5.4%
2 5
 
5.4%
, 4
 
4.3%
5 4
 
4.3%
3 3
 
3.2%
x 2
 
2.2%
1 2
 
2.2%
Other values (32) 39
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30
32.3%
Decimal Number 24
25.8%
Space Separator 15
16.1%
Other Punctuation 10
 
10.8%
Lowercase Letter 9
 
9.7%
Open Punctuation 2
 
2.2%
Close Punctuation 2
 
2.2%
Math Symbol 1
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
26.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (11) 11
36.7%
Decimal Number
ValueCountFrequency (%)
0 5
20.8%
2 5
20.8%
5 4
16.7%
3 3
12.5%
1 2
 
8.3%
9 2
 
8.3%
4 1
 
4.2%
7 1
 
4.2%
6 1
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
x 2
22.2%
o 2
22.2%
b 2
22.2%
e 1
11.1%
t 1
11.1%
s 1
11.1%
Other Punctuation
ValueCountFrequency (%)
: 6
60.0%
, 4
40.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 54
58.1%
Hangul 30
32.3%
Latin 9
 
9.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
26.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (11) 11
36.7%
Common
ValueCountFrequency (%)
15
27.8%
: 6
 
11.1%
0 5
 
9.3%
2 5
 
9.3%
, 4
 
7.4%
5 4
 
7.4%
3 3
 
5.6%
1 2
 
3.7%
( 2
 
3.7%
) 2
 
3.7%
Other values (5) 6
 
11.1%
Latin
ValueCountFrequency (%)
x 2
22.2%
o 2
22.2%
b 2
22.2%
e 1
11.1%
t 1
11.1%
s 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 63
67.7%
Hangul 30
32.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15
23.8%
: 6
 
9.5%
0 5
 
7.9%
2 5
 
7.9%
, 4
 
6.3%
5 4
 
6.3%
3 3
 
4.8%
x 2
 
3.2%
1 2
 
3.2%
( 2
 
3.2%
Other values (11) 15
23.8%
Hangul
ValueCountFrequency (%)
8
26.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (11) 11
36.7%

4월
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
<NA>
15 
-
6(4BOX 구매)
 
1

Length

Max length10
Median length4
Mean length3.16
Min length1

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row-
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 15
60.0%
- 9
36.0%
6(4BOX 구매) 1
 
4.0%

Length

2023-12-12T19:35:04.671213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:35:04.821661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 15
57.7%
9
34.6%
6(4box 1
 
3.8%
구매 1
 
3.8%

5월
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing15
Missing (%)60.0%
Memory size332.0 B
2023-12-12T19:35:04.974781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length19.5
Mean length11.1
Min length3

Characters and Unicode

Total characters111
Distinct characters51
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row회색 : 10개 남색 : 12개
2nd row4box
3rd row350개
4th row980개
5th row리본잉크 : 21개, 필름 : 20개
ValueCountFrequency (%)
5
 
18.5%
회색 1
 
3.7%
50개 1
 
3.7%
8vbox 1
 
3.7%
폐기 1
 
3.7%
8(6박스 1
 
3.7%
30box 1
 
3.7%
컬러(파랑,노랑,빨강:3개씩 1
 
3.7%
검정(3개 1
 
3.7%
set 1
 
3.7%
Other values (13) 13
48.1%
2023-12-12T19:35:05.291356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
15.3%
9
 
8.1%
0 7
 
6.3%
: 6
 
5.4%
3 5
 
4.5%
, 5
 
4.5%
1 4
 
3.6%
8 3
 
2.7%
) 3
 
2.7%
2 3
 
2.7%
Other values (41) 49
44.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37
33.3%
Decimal Number 27
24.3%
Space Separator 17
15.3%
Other Punctuation 11
 
9.9%
Lowercase Letter 9
 
8.1%
Uppercase Letter 4
 
3.6%
Close Punctuation 3
 
2.7%
Open Punctuation 3
 
2.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
24.3%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (17) 17
45.9%
Decimal Number
ValueCountFrequency (%)
0 7
25.9%
3 5
18.5%
1 4
14.8%
8 3
11.1%
2 3
11.1%
5 2
 
7.4%
6 1
 
3.7%
4 1
 
3.7%
9 1
 
3.7%
Lowercase Letter
ValueCountFrequency (%)
x 2
22.2%
o 2
22.2%
b 2
22.2%
s 1
11.1%
e 1
11.1%
t 1
11.1%
Uppercase Letter
ValueCountFrequency (%)
X 1
25.0%
O 1
25.0%
V 1
25.0%
B 1
25.0%
Other Punctuation
ValueCountFrequency (%)
: 6
54.5%
, 5
45.5%
Space Separator
ValueCountFrequency (%)
17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 61
55.0%
Hangul 37
33.3%
Latin 13
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
24.3%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (17) 17
45.9%
Common
ValueCountFrequency (%)
17
27.9%
0 7
11.5%
: 6
 
9.8%
3 5
 
8.2%
, 5
 
8.2%
1 4
 
6.6%
8 3
 
4.9%
) 3
 
4.9%
2 3
 
4.9%
( 3
 
4.9%
Other values (4) 5
 
8.2%
Latin
ValueCountFrequency (%)
x 2
15.4%
o 2
15.4%
b 2
15.4%
X 1
7.7%
O 1
7.7%
V 1
7.7%
B 1
7.7%
s 1
7.7%
e 1
7.7%
t 1
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 74
66.7%
Hangul 37
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
23.0%
0 7
 
9.5%
: 6
 
8.1%
3 5
 
6.8%
, 5
 
6.8%
1 4
 
5.4%
8 3
 
4.1%
) 3
 
4.1%
2 3
 
4.1%
( 3
 
4.1%
Other values (14) 18
24.3%
Hangul
ValueCountFrequency (%)
9
24.3%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (17) 17
45.9%

6월
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing24
Missing (%)96.0%
Memory size332.0 B
2023-12-12T19:35:05.495131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters19
Distinct characters15
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row3(서울1BOX 대전1BVX 불출)
ValueCountFrequency (%)
3(서울1box 1
33.3%
대전1bvx 1
33.3%
불출 1
33.3%
2023-12-12T19:35:05.873921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2
 
10.5%
B 2
 
10.5%
X 2
 
10.5%
2
 
10.5%
3 1
 
5.3%
( 1
 
5.3%
1
 
5.3%
1
 
5.3%
O 1
 
5.3%
1
 
5.3%
Other values (5) 5
26.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 6
31.6%
Other Letter 6
31.6%
Decimal Number 3
15.8%
Space Separator 2
 
10.5%
Open Punctuation 1
 
5.3%
Close Punctuation 1
 
5.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
X 2
33.3%
O 1
16.7%
V 1
16.7%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
3 1
33.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7
36.8%
Latin 6
31.6%
Hangul 6
31.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Common
ValueCountFrequency (%)
1 2
28.6%
2
28.6%
3 1
14.3%
( 1
14.3%
) 1
14.3%
Latin
ValueCountFrequency (%)
B 2
33.3%
X 2
33.3%
O 1
16.7%
V 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13
68.4%
Hangul 6
31.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2
15.4%
B 2
15.4%
X 2
15.4%
2
15.4%
3 1
7.7%
( 1
7.7%
O 1
7.7%
V 1
7.7%
) 1
7.7%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

7월
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing24
Missing (%)96.0%
Memory size332.0 B
2023-12-12T19:35:05.997586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2
Distinct characters2
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row2개
ValueCountFrequency (%)
2개 1
100.0%
2023-12-12T19:35:06.288152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1
50.0%
Other Letter 1
50.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1
50.0%
Hangul 1
50.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 1
100.0%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1
50.0%
Hangul 1
50.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1
100.0%
Hangul
ValueCountFrequency (%)
1
100.0%

8월
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

9월
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

10월
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

11월
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

12월
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

Interactions

2023-12-12T19:34:59.653368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:34:59.487537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:34:59.740350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:34:59.560485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:35:06.425246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번년도일련번호물품명1월2월3월4월5월
순번1.0000.8720.0000.5171.0001.0001.0001.0001.000
년도0.8721.0000.0000.7761.0001.0001.0000.4941.000
일련번호0.0000.0001.0001.0001.0001.0001.0000.0001.000
물품명0.5170.7761.0001.0001.0001.0001.0001.0001.000
1월1.0001.0001.0001.0001.0001.0001.0001.0001.000
2월1.0001.0001.0001.0001.0001.0001.0001.0001.000
3월1.0001.0001.0001.0001.0001.0001.0001.0001.000
4월1.0000.4940.0001.0001.0001.0001.0001.0001.000
5월1.0001.0001.0001.0001.0001.0001.0001.0001.000
2023-12-12T19:35:06.589646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도4월
년도1.0000.312
4월0.3121.000
2023-12-12T19:35:06.713376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번일련번호년도4월
순번1.000-0.0650.5520.707
일련번호-0.0651.0000.0000.000
년도0.5520.0001.0000.312
4월0.7070.0000.3121.000

Missing values

2023-12-12T19:34:59.907238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:35:00.150627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:35:00.355873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번대분류영문 테이블명한글 테이블명년도일련번호물품명1월2월3월4월5월6월7월8월9월10월11월12월
01소모품 관리 정보TB_LM1118소모품 관리20189홍보물품<NA>회색 : 26개 남색 : 29개회색 : 26개 남색 : 29개-회색 : 10개 남색 : 12개<NA><NA><NA><NA><NA><NA><NA>
12소모품 관리 정보TB_LM1118소모품 관리20191철도차량 운전면허증7,000개 - 250개(수원)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
23소모품 관리 정보TB_LM1118소모품 관리20192철도교통 관제자격증명서<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
34소모품 관리 정보TB_LM1118소모품 관리20193발급기 소모품<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
45소모품 관리 정보TB_LM1118소모품 관리20194연금동 등사기 잉크10개<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
56소모품 관리 정보TB_LM1118소모품 관리20195연금동 등사기 마스터지9롤<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
67소모품 관리 정보TB_LM1118소모품 관리20196연금동 복합기 토너카트리지3set : 검정(3개), 빨강(3개), 파랑(3개)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
78소모품 관리 정보TB_LM1118소모품 관리20197연금동 중질지 (B4)28Box<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
89소모품 관리 정보TB_LM1118소모품 관리20198연금동 (A4)4Box<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
910소모품 관리 정보TB_LM1118소모품 관리20199홍보물품<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
순번대분류영문 테이블명한글 테이블명년도일련번호물품명1월2월3월4월5월6월7월8월9월10월11월12월
1516소모품 관리 정보TB_LM1118소모품 관리20184연금동 등사기 잉크53 개50개50개-50개<NA><NA><NA><NA><NA><NA><NA>
1617소모품 관리 정보TB_LM1118소모품 관리20185연금동 등사기 마스터지17 롤10롤10롤-10롤<NA><NA><NA><NA><NA><NA><NA>
1718소모품 관리 정보TB_LM1118소모품 관리20186연금동 복합기 토너카트리지3 set : 검정(3개), 컬러(파랑,노랑,빨강:3개씩)3 set : 검정(3개), 컬러(파랑,노랑,빨강:3개씩)3 set : 검정(3개), 컬러(파랑,노랑,빨강:3개씩)-3 set : 검정(3개), 컬러(파랑,노랑,빨강:3개씩)<NA><NA><NA><NA><NA><NA><NA>
1819소모품 관리 정보TB_LM1118소모품 관리20187연금동 중질지(B4)30 box29 box29 box-30box<NA><NA><NA><NA><NA><NA><NA>
1920소모품 관리 정보TB_LM1118소모품 관리20171리본<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2021소모품 관리 정보TB_LM1118소모품 관리20172카드(면허) ,BOX5426(4BOX 구매)8(6박스 폐기, 8VBOX 구매)3(서울1BOX 대전1BVX 불출)<NA><NA><NA><NA><NA><NA>
2122소모품 관리 정보TB_LM1118소모품 관리20173카드(관제), BOX<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2223소모품 관리 정보TB_LM1118소모품 관리20174B4 중질지<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2324소모품 관리 정보TB_LM1118소모품 관리20175마스터지 (연금동)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2425소모품 관리 정보TB_LM1118소모품 관리20176잉크(연금동)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>