Overview

Dataset statistics

Number of variables6
Number of observations151
Missing cells0
Missing cells (%)0.0%
Duplicate rows10
Duplicate rows (%)6.6%
Total size in memory7.8 KiB
Average record size in memory52.9 B

Variable types

Categorical4
Numeric1
Text1

Dataset

DescriptionSample
Author한국인터넷진흥원
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000009

Alerts

생성년도 has constant value ""Constant
생성월 has constant value ""Constant
생성일 has constant value ""Constant
Dataset has 10 (6.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-10 06:27:58.831068
Analysis finished2023-12-10 06:27:59.832785
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

생성년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2018
151 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 151
100.0%

Length

2023-12-10T15:27:59.941834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:28:00.106185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 151
100.0%

생성월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
12
151 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row12
2nd row12
3rd row12
4th row12
5th row12

Common Values

ValueCountFrequency (%)
12 151
100.0%

Length

2023-12-10T15:28:00.291241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:28:00.490332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
12 151
100.0%

생성일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
19
151 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row19
2nd row19
3rd row19
4th row19
5th row19

Common Values

ValueCountFrequency (%)
19 151
100.0%

Length

2023-12-10T15:28:00.656310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:28:00.821759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
19 151
100.0%

생성시분초
Real number (ℝ)

Distinct40
Distinct (%)26.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean156427.73
Minimum5730
Maximum235234
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-10T15:28:00.981668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5730
5-th percentile102315
Q1133909
median155549
Q3173118
95-th percentile218438.5
Maximum235234
Range229504
Interquartile range (IQR)39209

Descriptive statistics

Standard deviation41698.318
Coefficient of variation (CV)0.26656602
Kurtosis2.1079049
Mean156427.73
Median Absolute Deviation (MAD)21640
Skewness-0.79953848
Sum23620587
Variance1.7387497 × 109
MonotonicityNot monotonic
2023-12-10T15:28:01.150117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
133909 11
 
7.3%
173118 10
 
6.6%
205157 9
 
6.0%
170600 9
 
6.0%
172610 9
 
6.0%
170100 9
 
6.0%
141426 9
 
6.0%
133358 8
 
5.3%
155046 7
 
4.6%
110825 6
 
4.0%
Other values (30) 64
42.4%
ValueCountFrequency (%)
5730 2
 
1.3%
12734 1
 
0.7%
44744 1
 
0.7%
50254 1
 
0.7%
100806 1
 
0.7%
102315 4
2.6%
110825 6
4.0%
111325 1
 
0.7%
112332 2
 
1.3%
112838 3
2.0%
ValueCountFrequency (%)
235234 2
 
1.3%
233234 2
 
1.3%
232730 2
 
1.3%
231219 2
 
1.3%
205658 2
 
1.3%
205158 5
3.3%
205157 9
6.0%
204650 4
2.6%
194633 1
 
0.7%
192627 4
2.6%

IP주소
Categorical

Distinct16
Distinct (%)10.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
67.*.248.10
62 
104.*.14.159
25 
209.*.34.239
18 
125.*.133.7
14 
-
 
6
Other values (11)
26 

Length

Max length13
Median length11
Mean length10.986755
Min length1

Unique

Unique4 ?
Unique (%)2.6%

Sample

1st row67.*.248.10
2nd row67.*.248.10
3rd row67.*.248.10
4th row104.*.15.159
5th row104.*.14.159

Common Values

ValueCountFrequency (%)
67.*.248.10 62
41.1%
104.*.14.159 25
16.6%
209.*.34.239 18
 
11.9%
125.*.133.7 14
 
9.3%
- 6
 
4.0%
13.*.153.143 5
 
3.3%
27.*.72.227 4
 
2.6%
104.*.15.159 3
 
2.0%
61.*.164.234 3
 
2.0%
61.*.132.180 3
 
2.0%
Other values (6) 8
 
5.3%

Length

2023-12-10T15:28:01.354228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
67.*.248.10 62
41.1%
104.*.14.159 25
16.6%
209.*.34.239 18
 
11.9%
125.*.133.7 14
 
9.3%
6
 
4.0%
13.*.153.143 5
 
3.3%
27.*.72.227 4
 
2.6%
104.*.15.159 3
 
2.0%
61.*.164.234 3
 
2.0%
61.*.132.180 3
 
2.0%
Other values (6) 8
 
5.3%

URL
Text

Distinct111
Distinct (%)73.5%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-10T15:28:01.681492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length19.986755
Min length12

Characters and Unicode

Total characters3018
Distinct characters65
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)52.3%

Sample

1st rowhxxp://bit.ly/2EBEFK0
2nd rowhxxp://bit.ly/2ScVSNf
3rd rowhxxp://bit.ly/2rT1ikZ
4th rowhxxps://is.gd/Aqx02h
5th rowhxxps://is.gd/YKcn3z
ValueCountFrequency (%)
hxxps://hoy.kr/golx 4
 
2.6%
hxxp://2019.01.09 4
 
2.6%
hxxp://bit.ly/2rt1ikz 3
 
2.0%
hankeys.itnsk.cn 3
 
2.0%
hxxps://c11.kr/5512 3
 
2.0%
hxxp://bit.ly/2ed56pb 3
 
2.0%
hxxp://bit.ly/lqwb 2
 
1.3%
hxxp://www.cjmaaio.pro 2
 
1.3%
hxxp://bit.ly/2ebefk0 2
 
1.3%
hxxps://is.gd/c5mrnt 2
 
1.3%
Other values (101) 123
81.5%
2023-12-10T15:28:02.237954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 420
 
13.9%
x 297
 
9.8%
h 188
 
6.2%
. 170
 
5.6%
p 152
 
5.0%
: 144
 
4.8%
s 118
 
3.9%
i 111
 
3.7%
l 94
 
3.1%
2 93
 
3.1%
Other values (55) 1231
40.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1676
55.5%
Other Punctuation 734
24.3%
Uppercase Letter 343
 
11.4%
Decimal Number 265
 
8.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
x 297
17.7%
h 188
11.2%
p 152
 
9.1%
s 118
 
7.0%
i 111
 
6.6%
l 94
 
5.6%
y 85
 
5.1%
b 80
 
4.8%
t 76
 
4.5%
g 52
 
3.1%
Other values (16) 423
25.2%
Uppercase Letter
ValueCountFrequency (%)
F 23
 
6.7%
B 21
 
6.1%
P 19
 
5.5%
C 18
 
5.2%
S 17
 
5.0%
Q 17
 
5.0%
X 17
 
5.0%
K 16
 
4.7%
T 16
 
4.7%
G 15
 
4.4%
Other values (16) 164
47.8%
Decimal Number
ValueCountFrequency (%)
2 93
35.1%
1 43
16.2%
0 31
 
11.7%
5 23
 
8.7%
9 19
 
7.2%
6 17
 
6.4%
3 13
 
4.9%
4 12
 
4.5%
8 8
 
3.0%
7 6
 
2.3%
Other Punctuation
ValueCountFrequency (%)
/ 420
57.2%
. 170
23.2%
: 144
 
19.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 2019
66.9%
Common 999
33.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
x 297
 
14.7%
h 188
 
9.3%
p 152
 
7.5%
s 118
 
5.8%
i 111
 
5.5%
l 94
 
4.7%
y 85
 
4.2%
b 80
 
4.0%
t 76
 
3.8%
g 52
 
2.6%
Other values (42) 766
37.9%
Common
ValueCountFrequency (%)
/ 420
42.0%
. 170
17.0%
: 144
 
14.4%
2 93
 
9.3%
1 43
 
4.3%
0 31
 
3.1%
5 23
 
2.3%
9 19
 
1.9%
6 17
 
1.7%
3 13
 
1.3%
Other values (3) 26
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3018
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 420
 
13.9%
x 297
 
9.8%
h 188
 
6.2%
. 170
 
5.6%
p 152
 
5.0%
: 144
 
4.8%
s 118
 
3.9%
i 111
 
3.7%
l 94
 
3.1%
2 93
 
3.1%
Other values (55) 1231
40.8%

Interactions

2023-12-10T15:27:59.013980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:28:02.396638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
생성시분초IP주소
생성시분초1.0000.507
IP주소0.5071.000
2023-12-10T15:28:02.524739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
생성시분초IP주소
생성시분초1.0000.191
IP주소0.1911.000

Missing values

2023-12-10T15:27:59.581590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:27:59.769074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

생성년도생성월생성일생성시분초IP주소URL
02018121911283867.*.248.10hxxp://bit.ly/2EBEFK0
12018121915300967.*.248.10hxxp://bit.ly/2ScVSNf
22018121913390967.*.248.10hxxp://bit.ly/2rT1ikZ
320181219155046104.*.15.159hxxps://is.gd/Aqx02h
420181219205157104.*.14.159hxxps://is.gd/YKcn3z
52018121911082567.*.248.10hxxp://bit.ly/2SVoz0P
62018121913335867.*.248.10hxxp://bit.ly/2rT1ikZ
720181219170600-hxxp://2019.01.09
82018121917261054.*.52.76hxxp://bit.do/eDBP6
92018121917010061.*.164.234www.gouha.com.cn
생성년도생성월생성일생성시분초IP주소URL
1412018121913390967.*.248.10hxxp://bit.ly/2EmiSFt
1422018121913390927.*.72.227www.cjmaaio.pro
14320181219133358209.*.34.239hxxps://han.gl/kUcc7
1442018121917010013.*.153.143hxxps://c11.kr/5512
1452018121920565867.*.248.10hxxp://bit.ly/2S4y0jj
1462018121915554967.*.248.10hxxp://bit.ly/2BunFl2
1472018121920515767.*.248.10hxxp://bit.ly/2T36zlx
1482018121920515767.*.248.10hxxp://bit.ly/2rMnAol
14920181219102315104.*.14.159hxxps://is.gd/C5mRNt
1502018121915351813.*.153.143hxxps://c11.kr/5512

Duplicate rows

Most frequently occurring

생성년도생성월생성일생성시분초IP주소URL# duplicates
420181219170600-hxxp://2019.01.094
02018121913335867.*.248.10hxxp://bit.ly/2rT1ikZ2
12018121915504667.*.248.10hxxp://bit.ly/2rT9qC12
220181219170100125.*.133.7hxxps://hoy.kr/GolX2
32018121917010061.*.132.180hankeys.itnsk.cn2
520181219170600125.*.133.7hxxps://hoy.kr/dFP02
620181219172610125.*.133.7hxxps://hoy.kr/GolX2
720181219192627209.*.34.239hxxps://han.gl/LcXA22
820181219192627209.*.34.239hxxps://han.gl/dQ0672
92018121923121967.*.248.10hxxps://bit.ly/2ByFkb42