Overview

Dataset statistics

Number of variables6
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.1 KiB
Average record size in memory51.3 B

Variable types

Numeric3
Categorical2
Text1

Dataset

DescriptionSample
Author코난테크놀로지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOPLACE

Alerts

"채널값" has constant value ""Constant
"해당일자" has constant value ""Constant
"기본키값" is highly overall correlated with "차례값" and 1 other fieldsHigh correlation
"차례값" is highly overall correlated with "기본키값" and 1 other fieldsHigh correlation
"건수값" is highly overall correlated with "기본키값" and 1 other fieldsHigh correlation
"기본키값" has unique valuesUnique
"차례값" has unique valuesUnique
"이슈어값" has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:30:15.825217
Analysis finished2023-12-10 06:30:18.118419
Duration2.29 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"기본키값"
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15549.5
Minimum15350
Maximum15749
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:30:18.253592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15350
5-th percentile15369.95
Q115449.75
median15549.5
Q315649.25
95-th percentile15729.05
Maximum15749
Range399
Interquartile range (IQR)199.5

Descriptive statistics

Standard deviation115.6143
Coefficient of variation (CV)0.0074352424
Kurtosis-1.2
Mean15549.5
Median Absolute Deviation (MAD)100
Skewness0
Sum6219800
Variance13366.667
MonotonicityStrictly increasing
2023-12-10T15:30:18.528532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15350 1
 
0.2%
15614 1
 
0.2%
15624 1
 
0.2%
15623 1
 
0.2%
15622 1
 
0.2%
15621 1
 
0.2%
15620 1
 
0.2%
15619 1
 
0.2%
15618 1
 
0.2%
15617 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
15350 1
0.2%
15351 1
0.2%
15352 1
0.2%
15353 1
0.2%
15354 1
0.2%
15355 1
0.2%
15356 1
0.2%
15357 1
0.2%
15358 1
0.2%
15359 1
0.2%
ValueCountFrequency (%)
15749 1
0.2%
15748 1
0.2%
15747 1
0.2%
15746 1
0.2%
15745 1
0.2%
15744 1
0.2%
15743 1
0.2%
15742 1
0.2%
15741 1
0.2%
15740 1
0.2%

"채널값"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"블로그"
400 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"블로그"
2nd row"블로그"
3rd row"블로그"
4th row"블로그"
5th row"블로그"

Common Values

ValueCountFrequency (%)
"블로그" 400
100.0%

Length

2023-12-10T15:30:18.800052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:30:18.962229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
블로그 400
100.0%

"해당일자"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2020-05-01
400 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-05-01
2nd row2020-05-01
3rd row2020-05-01
4th row2020-05-01
5th row2020-05-01

Common Values

ValueCountFrequency (%)
2020-05-01 400
100.0%

Length

2023-12-10T15:30:19.129320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:30:19.312641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-05-01 400
100.0%

"차례값"
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200.5
Minimum1
Maximum400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:30:19.501042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20.95
Q1100.75
median200.5
Q3300.25
95-th percentile380.05
Maximum400
Range399
Interquartile range (IQR)199.5

Descriptive statistics

Standard deviation115.6143
Coefficient of variation (CV)0.57662993
Kurtosis-1.2
Mean200.5
Median Absolute Deviation (MAD)100
Skewness0
Sum80200
Variance13366.667
MonotonicityStrictly increasing
2023-12-10T15:30:19.748522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
265 1
 
0.2%
275 1
 
0.2%
274 1
 
0.2%
273 1
 
0.2%
272 1
 
0.2%
271 1
 
0.2%
270 1
 
0.2%
269 1
 
0.2%
268 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
400 1
0.2%
399 1
0.2%
398 1
0.2%
397 1
0.2%
396 1
0.2%
395 1
0.2%
394 1
0.2%
393 1
0.2%
392 1
0.2%
391 1
0.2%

"이슈어값"
Text

UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:30:20.631608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.2375
Min length4

Characters and Unicode

Total characters2095
Distinct characters336
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique400 ?
Unique (%)100.0%

Sample

1st row"아파트"
2nd row"식당"
3rd row"부동산"
4th row"학교"
5th row"주방"
ValueCountFrequency (%)
아파트 1
 
0.2%
한강공원 1
 
0.2%
세차장 1
 
0.2%
대공원 1
 
0.2%
체험장 1
 
0.2%
북카페 1
 
0.2%
왁싱샵 1
 
0.2%
mall 1
 
0.2%
체육공원 1
 
0.2%
여관 1
 
0.2%
Other values (390) 390
97.5%
2023-12-10T15:30:21.367651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 800
38.2%
46
 
2.2%
37
 
1.8%
31
 
1.5%
27
 
1.3%
22
 
1.1%
21
 
1.0%
21
 
1.0%
20
 
1.0%
19
 
0.9%
Other values (326) 1051
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1287
61.4%
Other Punctuation 800
38.2%
Lowercase Letter 8
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
3.6%
37
 
2.9%
31
 
2.4%
27
 
2.1%
22
 
1.7%
21
 
1.6%
21
 
1.6%
20
 
1.6%
19
 
1.5%
19
 
1.5%
Other values (319) 1024
79.6%
Lowercase Letter
ValueCountFrequency (%)
l 2
25.0%
a 2
25.0%
m 1
12.5%
s 1
12.5%
p 1
12.5%
c 1
12.5%
Other Punctuation
ValueCountFrequency (%)
" 800
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1287
61.4%
Common 800
38.2%
Latin 8
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
3.6%
37
 
2.9%
31
 
2.4%
27
 
2.1%
22
 
1.7%
21
 
1.6%
21
 
1.6%
20
 
1.6%
19
 
1.5%
19
 
1.5%
Other values (319) 1024
79.6%
Latin
ValueCountFrequency (%)
l 2
25.0%
a 2
25.0%
m 1
12.5%
s 1
12.5%
p 1
12.5%
c 1
12.5%
Common
ValueCountFrequency (%)
" 800
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1287
61.4%
ASCII 808
38.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 800
99.0%
l 2
 
0.2%
a 2
 
0.2%
m 1
 
0.1%
s 1
 
0.1%
p 1
 
0.1%
c 1
 
0.1%
Hangul
ValueCountFrequency (%)
46
 
3.6%
37
 
2.9%
31
 
2.4%
27
 
2.1%
22
 
1.7%
21
 
1.6%
21
 
1.6%
20
 
1.6%
19
 
1.5%
19
 
1.5%
Other values (319) 1024
79.6%

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct316
Distinct (%)79.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean962.475
Minimum16
Maximum13722
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:30:21.629339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile22
Q198.5
median257.5
Q3802.75
95-th percentile4691.15
Maximum13722
Range13706
Interquartile range (IQR)704.25

Descriptive statistics

Standard deviation1979.5932
Coefficient of variation (CV)2.0567736
Kurtosis17.224991
Mean962.475
Median Absolute Deviation (MAD)208
Skewness3.8955712
Sum384990
Variance3918789.2
MonotonicityDecreasing
2023-12-10T15:30:21.873777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18 5
 
1.2%
20 5
 
1.2%
85 4
 
1.0%
21 4
 
1.0%
60 4
 
1.0%
26 4
 
1.0%
38 3
 
0.8%
149 3
 
0.8%
291 3
 
0.8%
160 3
 
0.8%
Other values (306) 362
90.5%
ValueCountFrequency (%)
16 3
0.8%
17 2
 
0.5%
18 5
1.2%
20 5
1.2%
21 4
1.0%
22 3
0.8%
23 2
 
0.5%
24 2
 
0.5%
25 2
 
0.5%
26 4
1.0%
ValueCountFrequency (%)
13722 1
0.2%
13515 1
0.2%
12448 1
0.2%
11623 1
0.2%
10690 1
0.2%
10329 1
0.2%
10021 1
0.2%
9126 1
0.2%
8630 1
0.2%
7380 1
0.2%

Interactions

2023-12-10T15:30:17.274003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:16.221709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:16.763561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:17.446468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:16.404672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:16.942067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:17.613420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:16.592936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:30:17.101609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:30:22.033387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값"
"기본키값"1.0001.0000.806
"차례값"1.0001.0000.807
"건수값"0.8060.8071.000
2023-12-10T15:30:22.204702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값"
"기본키값"1.0001.000-1.000
"차례값"1.0001.000-1.000
"건수값"-1.000-1.0001.000

Missing values

2023-12-10T15:30:17.841362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:30:18.038756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
015350"블로그"2020-05-011"아파트"13722
115351"블로그"2020-05-012"식당"13515
215352"블로그"2020-05-013"부동산"12448
315353"블로그"2020-05-014"학교"11623
415354"블로그"2020-05-015"주방"10690
515355"블로그"2020-05-016"병원"10329
615356"블로그"2020-05-017"주택"10021
715357"블로그"2020-05-018"주차장"9126
815358"블로그"2020-05-019"화장실"8630
915359"블로그"2020-05-0110"법원"7380
"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
39015740"블로그"2020-05-01391"스낵바"18
39115741"블로그"2020-05-01392"신발가게"18
39215742"블로그"2020-05-01393"동네병원"18
39315743"블로그"2020-05-01394"라이브카페"18
39415744"블로그"2020-05-01395"화물실"18
39515745"블로그"2020-05-01396"헌책방"17
39615746"블로그"2020-05-01397"손님방"17
39715747"블로그"2020-05-01398"추모관"16
39815748"블로그"2020-05-01399"자동차용품점"16
39915749"블로그"2020-05-01400"료칸"16