Amino acid dipeptide frequency for Kaeng Khoi virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
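The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the use of one-letter residue codes, the mapping of every non-standard letter to `X` (Xaa), and the choice to count overlapping pairs only within a protein are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 400 possible dipeptides: 2.5 per mille (0.25%).
EXPECTED_UNIFORM = 1000.0 / len(STANDARD) ** 2

def representation(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_UNIFORM:
        return "over"   # would be marked red in the table
    if freq_permille < EXPECTED_UNIFORM:
        return "under"  # would be marked blue in the table
    return "expected"

# Toy input (hypothetical sequences, not from this proteome).
freqs = dipeptide_permille(["MKKLA", "AKKU"])
```

With the toy input, the letter `U` is treated as Xaa, so the second sequence contributes the pairs `AK`, `KK` and `KX`.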

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
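As a small worked example of this conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the value 10.242 is the IleLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation (1/n)."""
    expected_permille = 1000.0 / n_dipeptides
    return value_permille / expected_permille

ile_leu = 10.242  # IleLeu frequency in per mille, taken from the table
fraction = permille_to_fraction(ile_leu)  # roughly 0.0102, i.e. about 1.02%
fold = fold_over_uniform(ile_leu)         # roughly 4.1 times the uniform value
```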

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.749 ± 1.428
AlaCys: 0.749 ± 0.223
AlaAsp: 2.498 ± 0.291
AlaGlu: 3.248 ± 0.864
AlaPhe: 1.249 ± 0.498
AlaGly: 0.749 ± 0.356
AlaHis: 0.5 ± 0.159
AlaIle: 2.498 ± 0.291
AlaLys: 3.997 ± 0.653
AlaLeu: 3.497 ± 1.134
AlaMet: 1.249 ± 0.498
AlaAsn: 3.497 ± 0.775
AlaPro: 0.5 ± 0.316
AlaGln: 1.749 ± 0.387
AlaArg: 2.248 ± 1.074
AlaSer: 2.748 ± 0.967
AlaThr: 2.498 ± 1.815
AlaVal: 1.999 ± 0.572
AlaTrp: 0.5 ± 0.699
AlaTyr: 2.498 ± 1.007
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.999 ± 0.632
CysCys: 0.0 ± 0.0
CysAsp: 0.999 ± 0.317
CysGlu: 1.249 ± 1.126
CysPhe: 1.499 ± 0.712
CysGly: 1.749 ± 1.576
CysHis: 0.25 ± 0.225
CysIle: 2.498 ± 1.007
CysLys: 2.998 ± 1.723
CysLeu: 2.498 ± 1.054
CysMet: 0.749 ± 0.223
CysAsn: 0.749 ± 0.474
CysPro: 1.749 ± 0.656
CysGln: 1.749 ± 0.7
CysArg: 0.5 ± 0.45
CysSer: 2.248 ± 1.193
CysThr: 1.249 ± 1.818
CysVal: 1.499 ± 1.351
CysTrp: 0.0 ± 0.0
CysTyr: 1.499 ± 0.445
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.249 ± 0.524
AspCys: 1.249 ± 0.928
AspAsp: 2.498 ± 0.996
AspGlu: 2.998 ± 0.834
AspPhe: 4.247 ± 1.795
AspGly: 0.999 ± 0.317
AspHis: 0.5 ± 0.45
AspIle: 6.745 ± 1.923
AspLys: 5.746 ± 1.034
AspLeu: 6.245 ± 1.423
AspMet: 1.499 ± 0.947
AspAsn: 4.497 ± 0.833
AspPro: 2.248 ± 0.783
AspGln: 2.998 ± 0.502
AspArg: 1.499 ± 0.436
AspSer: 3.747 ± 1.29
AspThr: 1.249 ± 0.353
AspVal: 1.749 ± 0.587
AspTrp: 0.999 ± 0.352
AspTyr: 2.498 ± 0.996
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.498 ± 0.596
GluCys: 1.499 ± 1.02
GluAsp: 2.498 ± 0.291
GluGlu: 3.747 ± 1.113
GluPhe: 2.998 ± 0.834
GluGly: 1.749 ± 1.039
GluHis: 1.249 ± 0.498
GluIle: 6.995 ± 0.588
GluLys: 4.247 ± 1.92
GluLeu: 6.995 ± 1.76
GluMet: 2.248 ± 1.424
GluAsn: 4.247 ± 1.203
GluPro: 1.749 ± 0.587
GluGln: 1.499 ± 0.649
GluArg: 2.498 ± 0.572
GluSer: 5.496 ± 1.006
GluThr: 4.247 ± 1.662
GluVal: 3.497 ± 0.75
GluTrp: 0.5 ± 0.159
GluTyr: 2.248 ± 1.115
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.499 ± 0.445
PheCys: 1.999 ± 0.635
PheAsp: 3.747 ± 0.379
PheGlu: 2.748 ± 0.788
PhePhe: 1.999 ± 1.356
PheGly: 2.498 ± 1.21
PheHis: 0.749 ± 0.676
PheIle: 5.496 ± 0.902
PheLys: 5.246 ± 1.42
PheLeu: 3.747 ± 2.403
PheMet: 0.5 ± 0.316
PheAsn: 3.997 ± 1.088
PhePro: 0.999 ± 0.649
PheGln: 0.749 ± 0.636
PheArg: 1.749 ± 0.803
PheSer: 4.497 ± 1.484
PheThr: 2.998 ± 0.851
PheVal: 1.249 ± 0.498
PheTrp: 0.25 ± 0.158
PheTyr: 1.749 ± 1.188
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.749 ± 0.387
GlyCys: 2.248 ± 0.811
GlyAsp: 1.749 ± 0.803
GlyGlu: 1.999 ± 0.959
GlyPhe: 1.999 ± 1.56
GlyGly: 0.749 ± 0.474
GlyHis: 0.749 ± 0.89
GlyIle: 2.998 ± 3.344
GlyLys: 1.999 ± 0.772
GlyLeu: 4.996 ± 0.878
GlyMet: 0.749 ± 1.527
GlyAsn: 3.248 ± 1.621
GlyPro: 1.249 ± 0.498
GlyGln: 0.999 ± 0.561
GlyArg: 2.748 ± 2.024
GlySer: 2.998 ± 2.407
GlyThr: 2.498 ± 1.51
GlyVal: 2.498 ± 0.306
GlyTrp: 0.5 ± 0.159
GlyTyr: 0.999 ± 0.561
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.749 ± 0.676
HisCys: 0.999 ± 0.574
HisAsp: 1.249 ± 0.503
HisGlu: 1.499 ± 0.445
HisPhe: 1.499 ± 0.533
HisGly: 1.999 ± 1.651
HisHis: 0.25 ± 0.158
HisIle: 1.999 ± 0.635
HisLys: 1.999 ± 0.703
HisLeu: 1.749 ± 0.803
HisMet: 0.25 ± 0.225
HisAsn: 0.999 ± 0.352
HisPro: 0.999 ± 0.574
HisGln: 0.0 ± 0.0
HisArg: 0.749 ± 0.636
HisSer: 1.999 ± 0.703
HisThr: 0.0 ± 0.0
HisVal: 1.499 ± 0.436
HisTrp: 0.25 ± 0.76
HisTyr: 0.5 ± 0.45
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.497 ± 0.538
IleCys: 1.749 ± 0.5
IleAsp: 4.996 ± 2.542
IleGlu: 7.494 ± 2.151
IlePhe: 3.497 ± 0.796
IleGly: 3.997 ± 1.143
IleHis: 2.998 ± 0.348
IleIle: 8.743 ± 3.181
IleLys: 9.493 ± 2.458
IleLeu: 10.242 ± 1.743
IleMet: 1.499 ± 0.604
IleAsn: 5.746 ± 1.764
IlePro: 2.248 ± 0.811
IleGln: 3.497 ± 0.294
IleArg: 4.247 ± 0.6
IleSer: 6.745 ± 0.675
IleThr: 6.745 ± 1.566
IleVal: 4.996 ± 1.423
IleTrp: 0.5 ± 0.316
IleTyr: 6.245 ± 1.526
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.247 ± 0.589
LysCys: 1.249 ± 0.797
LysAsp: 5.746 ± 0.481
LysGlu: 5.746 ± 2.192
LysPhe: 4.247 ± 1.503
LysGly: 1.999 ± 0.703
LysHis: 1.999 ± 0.568
LysIle: 8.993 ± 0.446
LysLys: 6.495 ± 1.102
LysLeu: 7.494 ± 1.431
LysMet: 2.998 ± 0.89
LysAsn: 6.495 ± 1.85
LysPro: 2.498 ± 0.569
LysGln: 1.999 ± 0.674
LysArg: 2.248 ± 0.468
LysSer: 6.995 ± 0.374
LysThr: 5.746 ± 0.866
LysVal: 4.247 ± 0.712
LysTrp: 0.5 ± 0.316
LysTyr: 5.246 ± 2.218
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.747 ± 2.305
LeuCys: 2.498 ± 1.593
LeuAsp: 5.496 ± 1.122
LeuGlu: 7.744 ± 0.75
LeuPhe: 5.746 ± 1.624
LeuGly: 2.748 ± 0.827
LeuHis: 2.498 ± 1.044
LeuIle: 8.244 ± 1.5
LeuLys: 9.993 ± 1.575
LeuLeu: 8.244 ± 1.721
LeuMet: 2.498 ± 0.741
LeuAsn: 5.746 ± 1.052
LeuPro: 5.246 ± 0.37
LeuGln: 2.748 ± 0.589
LeuArg: 2.748 ± 1.863
LeuSer: 9.993 ± 1.469
LeuThr: 5.496 ± 1.234
LeuVal: 4.247 ± 1.602
LeuTrp: 0.25 ± 0.158
LeuTyr: 3.997 ± 0.688
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.749 ± 0.223
MetCys: 0.25 ± 0.98
MetAsp: 1.749 ± 1.339
MetGlu: 1.499 ± 0.436
MetPhe: 1.499 ± 0.436
MetGly: 0.999 ± 0.802
MetHis: 0.0 ± 0.0
MetIle: 2.498 ± 0.786
MetLys: 1.749 ± 0.803
MetLeu: 2.748 ± 1.146
MetMet: 0.999 ± 1.547
MetAsn: 1.749 ± 1.027
MetPro: 1.749 ± 0.653
MetGln: 0.749 ± 0.636
MetArg: 1.499 ± 0.533
MetSer: 1.749 ± 0.587
MetThr: 0.749 ± 0.474
MetVal: 0.749 ± 0.223
MetTrp: 0.0 ± 0.0
MetTyr: 0.25 ± 0.158
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.998 ± 0.348
AsnCys: 1.999 ± 1.336
AsnAsp: 4.497 ± 1.484
AsnGlu: 2.998 ± 0.89
AsnPhe: 2.998 ± 0.502
AsnGly: 2.498 ± 0.992
AsnHis: 2.248 ± 0.43
AsnIle: 6.995 ± 2.563
AsnLys: 4.247 ± 1.242
AsnLeu: 6.995 ± 1.668
AsnMet: 2.248 ± 0.783
AsnAsn: 4.497 ± 0.571
AsnPro: 2.248 ± 1.391
AsnGln: 2.498 ± 0.291
AsnArg: 3.747 ± 0.936
AsnSer: 3.497 ± 0.602
AsnThr: 3.248 ± 0.538
AsnVal: 1.249 ± 0.498
AsnTrp: 1.249 ± 0.503
AsnTyr: 4.247 ± 1.353
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.749 ± 0.567
ProCys: 0.25 ± 0.98
ProAsp: 1.249 ± 0.789
ProGlu: 1.499 ± 0.533
ProPhe: 0.749 ± 0.223
ProGly: 2.498 ± 1.094
ProHis: 0.5 ± 0.45
ProIle: 4.996 ± 1.119
ProLys: 1.999 ± 1.031
ProLeu: 2.748 ± 1.759
ProMet: 0.25 ± 0.158
ProAsn: 1.749 ± 0.567
ProPro: 0.749 ± 0.223
ProGln: 0.5 ± 0.159
ProArg: 0.749 ± 0.223
ProSer: 1.499 ± 0.533
ProThr: 0.749 ± 0.356
ProVal: 2.748 ± 0.379
ProTrp: 0.749 ± 0.474
ProTyr: 1.249 ± 0.498
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.999 ± 0.649
GlnCys: 0.999 ± 0.574
GlnAsp: 2.498 ± 0.786
GlnGlu: 0.999 ± 0.632
GlnPhe: 2.498 ± 0.992
GlnGly: 1.999 ± 1.355
GlnHis: 0.999 ± 0.632
GlnIle: 2.998 ± 1.055
GlnLys: 2.248 ± 0.847
GlnLeu: 2.498 ± 0.306
GlnMet: 0.25 ± 0.158
GlnAsn: 1.249 ± 0.498
GlnPro: 0.5 ± 0.316
GlnGln: 0.999 ± 0.802
GlnArg: 1.499 ± 0.794
GlnSer: 1.749 ± 1.328
GlnThr: 3.497 ± 1.524
GlnVal: 0.5 ± 0.316
GlnTrp: 0.5 ± 0.737
GlnTyr: 1.249 ± 0.524
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.498 ± 0.572
ArgCys: 0.999 ± 0.352
ArgAsp: 1.499 ± 0.436
ArgGlu: 1.749 ± 0.5
ArgPhe: 1.749 ± 0.567
ArgGly: 0.999 ± 1.398
ArgHis: 1.499 ± 0.533
ArgIle: 3.248 ± 0.281
ArgLys: 2.748 ± 0.958
ArgLeu: 3.997 ± 1.341
ArgMet: 0.749 ± 0.474
ArgAsn: 2.748 ± 0.788
ArgPro: 0.25 ± 0.158
ArgGln: 1.249 ± 2.148
ArgArg: 1.999 ± 0.568
ArgSer: 2.998 ± 1.067
ArgThr: 0.749 ± 0.223
ArgVal: 1.749 ± 0.387
ArgTrp: 0.749 ± 1.922
ArgTyr: 1.749 ± 0.387
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.248 ± 0.675
SerCys: 3.248 ± 1.64
SerAsp: 3.248 ± 0.54
SerGlu: 5.496 ± 1.826
SerPhe: 1.999 ± 0.572
SerGly: 4.497 ± 2.524
SerHis: 1.249 ± 0.498
SerIle: 7.994 ± 1.023
SerLys: 7.994 ± 2.34
SerLeu: 9.493 ± 1.489
SerMet: 1.749 ± 0.7
SerAsn: 2.998 ± 0.913
SerPro: 1.749 ± 0.653
SerGln: 1.749 ± 0.656
SerArg: 1.249 ± 0.353
SerSer: 4.746 ± 2.071
SerThr: 4.996 ± 0.857
SerVal: 4.247 ± 1.973
SerTrp: 0.5 ± 0.159
SerTyr: 2.998 ± 0.89
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.498 ± 0.572
ThrCys: 1.749 ± 0.929
ThrAsp: 3.497 ± 0.602
ThrGlu: 3.747 ± 0.648
ThrPhe: 3.747 ± 1.493
ThrGly: 3.497 ± 1.571
ThrHis: 0.5 ± 0.45
ThrIle: 6.245 ± 2.138
ThrLys: 4.247 ± 0.639
ThrLeu: 5.246 ± 1.379
ThrMet: 1.249 ± 0.498
ThrAsn: 4.497 ± 2.137
ThrPro: 1.249 ± 0.353
ThrGln: 0.999 ± 0.632
ThrArg: 1.499 ± 0.649
ThrSer: 3.747 ± 1.059
ThrThr: 2.248 ± 0.652
ThrVal: 2.748 ± 0.982
ThrTrp: 1.499 ± 2.374
ThrTyr: 2.248 ± 0.847
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.999 ± 0.959
ValCys: 1.999 ± 0.616
ValAsp: 2.498 ± 0.555
ValGlu: 3.248 ± 0.344
ValPhe: 1.249 ± 0.524
ValGly: 1.249 ± 0.524
ValHis: 1.499 ± 0.649
ValIle: 4.497 ± 1.864
ValLys: 3.747 ± 0.395
ValLeu: 4.497 ± 1.781
ValMet: 0.5 ± 0.699
ValAsn: 4.746 ± 0.746
ValPro: 0.5 ± 0.159
ValGln: 1.499 ± 0.807
ValArg: 0.25 ± 0.225
ValSer: 2.748 ± 0.589
ValThr: 4.497 ± 0.538
ValVal: 1.749 ± 1.486
ValTrp: 0.0 ± 0.0
ValTyr: 1.749 ± 0.5
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.5 ± 0.159
TrpCys: 0.25 ± 0.158
TrpAsp: 0.999 ± 0.561
TrpGlu: 0.25 ± 0.158
TrpPhe: 0.749 ± 0.356
TrpGly: 0.5 ± 0.737
TrpHis: 0.25 ± 0.158
TrpIle: 1.249 ± 0.772
TrpLys: 0.25 ± 0.158
TrpLeu: 1.249 ± 0.353
TrpMet: 0.25 ± 0.76
TrpAsn: 0.749 ± 0.89
TrpPro: 0.0 ± 0.0
TrpGln: 0.25 ± 0.158
TrpArg: 0.749 ± 0.983
TrpSer: 1.499 ± 1.401
TrpThr: 0.5 ± 0.316
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.999 ± 0.317
TyrCys: 0.749 ± 0.676
TyrAsp: 2.248 ± 0.275
TyrGlu: 2.748 ± 0.917
TyrPhe: 2.248 ± 1.115
TyrGly: 1.749 ± 1.328
TyrHis: 0.749 ± 0.356
TyrIle: 4.497 ± 1.428
TyrLys: 5.746 ± 2.123
TyrLeu: 4.746 ± 1.633
TyrMet: 1.249 ± 0.721
TyrAsn: 2.998 ± 0.89
TyrPro: 0.749 ± 0.636
TyrGln: 2.248 ± 0.847
TyrArg: 1.499 ± 0.436
TyrSer: 2.748 ± 0.917
TyrThr: 2.998 ± 0.851
TyrVal: 1.499 ± 0.649
TyrTrp: 0.749 ± 0.474
TyrTyr: 1.499 ± 0.476
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4004 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
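Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported as the error. This is only an illustration under stated assumptions: the function names, the fixed seed, and the use of the sample standard deviation as the error measure are choices made for the example and are not confirmed by the source.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread (sample SD, in per mille) of one dipeptide's frequency
    across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (same number as the input).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        values.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(values)

# Toy input (hypothetical sequences, not from this proteome).
toy = ["MKKLAKK", "ILILIL", "AKKA", "GSGSKK"]
error_kk = bootstrap_error(toy, "KK")
```

With only four proteins, as in this proteome, each resample can differ strongly from the original set, which is one plausible reason some errors in the table are as large as the frequencies themselves.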

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski