Amino acid dipeptide frequency for Caimito virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
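As a rough illustration of how such frequencies can be computed, here is a minimal sketch. It assumes that dipeptides are counted as overlapping windows of two residues within each protein and that any non-standard symbol is mapped to a placeholder (Xaa, written here as X); the function name and the toy sequences are hypothetical and not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(sequences, unknown="X"):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: dipeptides are overlapping windows of length 2 inside each
    protein and are never counted across protein boundaries.
    """
    alphabet = AMINO_ACIDS + unknown
    counts = Counter()
    for seq in sequences:
        # Map any non-standard symbol to the placeholder residue (Xaa).
        cleaned = [aa if aa in AMINO_ACIDS else unknown for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    # Report all 21 x 21 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


freqs = dipeptide_frequencies(["MKKLA", "AKK"])  # made-up toy sequences
print(len(freqs))   # number of dipeptide types, including those with X
print(freqs["KK"])  # KK occupies 2 of the 6 windows
```

Returning every combination, observed or not, mirrors the table below, where absent dipeptides are listed with a value of 0.0.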

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
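The uniform expectation and the over/under-representation labels can be sketched as follows. This is only an illustration: the page does not state whether the threshold is based on 400 or 441 dipeptides, so the alphabet size is a parameter, and the function names are hypothetical.

```python
def expected_per_mille(n_residues=20):
    """Frequency of one dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(value_per_mille, n_residues=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(n_residues)
    if value_per_mille > expected:
        return "over"   # marked in red in the table
    if value_per_mille < expected:
        return "under"  # marked in blue in the table
    return "expected"


print(expected_per_mille(20))  # 2.5 per mille, i.e. 0.25%
print(expected_per_mille(21))  # slightly lower when Xaa is included
print(classify(9.427))         # IleLys value from the table
print(classify(0.0))           # CysCys value from the table
```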

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
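For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are hypothetical; the example values are the IleLys entry from the table and the uniform expectation of 2.5‰.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


print(per_mille_to_fraction(9.427))  # IleLys as a fraction
print(per_mille_to_percent(2.5))     # uniform expectation in percent
```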

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.038 ± 1.725
AlaCys: 1.274 ± 0.769
AlaAsp: 2.038 ± 0.493
AlaGlu: 3.312 ± 0.923
AlaPhe: 1.783 ± 0.568
AlaGly: 2.038 ± 3.149
AlaHis: 0.51 ± 0.125
AlaIle: 5.605 ± 1.451
AlaLys: 3.822 ± 0.187
AlaLeu: 3.567 ± 0.987
AlaMet: 1.783 ± 1.364
AlaAsn: 2.548 ± 0.625
AlaPro: 1.783 ± 1.36
AlaGln: 0.764 ± 0.184
AlaArg: 2.803 ± 0.841
AlaSer: 2.293 ± 0.552
AlaThr: 2.548 ± 2.05
AlaVal: 3.312 ± 0.876
AlaTrp: 0.255 ± 0.163
AlaTyr: 1.529 ± 0.368
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.019 ± 0.325
CysCys: 0.0 ± 0.0
CysAsp: 0.255 ± 0.225
CysGlu: 2.038 ± 0.763
CysPhe: 2.548 ± 0.878
CysGly: 2.548 ± 2.254
CysHis: 0.51 ± 0.125
CysIle: 2.803 ± 0.8
CysLys: 2.803 ± 2.119
CysLeu: 2.038 ± 0.763
CysMet: 0.764 ± 0.883
CysAsn: 3.057 ± 1.405
CysPro: 1.783 ± 0.557
CysGln: 1.019 ± 0.25
CysArg: 0.764 ± 0.326
CysSer: 2.548 ± 0.537
CysThr: 1.274 ± 0.48
CysVal: 0.51 ± 0.451
CysTrp: 0.0 ± 0.0
CysTyr: 1.274 ± 0.269
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.038 ± 1.353
AspCys: 1.529 ± 0.652
AspAsp: 3.567 ± 1.111
AspGlu: 3.822 ± 0.92
AspPhe: 2.548 ± 0.634
AspGly: 2.548 ± 0.625
AspHis: 0.51 ± 0.125
AspIle: 7.643 ± 0.984
AspLys: 3.567 ± 1.114
AspLeu: 6.369 ± 0.536
AspMet: 1.783 ± 0.772
AspAsn: 1.783 ± 1.36
AspPro: 1.529 ± 0.981
AspGln: 3.312 ± 1.274
AspArg: 2.293 ± 1.264
AspSer: 4.331 ± 0.823
AspThr: 2.293 ± 0.381
AspVal: 3.312 ± 0.514
AspTrp: 0.764 ± 1.563
AspTyr: 3.057 ± 1.613
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.057 ± 0.244
GluCys: 2.293 ± 0.678
GluAsp: 2.803 ± 0.424
GluGlu: 4.586 ± 0.134
GluPhe: 2.803 ± 1.471
GluGly: 1.274 ± 0.269
GluHis: 1.529 ± 0.368
GluIle: 7.389 ± 1.595
GluLys: 3.567 ± 1.539
GluLeu: 5.096 ± 0.258
GluMet: 3.312 ± 1.588
GluAsn: 4.586 ± 0.667
GluPro: 2.038 ± 0.649
GluGln: 2.038 ± 0.443
GluArg: 3.822 ± 1.763
GluSer: 5.35 ± 0.262
GluThr: 3.312 ± 1.028
GluVal: 2.038 ± 0.763
GluTrp: 0.51 ± 0.125
GluTyr: 2.803 ± 0.277
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.764 ± 0.184
PheCys: 2.293 ± 0.552
PheAsp: 3.567 ± 0.12
PheGlu: 4.841 ± 1.823
PhePhe: 3.567 ± 1.367
PheGly: 2.038 ± 0.5
PheHis: 1.783 ± 0.502
PheIle: 2.548 ± 0.537
PheLys: 5.35 ± 1.467
PheLeu: 5.86 ± 1.239
PheMet: 1.274 ± 0.612
PheAsn: 2.548 ± 0.974
PhePro: 0.764 ± 0.184
PheGln: 1.019 ± 0.654
PheArg: 1.529 ± 0.981
PheSer: 5.096 ± 1.583
PheThr: 2.803 ± 1.174
PheVal: 1.783 ± 0.376
PheTrp: 0.51 ± 0.125
PheTyr: 1.274 ± 0.269
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.764 ± 0.72
GlyCys: 3.312 ± 0.514
GlyAsp: 2.803 ± 1.219
GlyGlu: 3.822 ± 1.317
GlyPhe: 1.783 ± 0.502
GlyGly: 1.783 ± 0.568
GlyHis: 0.255 ± 0.225
GlyIle: 2.293 ± 0.678
GlyLys: 3.312 ± 0.655
GlyLeu: 5.096 ± 1.074
GlyMet: 0.255 ± 0.225
GlyAsn: 3.567 ± 1.114
GlyPro: 1.783 ± 0.376
GlyGln: 2.548 ± 0.537
GlyArg: 1.783 ± 0.557
GlySer: 2.548 ± 1.539
GlyThr: 2.548 ± 1.363
GlyVal: 2.548 ± 2.077
GlyTrp: 0.764 ± 0.184
GlyTyr: 1.783 ± 0.871
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.274 ± 0.439
HisCys: 1.019 ± 0.325
HisAsp: 1.274 ± 0.269
HisGlu: 0.764 ± 0.184
HisPhe: 1.019 ± 0.546
HisGly: 1.529 ± 0.368
HisHis: 0.255 ± 0.163
HisIle: 1.019 ± 0.25
HisLys: 1.274 ± 0.769
HisLeu: 1.529 ± 0.368
HisMet: 0.764 ± 0.326
HisAsn: 1.019 ± 0.25
HisPro: 0.51 ± 0.125
HisGln: 0.255 ± 0.163
HisArg: 0.51 ± 0.125
HisSer: 0.764 ± 0.72
HisThr: 1.274 ± 0.269
HisVal: 1.274 ± 0.439
HisTrp: 0.0 ± 0.0
HisTyr: 0.764 ± 0.676
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.586 ± 1.044
IleCys: 1.529 ± 0.994
IleAsp: 5.096 ± 0.654
IleGlu: 5.86 ± 0.777
IlePhe: 3.312 ± 0.201
IleGly: 3.822 ± 1.149
IleHis: 2.293 ± 0.977
IleIle: 6.369 ± 0.781
IleLys: 9.427 ± 1.536
IleLeu: 5.605 ± 0.352
IleMet: 2.548 ± 0.625
IleAsn: 4.841 ± 1.23
IlePro: 2.293 ± 0.552
IleGln: 3.822 ± 0.187
IleArg: 5.096 ± 1.365
IleSer: 8.408 ± 2.023
IleThr: 5.096 ± 0.769
IleVal: 5.096 ± 0.769
IleTrp: 0.764 ± 0.49
IleTyr: 1.274 ± 0.269
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.567 ± 0.997
LysCys: 2.548 ± 1.539
LysAsp: 6.115 ± 1.48
LysGlu: 4.586 ± 1.103
LysPhe: 2.803 ± 1.137
LysGly: 4.331 ± 1.026
LysHis: 3.057 ± 0.346
LysIle: 5.605 ± 0.721
LysLys: 5.35 ± 0.651
LysLeu: 4.586 ± 0.303
LysMet: 3.312 ± 0.923
LysAsn: 2.548 ± 0.663
LysPro: 3.312 ± 0.865
LysGln: 2.038 ± 0.493
LysArg: 5.86 ± 0.571
LysSer: 5.096 ± 1.104
LysThr: 7.134 ± 2.17
LysVal: 5.35 ± 1.481
LysTrp: 0.51 ± 0.327
LysTyr: 2.293 ± 0.522
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.841 ± 1.171
LeuCys: 2.548 ± 0.663
LeuAsp: 5.35 ± 0.262
LeuGlu: 5.86 ± 0.482
LeuPhe: 6.624 ± 1.467
LeuGly: 4.076 ± 1.526
LeuHis: 1.274 ± 0.48
LeuIle: 6.879 ± 1.445
LeuLys: 6.369 ± 2.288
LeuLeu: 8.153 ± 1.542
LeuMet: 2.293 ± 0.803
LeuAsn: 5.605 ± 0.555
LeuPro: 3.312 ± 0.655
LeuGln: 3.057 ± 0.642
LeuArg: 2.038 ± 0.747
LeuSer: 5.096 ± 0.968
LeuThr: 4.841 ± 1.23
LeuVal: 3.312 ± 0.923
LeuTrp: 1.019 ± 0.325
LeuTyr: 3.312 ± 1.028
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.548 ± 2.05
MetCys: 0.255 ± 0.225
MetAsp: 1.529 ± 0.546
MetGlu: 1.529 ± 0.546
MetPhe: 1.529 ± 0.863
MetGly: 0.764 ± 0.184
MetHis: 0.255 ± 0.225
MetIle: 3.312 ± 0.865
MetLys: 2.038 ± 2.174
MetLeu: 3.312 ± 0.733
MetMet: 0.764 ± 0.49
MetAsn: 2.293 ± 0.552
MetPro: 0.51 ± 0.327
MetGln: 0.764 ± 0.184
MetArg: 1.274 ± 0.269
MetSer: 3.822 ± 0.572
MetThr: 2.038 ± 0.443
MetVal: 1.019 ± 0.752
MetTrp: 0.0 ± 0.0
MetTyr: 0.764 ± 0.184
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.057 ± 0.678
AsnCys: 0.764 ± 0.326
AsnAsp: 5.605 ± 0.352
AsnGlu: 3.057 ± 0.995
AsnPhe: 2.803 ± 1.399
AsnGly: 1.529 ± 0.368
AsnHis: 0.764 ± 0.49
AsnIle: 3.057 ± 0.642
AsnLys: 3.312 ± 2.125
AsnLeu: 5.35 ± 1.129
AsnMet: 2.548 ± 0.625
AsnAsn: 3.312 ± 0.865
AsnPro: 4.076 ± 0.986
AsnGln: 4.076 ± 0.397
AsnArg: 3.057 ± 1.038
AsnSer: 2.548 ± 0.683
AsnThr: 3.822 ± 2.65
AsnVal: 1.783 ± 0.494
AsnTrp: 1.783 ± 0.557
AsnTyr: 3.057 ± 0.974
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.274 ± 0.846
ProCys: 0.51 ± 0.125
ProAsp: 2.038 ± 0.493
ProGlu: 2.038 ± 1.073
ProPhe: 1.529 ± 0.375
ProGly: 2.548 ± 1.287
ProHis: 0.51 ± 0.125
ProIle: 3.567 ± 0.351
ProLys: 1.529 ± 0.652
ProLeu: 2.803 ± 1.137
ProMet: 0.764 ± 0.184
ProAsn: 2.293 ± 0.552
ProPro: 0.764 ± 0.184
ProGln: 0.51 ± 0.327
ProArg: 1.019 ± 0.325
ProSer: 2.293 ± 0.552
ProThr: 2.038 ± 0.5
ProVal: 2.038 ± 0.5
ProTrp: 0.51 ± 0.327
ProTyr: 1.529 ± 0.368
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.529 ± 0.618
GlnCys: 0.764 ± 0.326
GlnAsp: 2.038 ± 0.443
GlnGlu: 2.038 ± 0.443
GlnPhe: 2.548 ± 0.368
GlnGly: 1.783 ± 0.557
GlnHis: 0.764 ± 0.676
GlnIle: 2.548 ± 0.683
GlnLys: 4.331 ± 1.486
GlnLeu: 2.293 ± 0.678
GlnMet: 0.51 ± 0.327
GlnAsn: 1.783 ± 0.666
GlnPro: 0.255 ± 0.163
GlnGln: 1.019 ± 0.25
GlnArg: 2.803 ± 1.106
GlnSer: 3.057 ± 0.974
GlnThr: 2.293 ± 0.492
GlnVal: 1.274 ± 0.269
GlnTrp: 0.51 ± 0.125
GlnTyr: 1.529 ± 0.981
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.529 ± 0.368
ArgCys: 1.019 ± 0.546
ArgAsp: 3.312 ± 0.709
ArgGlu: 2.548 ± 0.368
ArgPhe: 2.803 ± 0.424
ArgGly: 2.548 ± 0.537
ArgHis: 0.51 ± 0.125
ArgIle: 3.822 ± 0.572
ArgLys: 2.548 ± 0.683
ArgLeu: 4.076 ± 0.873
ArgMet: 1.019 ± 1.532
ArgAsn: 4.331 ± 2.087
ArgPro: 0.764 ± 0.732
ArgGln: 1.529 ± 0.368
ArgArg: 3.822 ± 0.204
ArgSer: 3.057 ± 0.736
ArgThr: 2.803 ± 0.841
ArgVal: 2.038 ± 0.649
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.529 ± 0.981
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.312 ± 1.516
SerCys: 3.312 ± 1.201
SerAsp: 3.822 ± 1.44
SerGlu: 3.312 ± 1.127
SerPhe: 2.548 ± 0.368
SerGly: 3.057 ± 1.038
SerHis: 0.764 ± 0.184
SerIle: 9.682 ± 1.63
SerLys: 7.898 ± 1.392
SerLeu: 6.115 ± 0.468
SerMet: 2.293 ± 1.125
SerAsn: 2.548 ± 1.005
SerPro: 2.038 ± 0.443
SerGln: 2.803 ± 0.825
SerArg: 3.057 ± 0.736
SerSer: 6.369 ± 2.101
SerThr: 4.076 ± 1.0
SerVal: 3.312 ± 1.44
SerTrp: 0.51 ± 0.757
SerTyr: 3.312 ± 0.865
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.076 ± 0.724
ThrCys: 2.293 ± 1.315
ThrAsp: 2.548 ± 1.45
ThrGlu: 3.822 ± 1.783
ThrPhe: 3.312 ± 1.028
ThrGly: 3.567 ± 1.114
ThrHis: 1.274 ± 0.769
ThrIle: 5.096 ± 2.703
ThrLys: 4.586 ± 0.985
ThrLeu: 4.586 ± 1.088
ThrMet: 1.274 ± 0.789
ThrAsn: 3.312 ± 0.514
ThrPro: 2.293 ± 0.678
ThrGln: 1.783 ± 0.376
ThrArg: 1.529 ± 0.863
ThrSer: 4.331 ± 0.909
ThrThr: 1.783 ± 0.666
ThrVal: 2.548 ± 0.385
ThrTrp: 1.019 ± 0.25
ThrTyr: 3.312 ± 0.655
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.038 ± 1.295
ValCys: 1.783 ± 0.871
ValAsp: 2.038 ± 0.443
ValGlu: 3.822 ± 0.554
ValPhe: 2.038 ± 0.649
ValGly: 2.038 ± 2.159
ValHis: 0.51 ± 0.125
ValIle: 3.567 ± 0.715
ValLys: 4.076 ± 0.397
ValLeu: 5.35 ± 1.152
ValMet: 0.764 ± 0.184
ValAsn: 3.057 ± 0.244
ValPro: 1.783 ± 1.218
ValGln: 1.783 ± 0.871
ValArg: 1.529 ± 0.863
ValSer: 3.567 ± 1.137
ValThr: 2.548 ± 1.287
ValVal: 4.586 ± 0.834
ValTrp: 0.0 ± 0.0
ValTyr: 3.057 ± 1.303
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.255 ± 0.225
TrpCys: 0.255 ± 0.225
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.255 ± 0.163
TrpPhe: 1.783 ± 0.376
TrpGly: 0.51 ± 0.451
TrpHis: 0.0 ± 0.0
TrpIle: 0.51 ± 0.125
TrpLys: 0.255 ± 0.814
TrpLeu: 1.529 ± 0.702
TrpMet: 0.0 ± 0.0
TrpAsn: 0.255 ± 0.163
TrpPro: 0.0 ± 0.0
TrpGln: 0.764 ± 0.732
TrpArg: 0.0 ± 0.0
TrpSer: 1.783 ± 1.144
TrpThr: 0.51 ± 0.327
TrpVal: 0.764 ± 0.184
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.51 ± 0.327
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.038 ± 0.763
TyrCys: 0.255 ± 0.225
TyrAsp: 2.803 ± 0.277
TyrGlu: 2.548 ± 0.634
TyrPhe: 1.529 ± 0.981
TyrGly: 1.019 ± 0.654
TyrHis: 0.764 ± 0.676
TyrIle: 3.822 ± 0.806
TyrLys: 4.586 ± 0.977
TyrLeu: 2.548 ± 0.625
TyrMet: 2.038 ± 0.747
TyrAsn: 4.076 ± 1.097
TyrPro: 0.51 ± 0.125
TyrGln: 0.764 ± 0.326
TyrArg: 1.019 ± 0.654
TyrSer: 1.783 ± 0.502
TyrThr: 3.312 ± 0.201
TyrVal: 2.038 ± 0.649
TyrTrp: 0.51 ± 0.125
TyrTyr: 1.019 ± 0.25
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3926 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
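The exact bootstrap procedure is not described on this page. The sketch below shows one plausible reading: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the seed, and the toy sequences are assumptions made for illustration only.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (overlapping windows) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


proteins = ["MKKLAIK", "AKKSIL", "MSIKKT"]  # made-up toy sequences
print(dipeptide_per_mille(proteins, "KK"))
print(bootstrap_error(proteins, "KK"))
```

With only three proteins, as in this proteome, each resample can differ strongly from the original set, which may explain why some errors in the table are as large as the values themselves.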

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski