Amino acid dipepetide frequency for Cuiaba virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.069AlaAla: 1.069 ± 0.698
0.802AlaCys: 0.802 ± 0.408
2.94AlaAsp: 2.94 ± 0.723
1.069AlaGlu: 1.069 ± 0.387
1.871AlaPhe: 1.871 ± 0.576
2.673AlaGly: 2.673 ± 1.097
1.604AlaHis: 1.604 ± 0.866
2.94AlaIle: 2.94 ± 0.794
2.673AlaLys: 2.673 ± 1.941
5.079AlaLeu: 5.079 ± 1.046
0.0AlaMet: 0.0 ± 0.0
1.604AlaAsn: 1.604 ± 0.621
1.604AlaPro: 1.604 ± 0.865
1.604AlaGln: 1.604 ± 0.559
1.871AlaArg: 1.871 ± 0.64
2.138AlaSer: 2.138 ± 0.893
1.069AlaThr: 1.069 ± 0.589
1.871AlaVal: 1.871 ± 1.28
0.535AlaTrp: 0.535 ± 0.413
0.802AlaTyr: 0.802 ± 0.635
0.0AlaXaa: 0.0 ± 0.0
Cys
1.069CysAla: 1.069 ± 0.404
0.0CysCys: 0.0 ± 0.0
0.802CysAsp: 0.802 ± 0.429
1.337CysGlu: 1.337 ± 0.522
1.069CysPhe: 1.069 ± 0.399
1.069CysGly: 1.069 ± 0.634
0.802CysHis: 0.802 ± 0.436
1.871CysIle: 1.871 ± 0.558
1.604CysLys: 1.604 ± 0.558
1.871CysLeu: 1.871 ± 0.628
1.069CysMet: 1.069 ± 0.833
1.871CysAsn: 1.871 ± 0.576
0.535CysPro: 0.535 ± 0.517
1.069CysGln: 1.069 ± 0.803
1.069CysArg: 1.069 ± 0.363
1.069CysSer: 1.069 ± 0.511
1.604CysThr: 1.604 ± 0.868
0.802CysVal: 0.802 ± 0.475
0.535CysTrp: 0.535 ± 0.317
1.069CysTyr: 1.069 ± 0.594
0.0CysXaa: 0.0 ± 0.0
Asp
1.871AspAla: 1.871 ± 0.558
1.069AspCys: 1.069 ± 0.511
5.346AspAsp: 5.346 ± 2.649
4.544AspGlu: 4.544 ± 0.65
3.208AspPhe: 3.208 ± 1.51
2.94AspGly: 2.94 ± 0.919
0.535AspHis: 0.535 ± 0.517
4.277AspIle: 4.277 ± 1.73
4.01AspLys: 4.01 ± 0.499
7.217AspLeu: 7.217 ± 1.762
2.673AspMet: 2.673 ± 0.785
3.208AspAsn: 3.208 ± 0.593
3.742AspPro: 3.742 ± 0.73
2.406AspGln: 2.406 ± 1.14
1.337AspArg: 1.337 ± 0.538
1.337AspSer: 1.337 ± 0.982
3.475AspThr: 3.475 ± 1.069
1.069AspVal: 1.069 ± 0.634
1.337AspTrp: 1.337 ± 0.593
3.208AspTyr: 3.208 ± 1.144
0.0AspXaa: 0.0 ± 0.0
Glu
3.475GluAla: 3.475 ± 1.623
2.138GluCys: 2.138 ± 1.051
4.812GluAsp: 4.812 ± 1.236
9.088GluGlu: 9.088 ± 2.373
3.208GluPhe: 3.208 ± 1.103
3.475GluGly: 3.475 ± 0.6
1.337GluHis: 1.337 ± 0.578
5.079GluIle: 5.079 ± 1.364
5.079GluLys: 5.079 ± 1.419
4.01GluLeu: 4.01 ± 1.718
2.138GluMet: 2.138 ± 0.683
2.138GluAsn: 2.138 ± 0.684
2.138GluPro: 2.138 ± 1.677
1.604GluGln: 1.604 ± 0.859
3.475GluArg: 3.475 ± 1.042
4.544GluSer: 4.544 ± 1.705
2.673GluThr: 2.673 ± 0.428
4.277GluVal: 4.277 ± 0.772
0.267GluTrp: 0.267 ± 0.347
2.138GluTyr: 2.138 ± 0.702
0.0GluXaa: 0.0 ± 0.0
Phe
1.069PheAla: 1.069 ± 0.363
1.069PheCys: 1.069 ± 0.392
2.406PheAsp: 2.406 ± 0.464
1.604PheGlu: 1.604 ± 0.691
2.94PhePhe: 2.94 ± 0.63
2.138PheGly: 2.138 ± 0.297
0.535PheHis: 0.535 ± 0.529
3.742PheIle: 3.742 ± 0.925
4.544PheLys: 4.544 ± 1.019
6.148PheLeu: 6.148 ± 0.859
0.802PheMet: 0.802 ± 0.326
2.673PheAsn: 2.673 ± 0.702
3.475PhePro: 3.475 ± 0.698
0.535PheGln: 0.535 ± 0.317
2.673PheArg: 2.673 ± 0.821
3.475PheSer: 3.475 ± 0.708
0.802PheThr: 0.802 ± 0.635
2.94PheVal: 2.94 ± 0.803
0.535PheTrp: 0.535 ± 0.317
1.069PheTyr: 1.069 ± 0.709
0.0PheXaa: 0.0 ± 0.0
Gly
2.138GlyAla: 2.138 ± 0.638
0.802GlyCys: 0.802 ± 0.346
2.673GlyAsp: 2.673 ± 0.758
2.94GlyGlu: 2.94 ± 0.493
4.01GlyPhe: 4.01 ± 1.426
4.01GlyGly: 4.01 ± 0.606
1.604GlyHis: 1.604 ± 0.458
4.544GlyIle: 4.544 ± 1.645
4.277GlyLys: 4.277 ± 1.578
5.079GlyLeu: 5.079 ± 1.645
1.604GlyMet: 1.604 ± 0.558
2.94GlyAsn: 2.94 ± 1.115
1.604GlyPro: 1.604 ± 0.952
2.406GlyGln: 2.406 ± 0.992
2.673GlyArg: 2.673 ± 0.661
3.742GlySer: 3.742 ± 0.934
2.673GlyThr: 2.673 ± 0.923
3.208GlyVal: 3.208 ± 2.096
1.069GlyTrp: 1.069 ± 0.634
2.94GlyTyr: 2.94 ± 0.671
0.0GlyXaa: 0.0 ± 0.0
His
0.535HisAla: 0.535 ± 0.297
0.802HisCys: 0.802 ± 0.408
0.802HisAsp: 0.802 ± 0.408
1.604HisGlu: 1.604 ± 0.866
1.604HisPhe: 1.604 ± 0.741
0.267HisGly: 0.267 ± 0.158
0.535HisHis: 0.535 ± 0.317
1.337HisIle: 1.337 ± 0.602
1.604HisLys: 1.604 ± 0.409
1.337HisLeu: 1.337 ± 0.511
0.802HisMet: 0.802 ± 0.408
0.0HisAsn: 0.0 ± 0.0
2.406HisPro: 2.406 ± 0.915
0.802HisGln: 0.802 ± 0.425
1.337HisArg: 1.337 ± 0.342
1.337HisSer: 1.337 ± 1.024
0.802HisThr: 0.802 ± 0.503
0.267HisVal: 0.267 ± 0.158
0.535HisTrp: 0.535 ± 0.695
1.069HisTyr: 1.069 ± 0.363
0.0HisXaa: 0.0 ± 0.0
Ile
2.138IleAla: 2.138 ± 1.11
1.069IleCys: 1.069 ± 0.404
4.01IleAsp: 4.01 ± 0.428
3.208IleGlu: 3.208 ± 1.603
2.406IlePhe: 2.406 ± 0.966
6.148IleGly: 6.148 ± 1.278
1.871IleHis: 1.871 ± 0.867
5.346IleIle: 5.346 ± 1.124
5.881IleLys: 5.881 ± 2.017
5.613IleLeu: 5.613 ± 1.31
3.208IleMet: 3.208 ± 1.002
3.742IleAsn: 3.742 ± 1.271
3.475IlePro: 3.475 ± 0.489
3.208IleGln: 3.208 ± 0.907
4.544IleArg: 4.544 ± 1.098
8.019IleSer: 8.019 ± 2.124
6.148IleThr: 6.148 ± 0.881
4.01IleVal: 4.01 ± 0.702
2.406IleTrp: 2.406 ± 0.806
1.604IleTyr: 1.604 ± 0.951
0.0IleXaa: 0.0 ± 0.0
Lys
2.138LysAla: 2.138 ± 0.726
1.337LysCys: 1.337 ± 0.342
4.544LysAsp: 4.544 ± 1.003
5.346LysGlu: 5.346 ± 1.186
4.277LysPhe: 4.277 ± 1.365
4.277LysGly: 4.277 ± 1.344
1.871LysHis: 1.871 ± 1.428
4.812LysIle: 4.812 ± 1.194
6.95LysLys: 6.95 ± 1.607
7.217LysLeu: 7.217 ± 1.716
2.138LysMet: 2.138 ± 0.366
3.475LysAsn: 3.475 ± 1.468
4.01LysPro: 4.01 ± 1.322
2.406LysGln: 2.406 ± 0.421
5.613LysArg: 5.613 ± 1.11
3.742LysSer: 3.742 ± 1.126
6.148LysThr: 6.148 ± 2.423
3.742LysVal: 3.742 ± 0.918
1.604LysTrp: 1.604 ± 0.559
1.604LysTyr: 1.604 ± 0.727
0.0LysXaa: 0.0 ± 0.0
Leu
3.475LeuAla: 3.475 ± 1.133
1.871LeuCys: 1.871 ± 0.645
4.01LeuAsp: 4.01 ± 1.065
7.485LeuGlu: 7.485 ± 1.671
3.742LeuPhe: 3.742 ± 1.172
5.346LeuGly: 5.346 ± 1.061
0.802LeuHis: 0.802 ± 0.31
7.485LeuIle: 7.485 ± 1.5
6.95LeuLys: 6.95 ± 0.874
5.613LeuLeu: 5.613 ± 1.15
4.01LeuMet: 4.01 ± 1.471
5.881LeuAsn: 5.881 ± 1.492
3.475LeuPro: 3.475 ± 0.697
1.337LeuGln: 1.337 ± 0.901
6.683LeuArg: 6.683 ± 1.54
10.425LeuSer: 10.425 ± 2.028
4.544LeuThr: 4.544 ± 1.538
3.475LeuVal: 3.475 ± 0.512
0.802LeuTrp: 0.802 ± 0.475
5.079LeuTyr: 5.079 ± 1.001
0.0LeuXaa: 0.0 ± 0.0
Met
0.802MetAla: 0.802 ± 0.346
1.069MetCys: 1.069 ± 0.404
2.406MetAsp: 2.406 ± 1.266
1.069MetGlu: 1.069 ± 0.996
2.138MetPhe: 2.138 ± 0.667
1.069MetGly: 1.069 ± 0.577
0.535MetHis: 0.535 ± 0.71
3.475MetIle: 3.475 ± 1.36
2.138MetLys: 2.138 ± 0.647
1.871MetLeu: 1.871 ± 0.517
0.802MetMet: 0.802 ± 0.854
1.337MetAsn: 1.337 ± 0.593
0.535MetPro: 0.535 ± 0.498
0.535MetGln: 0.535 ± 0.317
1.604MetArg: 1.604 ± 0.558
3.475MetSer: 3.475 ± 0.925
1.604MetThr: 1.604 ± 1.142
1.337MetVal: 1.337 ± 0.393
0.802MetTrp: 0.802 ± 0.31
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.673AsnAla: 2.673 ± 0.63
1.604AsnCys: 1.604 ± 0.698
2.673AsnAsp: 2.673 ± 0.37
1.871AsnGlu: 1.871 ± 0.806
0.802AsnPhe: 0.802 ± 0.451
1.871AsnGly: 1.871 ± 0.86
0.802AsnHis: 0.802 ± 0.31
2.673AsnIle: 2.673 ± 1.262
3.208AsnLys: 3.208 ± 0.59
5.613AsnLeu: 5.613 ± 2.015
1.337AsnMet: 1.337 ± 0.779
3.742AsnAsn: 3.742 ± 1.557
3.475AsnPro: 3.475 ± 1.08
1.604AsnGln: 1.604 ± 0.69
2.406AsnArg: 2.406 ± 0.832
3.208AsnSer: 3.208 ± 0.587
4.01AsnThr: 4.01 ± 1.172
1.871AsnVal: 1.871 ± 0.85
0.535AsnTrp: 0.535 ± 0.317
1.604AsnTyr: 1.604 ± 0.662
0.0AsnXaa: 0.0 ± 0.0
Pro
2.138ProAla: 2.138 ± 1.11
1.337ProCys: 1.337 ± 0.529
3.208ProAsp: 3.208 ± 1.526
2.138ProGlu: 2.138 ± 0.904
1.069ProPhe: 1.069 ± 0.399
1.871ProGly: 1.871 ± 1.499
0.535ProHis: 0.535 ± 0.695
2.94ProIle: 2.94 ± 0.728
2.138ProLys: 2.138 ± 0.599
4.812ProLeu: 4.812 ± 1.312
0.802ProMet: 0.802 ± 0.429
2.673ProAsn: 2.673 ± 0.982
2.94ProPro: 2.94 ± 0.99
0.802ProGln: 0.802 ± 0.425
2.138ProArg: 2.138 ± 0.956
5.881ProSer: 5.881 ± 2.163
1.337ProThr: 1.337 ± 0.602
3.742ProVal: 3.742 ± 1.324
0.0ProTrp: 0.0 ± 0.0
1.604ProTyr: 1.604 ± 0.748
0.0ProXaa: 0.0 ± 0.0
Gln
1.337GlnAla: 1.337 ± 0.522
0.802GlnCys: 0.802 ± 0.31
1.604GlnAsp: 1.604 ± 0.246
2.406GlnGlu: 2.406 ± 1.027
0.535GlnPhe: 0.535 ± 0.297
2.138GlnGly: 2.138 ± 1.44
0.802GlnHis: 0.802 ± 0.475
1.871GlnIle: 1.871 ± 0.717
2.406GlnLys: 2.406 ± 1.249
3.208GlnLeu: 3.208 ± 1.26
0.802GlnMet: 0.802 ± 0.475
2.406GlnAsn: 2.406 ± 0.923
0.267GlnPro: 0.267 ± 0.158
0.802GlnGln: 0.802 ± 0.429
1.871GlnArg: 1.871 ± 0.861
4.812GlnSer: 4.812 ± 0.587
1.337GlnThr: 1.337 ± 0.349
1.069GlnVal: 1.069 ± 0.577
0.535GlnTrp: 0.535 ± 0.695
0.802GlnTyr: 0.802 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
1.337ArgAla: 1.337 ± 0.819
1.337ArgCys: 1.337 ± 1.131
3.742ArgAsp: 3.742 ± 0.609
3.742ArgGlu: 3.742 ± 0.569
2.673ArgPhe: 2.673 ± 0.752
4.01ArgGly: 4.01 ± 0.979
0.802ArgHis: 0.802 ± 0.475
4.544ArgIle: 4.544 ± 1.848
4.01ArgLys: 4.01 ± 1.443
3.742ArgLeu: 3.742 ± 0.651
0.535ArgMet: 0.535 ± 0.415
1.604ArgAsn: 1.604 ± 0.246
2.406ArgPro: 2.406 ± 0.615
1.871ArgGln: 1.871 ± 0.697
1.871ArgArg: 1.871 ± 0.706
4.544ArgSer: 4.544 ± 0.735
2.673ArgThr: 2.673 ± 0.753
2.94ArgVal: 2.94 ± 0.682
0.267ArgTrp: 0.267 ± 0.455
3.208ArgTyr: 3.208 ± 0.996
0.0ArgXaa: 0.0 ± 0.0
Ser
4.01SerAla: 4.01 ± 0.887
2.406SerCys: 2.406 ± 0.857
4.812SerAsp: 4.812 ± 1.454
6.683SerGlu: 6.683 ± 1.924
2.94SerPhe: 2.94 ± 1.26
4.277SerGly: 4.277 ± 1.553
2.138SerHis: 2.138 ± 0.799
7.485SerIle: 7.485 ± 1.319
6.683SerLys: 6.683 ± 1.228
7.485SerLeu: 7.485 ± 0.93
1.337SerMet: 1.337 ± 0.792
2.94SerAsn: 2.94 ± 1.451
2.673SerPro: 2.673 ± 0.73
3.742SerGln: 3.742 ± 0.88
4.277SerArg: 4.277 ± 0.83
6.95SerSer: 6.95 ± 2.437
2.94SerThr: 2.94 ± 0.556
4.812SerVal: 4.812 ± 2.165
2.138SerTrp: 2.138 ± 0.726
2.406SerTyr: 2.406 ± 1.009
0.0SerXaa: 0.0 ± 0.0
Thr
1.337ThrAla: 1.337 ± 0.918
0.802ThrCys: 0.802 ± 0.638
2.94ThrAsp: 2.94 ± 0.587
3.475ThrGlu: 3.475 ± 0.532
1.337ThrPhe: 1.337 ± 0.365
4.01ThrGly: 4.01 ± 0.771
0.802ThrHis: 0.802 ± 0.317
5.346ThrIle: 5.346 ± 1.497
4.812ThrLys: 4.812 ± 1.562
5.613ThrLeu: 5.613 ± 1.447
1.604ThrMet: 1.604 ± 0.627
0.802ThrAsn: 0.802 ± 0.31
1.069ThrPro: 1.069 ± 0.709
1.604ThrGln: 1.604 ± 0.56
1.871ThrArg: 1.871 ± 0.601
4.544ThrSer: 4.544 ± 1.455
2.94ThrThr: 2.94 ± 1.243
2.138ThrVal: 2.138 ± 0.708
2.406ThrTrp: 2.406 ± 0.829
2.406ThrTyr: 2.406 ± 0.801
0.0ThrXaa: 0.0 ± 0.0
Val
2.406ValAla: 2.406 ± 0.794
1.069ValCys: 1.069 ± 0.634
2.673ValAsp: 2.673 ± 0.752
2.673ValGlu: 2.673 ± 0.887
2.138ValPhe: 2.138 ± 0.708
1.871ValGly: 1.871 ± 0.642
0.802ValHis: 0.802 ± 0.475
4.277ValIle: 4.277 ± 1.126
4.01ValLys: 4.01 ± 0.903
5.079ValLeu: 5.079 ± 1.346
1.871ValMet: 1.871 ± 0.931
2.673ValAsn: 2.673 ± 0.592
1.604ValPro: 1.604 ± 0.44
1.069ValGln: 1.069 ± 0.904
1.871ValArg: 1.871 ± 0.517
5.079ValSer: 5.079 ± 1.259
2.673ValThr: 2.673 ± 0.788
2.94ValVal: 2.94 ± 1.133
0.802ValTrp: 0.802 ± 0.806
4.01ValTyr: 4.01 ± 1.417
0.0ValXaa: 0.0 ± 0.0
Trp
0.267TrpAla: 0.267 ± 0.347
0.0TrpCys: 0.0 ± 0.0
0.535TrpAsp: 0.535 ± 0.297
1.604TrpGlu: 1.604 ± 0.69
0.802TrpPhe: 0.802 ± 0.317
0.802TrpGly: 0.802 ± 0.31
0.535TrpHis: 0.535 ± 0.289
1.337TrpIle: 1.337 ± 0.525
0.802TrpLys: 0.802 ± 0.317
2.673TrpLeu: 2.673 ± 0.998
1.069TrpMet: 1.069 ± 0.639
1.069TrpAsn: 1.069 ± 0.399
0.535TrpPro: 0.535 ± 0.498
0.802TrpGln: 0.802 ± 0.425
0.535TrpArg: 0.535 ± 0.431
0.802TrpSer: 0.802 ± 0.475
0.802TrpThr: 0.802 ± 0.425
1.871TrpVal: 1.871 ± 0.865
0.0TrpTrp: 0.0 ± 0.0
0.535TrpTyr: 0.535 ± 0.289
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.535TyrAla: 0.535 ± 0.517
0.802TyrCys: 0.802 ± 0.475
1.871TyrAsp: 1.871 ± 0.584
3.208TyrGlu: 3.208 ± 1.676
2.406TyrPhe: 2.406 ± 0.923
2.673TyrGly: 2.673 ± 0.558
0.535TyrHis: 0.535 ± 0.498
2.673TyrIle: 2.673 ± 0.861
3.475TyrLys: 3.475 ± 0.81
3.208TyrLeu: 3.208 ± 0.947
0.0TyrMet: 0.0 ± 0.0
1.069TyrAsn: 1.069 ± 0.399
2.138TyrPro: 2.138 ± 0.956
1.604TyrGln: 1.604 ± 1.239
1.871TyrArg: 1.871 ± 0.736
4.01TyrSer: 4.01 ± 2.428
1.604TyrThr: 1.604 ± 0.608
3.208TyrVal: 3.208 ± 0.993
0.267TyrTrp: 0.267 ± 0.347
1.337TyrTyr: 1.337 ± 0.414
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3742 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski