Amino acid dipepetide frequency for Capybara microvirus Cap1_SP_158

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.345AlaAla: 8.345 ± 6.125
1.391AlaCys: 1.391 ± 1.271
4.868AlaAsp: 4.868 ± 2.402
5.563AlaGlu: 5.563 ± 1.002
2.782AlaPhe: 2.782 ± 0.61
4.172AlaGly: 4.172 ± 1.179
3.477AlaHis: 3.477 ± 1.671
4.868AlaIle: 4.868 ± 2.166
4.172AlaLys: 4.172 ± 1.958
4.868AlaLeu: 4.868 ± 1.602
2.086AlaMet: 2.086 ± 1.083
4.172AlaAsn: 4.172 ± 1.741
3.477AlaPro: 3.477 ± 1.59
3.477AlaGln: 3.477 ± 1.898
2.782AlaArg: 2.782 ± 1.171
6.259AlaSer: 6.259 ± 3.153
1.391AlaThr: 1.391 ± 0.691
2.782AlaVal: 2.782 ± 0.987
2.086AlaTrp: 2.086 ± 0.912
2.086AlaTyr: 2.086 ± 1.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.695CysCys: 0.695 ± 0.874
0.695CysAsp: 0.695 ± 0.457
2.086CysGlu: 2.086 ± 1.204
0.0CysPhe: 0.0 ± 0.0
2.086CysGly: 2.086 ± 1.907
0.0CysHis: 0.0 ± 0.0
0.695CysIle: 0.695 ± 0.874
0.0CysLys: 0.0 ± 0.0
3.477CysLeu: 3.477 ± 1.59
0.0CysMet: 0.0 ± 0.0
1.391CysAsn: 1.391 ± 0.556
0.0CysPro: 0.0 ± 0.0
0.695CysGln: 0.695 ± 0.636
0.695CysArg: 0.695 ± 0.636
1.391CysSer: 1.391 ± 1.271
0.695CysThr: 0.695 ± 0.874
1.391CysVal: 1.391 ± 1.271
1.391CysTrp: 1.391 ± 0.556
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.477AspAla: 3.477 ± 1.055
1.391AspCys: 1.391 ± 0.556
2.782AspAsp: 2.782 ± 1.171
2.782AspGlu: 2.782 ± 1.167
2.782AspPhe: 2.782 ± 0.833
3.477AspGly: 3.477 ± 1.59
0.695AspHis: 0.695 ± 0.457
2.086AspIle: 2.086 ± 1.083
2.782AspLys: 2.782 ± 1.146
4.172AspLeu: 4.172 ± 1.27
1.391AspMet: 1.391 ± 0.914
3.477AspAsn: 3.477 ± 1.065
0.695AspPro: 0.695 ± 0.636
1.391AspGln: 1.391 ± 0.914
3.477AspArg: 3.477 ± 1.102
6.259AspSer: 6.259 ± 1.132
4.868AspThr: 4.868 ± 2.725
3.477AspVal: 3.477 ± 1.102
1.391AspTrp: 1.391 ± 0.691
4.868AspTyr: 4.868 ± 1.401
0.0AspXaa: 0.0 ± 0.0
Glu
2.086GluAla: 2.086 ± 0.813
1.391GluCys: 1.391 ± 0.556
2.086GluAsp: 2.086 ± 0.953
4.172GluGlu: 4.172 ± 1.628
3.477GluPhe: 3.477 ± 1.804
2.782GluGly: 2.782 ± 1.498
4.172GluHis: 4.172 ± 1.571
1.391GluIle: 1.391 ± 0.972
2.086GluLys: 2.086 ± 1.39
2.782GluLeu: 2.782 ± 2.454
2.782GluMet: 2.782 ± 1.314
2.086GluAsn: 2.086 ± 0.814
2.086GluPro: 2.086 ± 1.39
4.172GluGln: 4.172 ± 1.53
4.868GluArg: 4.868 ± 1.653
6.954GluSer: 6.954 ± 1.858
3.477GluThr: 3.477 ± 1.085
4.172GluVal: 4.172 ± 1.367
0.695GluTrp: 0.695 ± 0.457
4.172GluTyr: 4.172 ± 1.53
0.0GluXaa: 0.0 ± 0.0
Phe
1.391PheAla: 1.391 ± 0.822
1.391PheCys: 1.391 ± 1.271
3.477PheAsp: 3.477 ± 1.029
2.782PheGlu: 2.782 ± 3.313
2.782PhePhe: 2.782 ± 1.381
4.868PheGly: 4.868 ± 1.774
1.391PheHis: 1.391 ± 0.914
2.086PheIle: 2.086 ± 1.907
2.086PheLys: 2.086 ± 1.907
2.086PheLeu: 2.086 ± 1.371
2.086PheMet: 2.086 ± 1.04
4.868PheAsn: 4.868 ± 1.739
0.695PhePro: 0.695 ± 0.828
2.086PheGln: 2.086 ± 1.691
2.086PheArg: 2.086 ± 0.813
2.782PheSer: 2.782 ± 1.146
1.391PheThr: 1.391 ± 0.822
2.086PheVal: 2.086 ± 1.04
0.695PheTrp: 0.695 ± 0.457
5.563PheTyr: 5.563 ± 2.538
0.0PheXaa: 0.0 ± 0.0
Gly
3.477GlyAla: 3.477 ± 0.916
1.391GlyCys: 1.391 ± 0.822
2.782GlyAsp: 2.782 ± 1.427
4.868GlyGlu: 4.868 ± 1.719
0.695GlyPhe: 0.695 ± 0.636
2.782GlyGly: 2.782 ± 1.266
0.695GlyHis: 0.695 ± 0.636
2.782GlyIle: 2.782 ± 0.566
4.172GlyLys: 4.172 ± 1.437
3.477GlyLeu: 3.477 ± 1.56
0.695GlyMet: 0.695 ± 0.457
4.172GlyAsn: 4.172 ± 1.313
0.695GlyPro: 0.695 ± 0.457
2.782GlyGln: 2.782 ± 1.192
0.695GlyArg: 0.695 ± 0.457
6.259GlySer: 6.259 ± 2.536
5.563GlyThr: 5.563 ± 0.587
2.782GlyVal: 2.782 ± 1.381
0.0GlyTrp: 0.0 ± 0.0
4.868GlyTyr: 4.868 ± 0.816
0.0GlyXaa: 0.0 ± 0.0
His
1.391HisAla: 1.391 ± 0.556
0.0HisCys: 0.0 ± 0.0
0.695HisAsp: 0.695 ± 0.636
0.0HisGlu: 0.0 ± 0.0
2.782HisPhe: 2.782 ± 1.381
2.086HisGly: 2.086 ± 0.912
0.0HisHis: 0.0 ± 0.0
1.391HisIle: 1.391 ± 0.914
0.695HisLys: 0.695 ± 0.457
4.172HisLeu: 4.172 ± 1.508
0.0HisMet: 0.0 ± 0.0
0.695HisAsn: 0.695 ± 0.736
0.695HisPro: 0.695 ± 0.874
3.477HisGln: 3.477 ± 0.848
0.695HisArg: 0.695 ± 0.457
2.086HisSer: 2.086 ± 1.103
0.695HisThr: 0.695 ± 0.457
0.695HisVal: 0.695 ± 0.457
0.0HisTrp: 0.0 ± 0.0
0.695HisTyr: 0.695 ± 0.636
0.0HisXaa: 0.0 ± 0.0
Ile
4.868IleAla: 4.868 ± 1.415
0.695IleCys: 0.695 ± 0.874
1.391IleAsp: 1.391 ± 0.822
5.563IleGlu: 5.563 ± 2.653
4.172IlePhe: 4.172 ± 1.437
2.782IleGly: 2.782 ± 1.383
0.0IleHis: 0.0 ± 0.0
2.086IleIle: 2.086 ± 0.813
1.391IleLys: 1.391 ± 0.831
2.086IleLeu: 2.086 ± 1.283
0.0IleMet: 0.0 ± 0.0
6.259IleAsn: 6.259 ± 1.538
4.172IlePro: 4.172 ± 1.543
2.782IleGln: 2.782 ± 1.488
2.782IleArg: 2.782 ± 1.192
4.172IleSer: 4.172 ± 1.522
2.782IleThr: 2.782 ± 0.833
0.695IleVal: 0.695 ± 0.874
0.695IleTrp: 0.695 ± 0.457
2.086IleTyr: 2.086 ± 1.103
0.0IleXaa: 0.0 ± 0.0
Lys
3.477LysAla: 3.477 ± 0.888
0.0LysCys: 0.0 ± 0.0
2.086LysAsp: 2.086 ± 0.568
2.086LysGlu: 2.086 ± 0.814
2.782LysPhe: 2.782 ± 1.111
4.172LysGly: 4.172 ± 2.376
0.0LysHis: 0.0 ± 0.0
1.391LysIle: 1.391 ± 0.952
4.172LysLys: 4.172 ± 2.277
2.782LysLeu: 2.782 ± 1.712
0.695LysMet: 0.695 ± 0.636
4.868LysAsn: 4.868 ± 3.21
1.391LysPro: 1.391 ± 0.952
2.782LysGln: 2.782 ± 1.903
1.391LysArg: 1.391 ± 0.952
4.868LysSer: 4.868 ± 1.131
4.172LysThr: 4.172 ± 1.762
2.782LysVal: 2.782 ± 1.712
0.0LysTrp: 0.0 ± 0.0
4.868LysTyr: 4.868 ± 0.816
0.0LysXaa: 0.0 ± 0.0
Leu
6.259LeuAla: 6.259 ± 1.487
1.391LeuCys: 1.391 ± 0.556
5.563LeuAsp: 5.563 ± 1.569
4.868LeuGlu: 4.868 ± 1.133
1.391LeuPhe: 1.391 ± 0.98
4.868LeuGly: 4.868 ± 1.884
0.695LeuHis: 0.695 ± 0.828
2.782LeuIle: 2.782 ± 1.383
4.172LeuLys: 4.172 ± 1.369
6.954LeuLeu: 6.954 ± 1.71
1.391LeuMet: 1.391 ± 1.284
2.782LeuAsn: 2.782 ± 0.61
8.345LeuPro: 8.345 ± 0.523
5.563LeuGln: 5.563 ± 1.834
2.782LeuArg: 2.782 ± 0.566
6.259LeuSer: 6.259 ± 2.116
6.259LeuThr: 6.259 ± 1.07
0.695LeuVal: 0.695 ± 0.457
0.695LeuTrp: 0.695 ± 0.636
2.782LeuTyr: 2.782 ± 1.111
0.0LeuXaa: 0.0 ± 0.0
Met
0.695MetAla: 0.695 ± 0.457
0.695MetCys: 0.695 ± 0.874
0.695MetAsp: 0.695 ± 0.874
0.0MetGlu: 0.0 ± 0.0
0.695MetPhe: 0.695 ± 0.874
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.391MetIle: 1.391 ± 0.822
2.086MetLys: 2.086 ± 0.769
2.086MetLeu: 2.086 ± 1.401
0.0MetMet: 0.0 ± 0.0
2.086MetAsn: 2.086 ± 1.966
0.695MetPro: 0.695 ± 0.457
0.695MetGln: 0.695 ± 0.457
0.695MetArg: 0.695 ± 0.457
2.782MetSer: 2.782 ± 1.192
1.391MetThr: 1.391 ± 0.556
2.086MetVal: 2.086 ± 0.568
0.0MetTrp: 0.0 ± 0.0
1.391MetTyr: 1.391 ± 0.691
0.0MetXaa: 0.0 ± 0.0
Asn
7.65AsnAla: 7.65 ± 4.76
0.0AsnCys: 0.0 ± 0.0
2.086AsnAsp: 2.086 ± 1.667
2.086AsnGlu: 2.086 ± 0.568
2.086AsnPhe: 2.086 ± 1.512
1.391AsnGly: 1.391 ± 0.972
0.695AsnHis: 0.695 ± 0.457
4.868AsnIle: 4.868 ± 2.923
3.477AsnLys: 3.477 ± 1.694
4.172AsnLeu: 4.172 ± 0.939
1.391AsnMet: 1.391 ± 0.878
5.563AsnAsn: 5.563 ± 4.081
1.391AsnPro: 1.391 ± 0.831
3.477AsnGln: 3.477 ± 1.16
4.868AsnArg: 4.868 ± 1.739
3.477AsnSer: 3.477 ± 1.592
6.954AsnThr: 6.954 ± 2.824
4.172AsnVal: 4.172 ± 1.522
0.0AsnTrp: 0.0 ± 0.0
1.391AsnTyr: 1.391 ± 0.556
0.0AsnXaa: 0.0 ± 0.0
Pro
1.391ProAla: 1.391 ± 0.822
0.695ProCys: 0.695 ± 0.636
1.391ProAsp: 1.391 ± 0.556
2.782ProGlu: 2.782 ± 1.498
2.086ProPhe: 2.086 ± 0.794
1.391ProGly: 1.391 ± 0.914
1.391ProHis: 1.391 ± 0.556
3.477ProIle: 3.477 ± 0.848
1.391ProLys: 1.391 ± 0.691
2.086ProLeu: 2.086 ± 1.093
2.086ProMet: 2.086 ± 0.814
2.782ProAsn: 2.782 ± 0.987
1.391ProPro: 1.391 ± 0.831
2.086ProGln: 2.086 ± 1.371
2.086ProArg: 2.086 ± 1.103
4.172ProSer: 4.172 ± 1.972
3.477ProThr: 3.477 ± 2.336
2.782ProVal: 2.782 ± 1.488
0.695ProTrp: 0.695 ± 0.736
1.391ProTyr: 1.391 ± 0.556
0.0ProXaa: 0.0 ± 0.0
Gln
4.868GlnAla: 4.868 ± 2.163
0.695GlnCys: 0.695 ± 0.636
2.086GlnAsp: 2.086 ± 1.371
3.477GlnGlu: 3.477 ± 1.375
2.782GlnPhe: 2.782 ± 1.643
1.391GlnGly: 1.391 ± 0.691
0.0GlnHis: 0.0 ± 0.0
2.782GlnIle: 2.782 ± 1.847
2.782GlnLys: 2.782 ± 1.943
5.563GlnLeu: 5.563 ± 1.31
0.0GlnMet: 0.0 ± 0.0
1.391GlnAsn: 1.391 ± 0.914
0.695GlnPro: 0.695 ± 0.457
0.0GlnGln: 0.0 ± 0.0
4.172GlnArg: 4.172 ± 0.631
3.477GlnSer: 3.477 ± 1.16
2.086GlnThr: 2.086 ± 2.209
2.782GlnVal: 2.782 ± 1.171
0.695GlnTrp: 0.695 ± 0.636
2.782GlnTyr: 2.782 ± 1.111
0.0GlnXaa: 0.0 ± 0.0
Arg
2.086ArgAla: 2.086 ± 0.912
1.391ArgCys: 1.391 ± 0.98
4.172ArgAsp: 4.172 ± 1.179
3.477ArgGlu: 3.477 ± 0.388
2.782ArgPhe: 2.782 ± 1.146
2.086ArgGly: 2.086 ± 0.794
1.391ArgHis: 1.391 ± 0.556
2.086ArgIle: 2.086 ± 1.103
1.391ArgLys: 1.391 ± 0.98
4.172ArgLeu: 4.172 ± 0.939
2.086ArgMet: 2.086 ± 1.29
3.477ArgAsn: 3.477 ± 2.296
2.782ArgPro: 2.782 ± 1.171
0.0ArgGln: 0.0 ± 0.0
2.782ArgArg: 2.782 ± 1.662
2.086ArgSer: 2.086 ± 0.568
1.391ArgThr: 1.391 ± 0.822
2.086ArgVal: 2.086 ± 1.04
0.0ArgTrp: 0.0 ± 0.0
4.868ArgTyr: 4.868 ± 1.536
0.0ArgXaa: 0.0 ± 0.0
Ser
12.517SerAla: 12.517 ± 3.792
1.391SerCys: 1.391 ± 0.556
7.65SerAsp: 7.65 ± 1.864
4.868SerGlu: 4.868 ± 1.153
2.782SerPhe: 2.782 ± 1.871
5.563SerGly: 5.563 ± 1.975
2.782SerHis: 2.782 ± 1.171
4.172SerIle: 4.172 ± 2.249
2.782SerLys: 2.782 ± 1.171
9.736SerLeu: 9.736 ± 2.457
2.086SerMet: 2.086 ± 1.788
4.172SerAsn: 4.172 ± 1.367
4.172SerPro: 4.172 ± 0.831
2.086SerGln: 2.086 ± 1.04
1.391SerArg: 1.391 ± 0.914
8.345SerSer: 8.345 ± 3.679
2.086SerThr: 2.086 ± 0.912
4.868SerVal: 4.868 ± 1.675
1.391SerTrp: 1.391 ± 0.556
3.477SerTyr: 3.477 ± 2.336
0.0SerXaa: 0.0 ± 0.0
Thr
5.563ThrAla: 5.563 ± 1.599
2.086ThrCys: 2.086 ± 0.794
4.172ThrAsp: 4.172 ± 1.16
3.477ThrGlu: 3.477 ± 2.03
4.868ThrPhe: 4.868 ± 1.774
5.563ThrGly: 5.563 ± 1.088
0.695ThrHis: 0.695 ± 0.457
2.086ThrIle: 2.086 ± 1.371
4.172ThrLys: 4.172 ± 2.207
4.868ThrLeu: 4.868 ± 0.774
0.0ThrMet: 0.0 ± 0.0
2.782ThrAsn: 2.782 ± 2.066
0.695ThrPro: 0.695 ± 0.457
0.695ThrGln: 0.695 ± 0.636
3.477ThrArg: 3.477 ± 1.627
4.868ThrSer: 4.868 ± 1.739
6.259ThrThr: 6.259 ± 2.177
1.391ThrVal: 1.391 ± 1.29
0.695ThrTrp: 0.695 ± 0.736
2.782ThrTyr: 2.782 ± 1.296
0.0ThrXaa: 0.0 ± 0.0
Val
1.391ValAla: 1.391 ± 0.914
0.0ValCys: 0.0 ± 0.0
5.563ValAsp: 5.563 ± 1.045
3.477ValGlu: 3.477 ± 0.388
2.782ValPhe: 2.782 ± 1.643
1.391ValGly: 1.391 ± 0.98
1.391ValHis: 1.391 ± 0.914
3.477ValIle: 3.477 ± 0.848
2.086ValLys: 2.086 ± 0.769
2.086ValLeu: 2.086 ± 0.912
0.0ValMet: 0.0 ± 0.0
2.086ValAsn: 2.086 ± 0.568
4.868ValPro: 4.868 ± 1.774
2.782ValGln: 2.782 ± 1.643
2.086ValArg: 2.086 ± 1.667
3.477ValSer: 3.477 ± 1.671
2.782ValThr: 2.782 ± 1.171
1.391ValVal: 1.391 ± 0.822
0.695ValTrp: 0.695 ± 0.636
2.782ValTyr: 2.782 ± 1.286
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.391TrpPhe: 1.391 ± 0.556
0.0TrpGly: 0.0 ± 0.0
2.086TrpHis: 2.086 ± 0.568
0.695TrpIle: 0.695 ± 0.457
0.695TrpLys: 0.695 ± 0.636
0.695TrpLeu: 0.695 ± 0.457
0.695TrpMet: 0.695 ± 0.636
1.391TrpAsn: 1.391 ± 0.691
0.0TrpPro: 0.0 ± 0.0
1.391TrpGln: 1.391 ± 0.691
0.0TrpArg: 0.0 ± 0.0
1.391TrpSer: 1.391 ± 0.691
0.0TrpThr: 0.0 ± 0.0
0.695TrpVal: 0.695 ± 0.457
0.0TrpTrp: 0.0 ± 0.0
1.391TrpTyr: 1.391 ± 0.831
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.172TyrAla: 4.172 ± 1.667
0.695TyrCys: 0.695 ± 0.457
3.477TyrAsp: 3.477 ± 1.627
3.477TyrGlu: 3.477 ± 1.739
2.782TyrPhe: 2.782 ± 1.146
2.782TyrGly: 2.782 ± 1.58
1.391TyrHis: 1.391 ± 1.271
4.868TyrIle: 4.868 ± 1.824
4.172TyrLys: 4.172 ± 2.105
4.868TyrLeu: 4.868 ± 1.21
0.0TyrMet: 0.0 ± 0.0
0.695TyrAsn: 0.695 ± 0.457
2.086TyrPro: 2.086 ± 1.103
2.086TyrGln: 2.086 ± 1.04
2.782TyrArg: 2.782 ± 0.884
6.954TyrSer: 6.954 ± 2.352
3.477TyrThr: 3.477 ± 1.805
2.782TyrVal: 2.782 ± 0.891
0.695TyrTrp: 0.695 ± 0.736
5.563TyrTyr: 5.563 ± 2.571
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1439 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski