Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_586

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
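As a rough illustration of the calculation described above, the following Python sketch counts overlapping adjacent residue pairs and reports them in per mille. The function names, the one-letter input format, and the toy sequences are assumptions made for this example; the exact counting convention used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def uniform_expectation_per_mille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all ordered pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def dipeptide_per_mille(proteins):
    """Count overlapping adjacent residue pairs within each protein, in per mille."""
    counts = Counter()
    for seq in proteins:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip pairs residue i with residue i+1; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not the Cap3_SP_586 proteome.
toy = ["ACAC", "AA"]
freqs = dipeptide_per_mille(toy)
print(freqs)
print(uniform_expectation_per_mille(20))  # 400 possible dipeptides
print(uniform_expectation_per_mille(21))  # 441 possible dipeptides, Xaa included
```

With 20 residues the uniform expectation is 1000/400 = 2.5‰, which is the 0.25% quoted in the text.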

All values are given in per mille (‰); multiply by 10⁻³ to obtain a fraction.
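The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and the over/under rule is only an assumption that compares each value with the uniform expectation of 0.25%; the page does not state the exact threshold behind the red and blue marking. The two sample values are the LeuLeu and CysCys entries from the table.

```python
EXPECTED_PERCENT = 0.25  # uniform expectation for 400 dipeptides, from the text


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def percent_to_per_mille(value):
    """Convert a percentage to per mille (1% = 10 per mille)."""
    return value * 10.0


EXPECTED_PER_MILLE = percent_to_per_mille(EXPECTED_PERCENT)


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Assumed rule: compare a table value with the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


for name, value in [("LeuLeu", 8.804), ("CysCys", 0.734)]:
    print(name, per_mille_to_fraction(value), classify(value))
```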

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
1.467AlaCys: 1.467 ± 1.446
2.935AlaAsp: 2.935 ± 1.552
4.402AlaGlu: 4.402 ± 1.674
2.935AlaPhe: 2.935 ± 1.276
0.734AlaGly: 0.734 ± 0.689
0.0AlaHis: 0.0 ± 0.0
2.201AlaIle: 2.201 ± 1.455
4.402AlaLys: 4.402 ± 2.462
8.07AlaLeu: 8.07 ± 2.427
2.935AlaMet: 2.935 ± 0.969
4.402AlaAsn: 4.402 ± 1.316
3.668AlaPro: 3.668 ± 1.23
6.603AlaGln: 6.603 ± 3.561
2.201AlaArg: 2.201 ± 1.303
6.603AlaSer: 6.603 ± 5.162
5.136AlaThr: 5.136 ± 1.262
1.467AlaVal: 1.467 ± 0.97
0.734AlaTrp: 0.734 ± 0.761
0.734AlaTyr: 0.734 ± 0.689
0.0AlaXaa: 0.0 ± 0.0
Cys
0.734CysAla: 0.734 ± 0.761
0.734CysCys: 0.734 ± 0.761
0.734CysAsp: 0.734 ± 1.02
0.734CysGlu: 0.734 ± 1.402
0.0CysPhe: 0.0 ± 0.0
0.734CysGly: 0.734 ± 0.761
0.734CysHis: 0.734 ± 0.498
0.734CysIle: 0.734 ± 0.498
2.935CysLys: 2.935 ± 3.045
0.734CysLeu: 0.734 ± 0.761
0.0CysMet: 0.0 ± 0.0
0.734CysAsn: 0.734 ± 0.761
0.0CysPro: 0.0 ± 0.0
1.467CysGln: 1.467 ± 1.523
0.734CysArg: 0.734 ± 0.498
2.201CysSer: 2.201 ± 1.299
0.0CysThr: 0.0 ± 0.0
0.734CysVal: 0.734 ± 0.761
0.734CysTrp: 0.734 ± 0.498
0.734CysTyr: 0.734 ± 0.761
0.0CysXaa: 0.0 ± 0.0
Asp
3.668AspAla: 3.668 ± 1.496
0.734AspCys: 0.734 ± 0.761
1.467AspAsp: 1.467 ± 1.129
3.668AspGlu: 3.668 ± 1.745
4.402AspPhe: 4.402 ± 1.914
2.201AspGly: 2.201 ± 0.658
0.734AspHis: 0.734 ± 0.498
5.869AspIle: 5.869 ± 1.997
2.935AspLys: 2.935 ± 2.083
3.668AspLeu: 3.668 ± 1.191
2.201AspMet: 2.201 ± 1.189
4.402AspAsn: 4.402 ± 1.813
0.0AspPro: 0.0 ± 0.0
2.201AspGln: 2.201 ± 0.914
1.467AspArg: 1.467 ± 1.129
5.136AspSer: 5.136 ± 2.601
2.201AspThr: 2.201 ± 1.155
6.603AspVal: 6.603 ± 2.312
0.0AspTrp: 0.0 ± 0.0
4.402AspTyr: 4.402 ± 1.559
0.0AspXaa: 0.0 ± 0.0
Glu
5.869GluAla: 5.869 ± 1.575
0.0GluCys: 0.0 ± 0.0
2.201GluAsp: 2.201 ± 0.927
1.467GluGlu: 1.467 ± 0.935
4.402GluPhe: 4.402 ± 2.257
3.668GluGly: 3.668 ± 1.991
1.467GluHis: 1.467 ± 0.995
1.467GluIle: 1.467 ± 0.673
1.467GluLys: 1.467 ± 1.923
3.668GluLeu: 3.668 ± 1.496
0.734GluMet: 0.734 ± 0.926
1.467GluAsn: 1.467 ± 0.995
0.0GluPro: 0.0 ± 0.0
2.935GluGln: 2.935 ± 0.969
5.136GluArg: 5.136 ± 2.273
2.201GluSer: 2.201 ± 1.455
2.935GluThr: 2.935 ± 1.326
0.0GluVal: 0.0 ± 0.0
1.467GluTrp: 1.467 ± 0.673
3.668GluTyr: 3.668 ± 1.146
0.0GluXaa: 0.0 ± 0.0
Phe
2.935PheAla: 2.935 ± 1.346
0.734PheCys: 0.734 ± 1.02
4.402PheAsp: 4.402 ± 1.876
1.467PheGlu: 1.467 ± 0.638
5.869PhePhe: 5.869 ± 2.516
8.07PheGly: 8.07 ± 2.255
1.467PheHis: 1.467 ± 0.673
3.668PheIle: 3.668 ± 1.517
3.668PheLys: 3.668 ± 1.517
5.136PheLeu: 5.136 ± 2.223
1.467PheMet: 1.467 ± 1.392
1.467PheAsn: 1.467 ± 0.995
0.0PhePro: 0.0 ± 0.0
2.935PheGln: 2.935 ± 1.276
2.935PheArg: 2.935 ± 1.41
5.136PheSer: 5.136 ± 2.82
2.935PheThr: 2.935 ± 0.698
1.467PheVal: 1.467 ± 0.673
2.201PheTrp: 2.201 ± 0.914
3.668PheTyr: 3.668 ± 1.018
0.0PheXaa: 0.0 ± 0.0
Gly
1.467GlyAla: 1.467 ± 0.638
0.734GlyCys: 0.734 ± 0.498
6.603GlyAsp: 6.603 ± 2.026
3.668GlyGlu: 3.668 ± 1.191
3.668GlyPhe: 3.668 ± 1.191
2.935GlyGly: 2.935 ± 0.951
1.467GlyHis: 1.467 ± 1.523
2.935GlyIle: 2.935 ± 0.698
3.668GlyLys: 3.668 ± 1.018
2.201GlyLeu: 2.201 ± 1.751
1.467GlyMet: 1.467 ± 0.97
2.935GlyAsn: 2.935 ± 1.167
2.935GlyPro: 2.935 ± 0.698
1.467GlyGln: 1.467 ± 0.638
2.935GlyArg: 2.935 ± 1.89
2.201GlySer: 2.201 ± 0.914
3.668GlyThr: 3.668 ± 1.018
3.668GlyVal: 3.668 ± 1.9
0.0GlyTrp: 0.0 ± 0.0
3.668GlyTyr: 3.668 ± 1.783
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.734HisAsp: 0.734 ± 0.498
0.0HisGlu: 0.0 ± 0.0
1.467HisPhe: 1.467 ± 0.995
1.467HisGly: 1.467 ± 0.638
1.467HisHis: 1.467 ± 0.995
0.0HisIle: 0.0 ± 0.0
0.734HisLys: 0.734 ± 0.761
2.201HisLeu: 2.201 ± 1.348
0.734HisMet: 0.734 ± 0.689
2.201HisAsn: 2.201 ± 1.231
0.734HisPro: 0.734 ± 0.498
0.734HisGln: 0.734 ± 0.761
0.734HisArg: 0.734 ± 0.761
1.467HisSer: 1.467 ± 1.392
0.0HisThr: 0.0 ± 0.0
1.467HisVal: 1.467 ± 0.673
0.0HisTrp: 0.0 ± 0.0
2.935HisTyr: 2.935 ± 1.272
0.0HisXaa: 0.0 ± 0.0
Ile
2.201IleAla: 2.201 ± 1.55
0.0IleCys: 0.0 ± 0.0
3.668IleAsp: 3.668 ± 1.018
2.201IleGlu: 2.201 ± 0.658
2.201IlePhe: 2.201 ± 0.906
5.869IleGly: 5.869 ± 1.814
0.734IleHis: 0.734 ± 0.761
3.668IleIle: 3.668 ± 1.9
2.201IleLys: 2.201 ± 2.066
8.804IleLeu: 8.804 ± 3.862
1.467IleMet: 1.467 ± 0.971
5.136IleAsn: 5.136 ± 1.227
2.935IlePro: 2.935 ± 0.969
0.0IleGln: 0.0 ± 0.0
2.201IleArg: 2.201 ± 2.284
6.603IleSer: 6.603 ± 1.289
2.935IleThr: 2.935 ± 1.843
3.668IleVal: 3.668 ± 2.457
0.734IleTrp: 0.734 ± 0.498
1.467IleTyr: 1.467 ± 0.673
0.0IleXaa: 0.0 ± 0.0
Lys
2.201LysAla: 2.201 ± 1.231
2.935LysCys: 2.935 ± 2.083
3.668LysAsp: 3.668 ± 2.013
0.0LysGlu: 0.0 ± 0.0
6.603LysPhe: 6.603 ± 2.038
2.201LysGly: 2.201 ± 0.914
2.201LysHis: 2.201 ± 0.658
2.201LysIle: 2.201 ± 0.914
2.201LysLys: 2.201 ± 1.303
5.869LysLeu: 5.869 ± 2.413
0.734LysMet: 0.734 ± 0.636
5.869LysAsn: 5.869 ± 2.277
3.668LysPro: 3.668 ± 2.583
2.935LysGln: 2.935 ± 1.272
2.935LysArg: 2.935 ± 2.27
4.402LysSer: 4.402 ± 1.813
2.935LysThr: 2.935 ± 0.698
4.402LysVal: 4.402 ± 0.838
0.734LysTrp: 0.734 ± 0.689
2.935LysTyr: 2.935 ± 1.647
0.0LysXaa: 0.0 ± 0.0
Leu
7.337LeuAla: 7.337 ± 3.491
0.734LeuCys: 0.734 ± 0.761
3.668LeuAsp: 3.668 ± 1.139
5.869LeuGlu: 5.869 ± 1.863
4.402LeuPhe: 4.402 ± 1.185
3.668LeuGly: 3.668 ± 1.783
2.201LeuHis: 2.201 ± 2.066
7.337LeuIle: 7.337 ± 4.938
5.869LeuLys: 5.869 ± 1.395
8.804LeuLeu: 8.804 ± 4.082
3.668LeuMet: 3.668 ± 2.186
5.869LeuAsn: 5.869 ± 3.06
5.869LeuPro: 5.869 ± 1.834
6.603LeuGln: 6.603 ± 2.075
5.136LeuArg: 5.136 ± 2.272
4.402LeuSer: 4.402 ± 1.15
2.935LeuThr: 2.935 ± 1.167
5.136LeuVal: 5.136 ± 1.446
0.0LeuTrp: 0.0 ± 0.0
2.935LeuTyr: 2.935 ± 1.361
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.467MetCys: 1.467 ± 0.673
3.668MetAsp: 3.668 ± 1.882
2.935MetGlu: 2.935 ± 2.245
0.734MetPhe: 0.734 ± 1.02
0.734MetGly: 0.734 ± 0.689
0.734MetHis: 0.734 ± 0.689
0.734MetIle: 0.734 ± 1.02
1.467MetLys: 1.467 ± 0.638
0.734MetLeu: 0.734 ± 1.402
0.734MetMet: 0.734 ± 0.761
2.935MetAsn: 2.935 ± 1.552
2.201MetPro: 2.201 ± 0.94
0.0MetGln: 0.0 ± 0.0
0.734MetArg: 0.734 ± 1.402
2.935MetSer: 2.935 ± 1.41
1.467MetThr: 1.467 ± 0.995
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.734MetTyr: 0.734 ± 0.498
0.0MetXaa: 0.0 ± 0.0
Asn
9.538AsnAla: 9.538 ± 5.744
0.734AsnCys: 0.734 ± 0.761
5.136AsnAsp: 5.136 ± 1.607
6.603AsnGlu: 6.603 ± 1.888
2.201AsnPhe: 2.201 ± 0.94
0.734AsnGly: 0.734 ± 0.498
0.0AsnHis: 0.0 ± 0.0
5.136AsnIle: 5.136 ± 1.9
5.136AsnLys: 5.136 ± 1.8
7.337AsnLeu: 7.337 ± 1.887
1.467AsnMet: 1.467 ± 1.377
2.935AsnAsn: 2.935 ± 2.142
1.467AsnPro: 1.467 ± 1.129
3.668AsnGln: 3.668 ± 1.8
2.201AsnArg: 2.201 ± 0.914
5.869AsnSer: 5.869 ± 2.552
3.668AsnThr: 3.668 ± 1.783
2.201AsnVal: 2.201 ± 1.189
0.0AsnTrp: 0.0 ± 0.0
3.668AsnTyr: 3.668 ± 2.832
0.0AsnXaa: 0.0 ± 0.0
Pro
1.467ProAla: 1.467 ± 0.995
0.734ProCys: 0.734 ± 0.761
2.201ProAsp: 2.201 ± 2.04
0.734ProGlu: 0.734 ± 0.498
2.201ProPhe: 2.201 ± 1.189
5.869ProGly: 5.869 ± 1.814
1.467ProHis: 1.467 ± 0.673
4.402ProIle: 4.402 ± 0.937
2.935ProLys: 2.935 ± 1.41
3.668ProLeu: 3.668 ± 0.574
0.734ProMet: 0.734 ± 0.498
4.402ProAsn: 4.402 ± 1.606
0.734ProPro: 0.734 ± 0.761
5.136ProGln: 5.136 ± 1.974
1.467ProArg: 1.467 ± 0.673
4.402ProSer: 4.402 ± 0.728
0.734ProThr: 0.734 ± 0.498
1.467ProVal: 1.467 ± 0.638
1.467ProTrp: 1.467 ± 1.392
0.734ProTyr: 0.734 ± 0.498
0.0ProXaa: 0.0 ± 0.0
Gln
6.603GlnAla: 6.603 ± 1.319
0.0GlnCys: 0.0 ± 0.0
0.734GlnAsp: 0.734 ± 1.02
1.467GlnGlu: 1.467 ± 0.995
3.668GlnPhe: 3.668 ± 1.745
3.668GlnGly: 3.668 ± 1.783
0.0GlnHis: 0.0 ± 0.0
1.467GlnIle: 1.467 ± 0.935
4.402GlnLys: 4.402 ± 1.973
3.668GlnLeu: 3.668 ± 1.221
1.467GlnMet: 1.467 ± 1.377
6.603GlnAsn: 6.603 ± 2.645
1.467GlnPro: 1.467 ± 0.935
0.734GlnGln: 0.734 ± 0.498
2.935GlnArg: 2.935 ± 0.698
4.402GlnSer: 4.402 ± 0.937
4.402GlnThr: 4.402 ± 1.324
1.467GlnVal: 1.467 ± 1.166
0.0GlnTrp: 0.0 ± 0.0
1.467GlnTyr: 1.467 ± 0.638
0.0GlnXaa: 0.0 ± 0.0
Arg
4.402ArgAla: 4.402 ± 3.136
0.0ArgCys: 0.0 ± 0.0
3.668ArgAsp: 3.668 ± 2.061
1.467ArgGlu: 1.467 ± 0.673
2.935ArgPhe: 2.935 ± 2.083
0.734ArgGly: 0.734 ± 0.498
0.734ArgHis: 0.734 ± 0.761
3.668ArgIle: 3.668 ± 0.574
2.201ArgLys: 2.201 ± 1.348
5.136ArgLeu: 5.136 ± 1.106
2.201ArgMet: 2.201 ± 1.231
2.201ArgAsn: 2.201 ± 0.94
3.668ArgPro: 3.668 ± 1.745
1.467ArgGln: 1.467 ± 1.129
0.734ArgArg: 0.734 ± 0.498
2.935ArgSer: 2.935 ± 2.892
1.467ArgThr: 1.467 ± 0.638
2.935ArgVal: 2.935 ± 0.79
0.0ArgTrp: 0.0 ± 0.0
2.935ArgTyr: 2.935 ± 1.346
0.0ArgXaa: 0.0 ± 0.0
Ser
5.136SerAla: 5.136 ± 1.976
1.467SerCys: 1.467 ± 1.392
6.603SerAsp: 6.603 ± 2.418
3.668SerGlu: 3.668 ± 2.876
3.668SerPhe: 3.668 ± 1.539
2.201SerGly: 2.201 ± 1.155
0.734SerHis: 0.734 ± 0.498
5.136SerIle: 5.136 ± 2.882
4.402SerLys: 4.402 ± 1.905
5.136SerLeu: 5.136 ± 1.248
0.734SerMet: 0.734 ± 1.402
3.668SerAsn: 3.668 ± 1.539
8.07SerPro: 8.07 ± 2.031
2.201SerGln: 2.201 ± 0.94
4.402SerArg: 4.402 ± 2.214
3.668SerSer: 3.668 ± 1.146
5.869SerThr: 5.869 ± 1.807
5.136SerVal: 5.136 ± 1.227
2.201SerTrp: 2.201 ± 1.56
2.201SerTyr: 2.201 ± 1.751
0.0SerXaa: 0.0 ± 0.0
Thr
5.136ThrAla: 5.136 ± 1.227
0.734ThrCys: 0.734 ± 0.498
0.0ThrAsp: 0.0 ± 0.0
4.402ThrGlu: 4.402 ± 1.155
2.935ThrPhe: 2.935 ± 0.698
1.467ThrGly: 1.467 ± 0.995
0.734ThrHis: 0.734 ± 1.402
5.136ThrIle: 5.136 ± 2.223
2.935ThrLys: 2.935 ± 1.298
6.603ThrLeu: 6.603 ± 2.396
0.734ThrMet: 0.734 ± 0.498
3.668ThrAsn: 3.668 ± 1.049
2.935ThrPro: 2.935 ± 1.314
2.201ThrGln: 2.201 ± 1.155
0.734ThrArg: 0.734 ± 0.689
5.136ThrSer: 5.136 ± 1.451
2.201ThrThr: 2.201 ± 1.55
0.734ThrVal: 0.734 ± 0.689
0.0ThrTrp: 0.0 ± 0.0
2.201ThrTyr: 2.201 ± 0.94
0.0ThrXaa: 0.0 ± 0.0
Val
2.935ValAla: 2.935 ± 1.314
0.0ValCys: 0.0 ± 0.0
3.668ValAsp: 3.668 ± 1.745
1.467ValGlu: 1.467 ± 0.638
2.935ValPhe: 2.935 ± 2.141
2.935ValGly: 2.935 ± 0.79
0.0ValHis: 0.0 ± 0.0
0.734ValIle: 0.734 ± 0.761
4.402ValLys: 4.402 ± 1.914
1.467ValLeu: 1.467 ± 2.804
0.734ValMet: 0.734 ± 0.498
3.668ValAsn: 3.668 ± 2.957
5.869ValPro: 5.869 ± 1.882
2.935ValGln: 2.935 ± 1.272
2.201ValArg: 2.201 ± 0.927
3.668ValSer: 3.668 ± 1.702
2.935ValThr: 2.935 ± 2.141
0.0ValVal: 0.0 ± 0.0
1.467ValTrp: 1.467 ± 0.995
2.935ValTyr: 2.935 ± 1.835
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.734TrpCys: 0.734 ± 0.761
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
5.136TrpLeu: 5.136 ± 1.262
0.0TrpMet: 0.0 ± 1.232
0.734TrpAsn: 0.734 ± 0.498
0.0TrpPro: 0.0 ± 0.0
1.467TrpGln: 1.467 ± 0.935
0.734TrpArg: 0.734 ± 0.498
1.467TrpSer: 1.467 ± 0.995
1.467TrpThr: 1.467 ± 0.673
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
2.201TyrCys: 2.201 ± 2.284
2.201TyrAsp: 2.201 ± 1.493
0.0TyrGlu: 0.0 ± 0.0
4.402TyrPhe: 4.402 ± 2.214
4.402TyrGly: 4.402 ± 2.262
2.201TyrHis: 2.201 ± 0.906
2.201TyrIle: 2.201 ± 1.568
3.668TyrLys: 3.668 ± 1.499
4.402TyrLeu: 4.402 ± 1.213
0.0TyrMet: 0.0 ± 0.0
4.402TyrAsn: 4.402 ± 2.462
0.734TyrPro: 0.734 ± 0.498
2.935TyrGln: 2.935 ± 1.326
2.935TyrArg: 2.935 ± 2.083
1.467TyrSer: 1.467 ± 0.995
0.734TyrThr: 0.734 ± 0.761
4.402TyrVal: 4.402 ± 1.682
0.734TyrTrp: 0.734 ± 0.761
2.935TyrTyr: 2.935 ± 0.951
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1364 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
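A protein-level bootstrap of this kind might look like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is only an illustration under stated assumptions. The function names, the fixed seed, the use of the sample standard deviation, and the toy sequences are all hypothetical, and the database's actual procedure is not described beyond the note above.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Per-mille frequency of each overlapping adjacent residue pair."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the Cap3_SP_586 sequences.
toy = ["ACACLL", "LLKA", "ACKKLL", "AAAC", "LLLL"]
print(pair_frequencies(toy).get("LL", 0.0), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in one protein shows up as a large error.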

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski