Amino acid dipepetide frequency for Ceratobasidium endornavirus B

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.147AlaAla: 3.147 ± 0.936
0.525AlaCys: 0.525 ± 0.241
1.967AlaAsp: 1.967 ± 0.548
2.491AlaGlu: 2.491 ± 0.165
1.442AlaPhe: 1.442 ± 0.409
2.885AlaGly: 2.885 ± 0.341
0.525AlaHis: 0.525 ± 0.235
4.983AlaIle: 4.983 ± 0.625
3.541AlaLys: 3.541 ± 0.397
4.065AlaLeu: 4.065 ± 0.633
1.705AlaMet: 1.705 ± 0.288
3.409AlaAsn: 3.409 ± 0.616
1.705AlaPro: 1.705 ± 0.288
0.656AlaGln: 0.656 ± 0.294
3.147AlaArg: 3.147 ± 0.697
3.409AlaSer: 3.409 ± 0.139
3.934AlaThr: 3.934 ± 0.812
1.967AlaVal: 1.967 ± 0.645
0.262AlaTrp: 0.262 ± 0.121
2.229AlaTyr: 2.229 ± 0.285
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.174
0.525CysCys: 0.525 ± 0.235
0.918CysAsp: 0.918 ± 0.174
0.787CysGlu: 0.787 ± 0.601
0.393CysPhe: 0.393 ± 0.177
1.574CysGly: 1.574 ± 0.486
0.787CysHis: 0.787 ± 0.115
1.705CysIle: 1.705 ± 0.189
1.18CysLys: 1.18 ± 0.424
1.18CysLeu: 1.18 ± 0.053
0.525CysMet: 0.525 ± 0.003
1.18CysAsn: 1.18 ± 0.053
0.656CysPro: 0.656 ± 0.056
0.525CysGln: 0.525 ± 0.003
0.262CysArg: 0.262 ± 0.118
0.525CysSer: 0.525 ± 0.003
1.967CysThr: 1.967 ± 0.548
1.705CysVal: 1.705 ± 0.189
0.393CysTrp: 0.393 ± 0.3
0.787CysTyr: 0.787 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
1.836AspAla: 1.836 ± 0.586
1.574AspCys: 1.574 ± 0.248
4.196AspAsp: 4.196 ± 1.407
4.983AspGlu: 4.983 ± 0.386
1.18AspPhe: 1.18 ± 0.291
2.885AspGly: 2.885 ± 0.136
1.049AspHis: 1.049 ± 0.006
4.721AspIle: 4.721 ± 0.266
4.59AspLys: 4.59 ± 0.086
3.541AspLeu: 3.541 ± 0.397
1.442AspMet: 1.442 ± 0.171
5.639AspAsn: 5.639 ± 1.339
2.098AspPro: 2.098 ± 0.703
2.229AspGln: 2.229 ± 0.192
2.36AspArg: 2.36 ± 0.344
2.229AspSer: 2.229 ± 0.524
2.491AspThr: 2.491 ± 1.119
1.049AspVal: 1.049 ± 0.471
0.656AspTrp: 0.656 ± 0.183
3.147AspTyr: 3.147 ± 0.733
0.0AspXaa: 0.0 ± 0.0
Glu
2.623GluAla: 2.623 ± 0.015
0.787GluCys: 0.787 ± 0.124
1.18GluAsp: 1.18 ± 0.291
3.016GluGlu: 3.016 ± 0.162
2.229GluPhe: 2.229 ± 0.524
2.885GluGly: 2.885 ± 0.136
2.098GluHis: 2.098 ± 0.465
4.196GluIle: 4.196 ± 1.216
2.491GluLys: 2.491 ± 1.119
7.343GluLeu: 7.343 ± 0.281
0.918GluMet: 0.918 ± 0.174
5.376GluAsn: 5.376 ± 0.029
3.016GluPro: 3.016 ± 0.554
2.885GluGln: 2.885 ± 0.341
2.623GluArg: 2.623 ± 0.223
2.229GluSer: 2.229 ± 0.047
3.541GluThr: 3.541 ± 1.034
3.016GluVal: 3.016 ± 0.315
1.705GluTrp: 1.705 ± 0.765
3.016GluTyr: 3.016 ± 0.639
0.0GluXaa: 0.0 ± 0.0
Phe
1.049PheAla: 1.049 ± 0.471
0.131PheCys: 0.131 ± 0.059
1.574PheAsp: 1.574 ± 0.706
2.098PheGlu: 2.098 ± 0.465
0.787PhePhe: 0.787 ± 0.115
1.311PheGly: 1.311 ± 0.365
0.262PheHis: 0.262 ± 0.121
1.705PheIle: 1.705 ± 0.527
2.229PheLys: 2.229 ± 0.907
2.098PheLeu: 2.098 ± 0.465
0.787PheMet: 0.787 ± 0.115
4.327PheAsn: 4.327 ± 0.273
0.656PhePro: 0.656 ± 0.056
0.525PheGln: 0.525 ± 0.003
0.787PheArg: 0.787 ± 0.115
1.574PheSer: 1.574 ± 0.468
2.36PheThr: 2.36 ± 0.344
1.442PheVal: 1.442 ± 0.068
0.262PheTrp: 0.262 ± 0.121
0.525PheTyr: 0.525 ± 0.235
0.0PheXaa: 0.0 ± 0.0
Gly
2.36GlyAla: 2.36 ± 0.848
1.049GlyCys: 1.049 ± 0.721
1.967GlyAsp: 1.967 ± 0.168
4.065GlyGlu: 4.065 ± 0.083
1.705GlyPhe: 1.705 ± 0.05
3.541GlyGly: 3.541 ± 1.034
1.18GlyHis: 1.18 ± 0.053
3.803GlyIle: 3.803 ± 0.038
5.507GlyLys: 5.507 ± 0.803
5.245GlyLeu: 5.245 ± 0.208
1.442GlyMet: 1.442 ± 0.306
4.065GlyAsn: 4.065 ± 0.798
2.36GlyPro: 2.36 ± 0.371
2.098GlyGln: 2.098 ± 0.489
2.491GlyArg: 2.491 ± 0.789
2.36GlySer: 2.36 ± 0.583
5.114GlyThr: 5.114 ± 1.043
3.016GlyVal: 3.016 ± 0.4
0.656GlyTrp: 0.656 ± 0.056
3.016GlyTyr: 3.016 ± 0.4
0.0GlyXaa: 0.0 ± 0.0
His
1.836HisAla: 1.836 ± 0.109
0.393HisCys: 0.393 ± 0.177
1.311HisAsp: 1.311 ± 0.35
1.311HisGlu: 1.311 ± 0.604
0.393HisPhe: 0.393 ± 0.062
1.311HisGly: 1.311 ± 0.365
1.049HisHis: 1.049 ± 0.244
3.278HisIle: 3.278 ± 0.518
3.016HisLys: 3.016 ± 0.4
2.491HisLeu: 2.491 ± 0.165
0.393HisMet: 0.393 ± 0.177
2.754HisAsn: 2.754 ± 0.195
0.918HisPro: 0.918 ± 0.065
0.393HisGln: 0.393 ± 0.177
1.311HisArg: 1.311 ± 0.127
1.836HisSer: 1.836 ± 0.109
1.836HisThr: 1.836 ± 0.13
1.442HisVal: 1.442 ± 0.545
0.656HisTrp: 0.656 ± 0.183
1.311HisTyr: 1.311 ± 0.842
0.0HisXaa: 0.0 ± 0.0
Ile
5.245IleAla: 5.245 ± 1.162
1.049IleCys: 1.049 ± 0.244
5.245IleAsp: 5.245 ± 0.746
4.852IleGlu: 4.852 ± 0.207
2.098IlePhe: 2.098 ± 0.226
6.819IleGly: 6.819 ± 0.039
2.491IleHis: 2.491 ± 0.551
7.999IleIle: 7.999 ± 1.656
6.032IleLys: 6.032 ± 0.869
7.474IleLeu: 7.474 ± 1.176
1.836IleMet: 1.836 ± 0.368
8.261IleAsn: 8.261 ± 0.584
2.623IlePro: 2.623 ± 0.254
2.885IleGln: 2.885 ± 1.057
3.147IleArg: 3.147 ± 0.257
5.114IleSer: 5.114 ± 0.327
7.606IleThr: 7.606 ± 1.355
3.147IleVal: 3.147 ± 0.22
0.918IleTrp: 0.918 ± 0.065
4.065IleTyr: 4.065 ± 0.798
0.0IleXaa: 0.0 ± 0.0
Lys
2.754LysAla: 2.754 ± 0.195
2.098LysCys: 2.098 ± 0.727
3.278LysAsp: 3.278 ± 0.041
3.409LysGlu: 3.409 ± 0.1
2.229LysPhe: 2.229 ± 0.047
3.016LysGly: 3.016 ± 0.077
3.409LysHis: 3.409 ± 1.054
5.901LysIle: 5.901 ± 0.026
2.885LysLys: 2.885 ± 0.341
7.737LysLeu: 7.737 ± 0.373
1.049LysMet: 1.049 ± 0.006
4.721LysAsn: 4.721 ± 0.027
4.458LysPro: 4.458 ± 0.145
4.065LysGln: 4.065 ± 0.633
2.623LysArg: 2.623 ± 0.015
3.672LysSer: 3.672 ± 0.217
4.458LysThr: 4.458 ± 0.571
3.934LysVal: 3.934 ± 0.619
0.787LysTrp: 0.787 ± 0.353
3.147LysTyr: 3.147 ± 0.459
0.0LysXaa: 0.0 ± 0.0
Leu
3.147LeuAla: 3.147 ± 0.257
1.836LeuCys: 1.836 ± 0.13
6.032LeuAsp: 6.032 ± 1.754
3.278LeuGlu: 3.278 ± 0.756
2.098LeuPhe: 2.098 ± 0.251
4.59LeuGly: 4.59 ± 0.391
1.836LeuHis: 1.836 ± 0.368
6.557LeuIle: 6.557 ± 0.157
7.343LeuLys: 7.343 ± 0.042
8.13LeuLeu: 8.13 ± 0.55
1.442LeuMet: 1.442 ± 0.409
8.523LeuAsn: 8.523 ± 0.466
4.983LeuPro: 4.983 ± 0.863
3.278LeuGln: 3.278 ± 0.041
4.852LeuArg: 4.852 ± 0.032
5.77LeuSer: 5.77 ± 0.205
7.868LeuThr: 7.868 ± 0.193
3.409LeuVal: 3.409 ± 0.815
1.18LeuTrp: 1.18 ± 0.424
2.623LeuTyr: 2.623 ± 0.223
0.0LeuXaa: 0.0 ± 0.0
Met
1.442MetAla: 1.442 ± 0.648
0.262MetCys: 0.262 ± 0.121
1.442MetAsp: 1.442 ± 0.306
1.311MetGlu: 1.311 ± 0.127
0.656MetPhe: 0.656 ± 0.056
0.787MetGly: 0.787 ± 0.362
1.18MetHis: 1.18 ± 0.053
1.049MetIle: 1.049 ± 0.006
0.918MetLys: 0.918 ± 0.065
2.229MetLeu: 2.229 ± 0.762
0.525MetMet: 0.525 ± 0.241
1.18MetAsn: 1.18 ± 0.291
1.18MetPro: 1.18 ± 0.53
0.918MetGln: 0.918 ± 0.303
1.311MetArg: 1.311 ± 0.127
0.656MetSer: 0.656 ± 0.294
1.705MetThr: 1.705 ± 0.527
1.442MetVal: 1.442 ± 0.171
0.787MetTrp: 0.787 ± 0.353
1.442MetTyr: 1.442 ± 0.783
0.0MetXaa: 0.0 ± 0.0
Asn
4.196AsnAla: 4.196 ± 0.263
2.491AsnCys: 2.491 ± 0.403
3.409AsnAsp: 3.409 ± 0.815
6.819AsnGlu: 6.819 ± 0.199
2.229AsnPhe: 2.229 ± 0.43
4.721AsnGly: 4.721 ± 0.504
1.836AsnHis: 1.836 ± 0.13
10.228AsnIle: 10.228 ± 1.132
7.999AsnLys: 7.999 ± 0.225
8.786AsnLeu: 8.786 ± 0.348
1.967AsnMet: 1.967 ± 0.883
8.917AsnAsn: 8.917 ± 2.197
2.229AsnPro: 2.229 ± 0.047
3.147AsnGln: 3.147 ± 0.495
2.36AsnArg: 2.36 ± 0.61
4.327AsnSer: 4.327 ± 1.396
7.212AsnThr: 7.212 ± 0.816
3.934AsnVal: 3.934 ± 0.38
1.049AsnTrp: 1.049 ± 0.471
3.672AsnTyr: 3.672 ± 0.498
0.0AsnXaa: 0.0 ± 0.0
Pro
1.311ProAla: 1.311 ± 0.112
0.393ProCys: 0.393 ± 0.177
2.36ProAsp: 2.36 ± 0.821
2.36ProGlu: 2.36 ± 0.61
0.918ProPhe: 0.918 ± 0.174
1.574ProGly: 1.574 ± 0.009
0.918ProHis: 0.918 ± 0.065
3.147ProIle: 3.147 ± 0.936
1.967ProLys: 1.967 ± 0.406
3.147ProLeu: 3.147 ± 0.972
1.18ProMet: 1.18 ± 0.291
4.327ProAsn: 4.327 ± 0.204
0.656ProPro: 0.656 ± 0.294
1.442ProGln: 1.442 ± 0.648
0.918ProArg: 0.918 ± 0.542
1.967ProSer: 1.967 ± 0.168
4.458ProThr: 4.458 ± 1.099
2.754ProVal: 2.754 ± 0.282
1.18ProTrp: 1.18 ± 0.663
1.967ProTyr: 1.967 ± 0.071
0.0ProXaa: 0.0 ± 0.0
Gln
1.311GlnAla: 1.311 ± 0.365
0.393GlnCys: 0.393 ± 0.062
0.656GlnAsp: 0.656 ± 0.056
1.967GlnGlu: 1.967 ± 0.406
0.918GlnPhe: 0.918 ± 0.412
2.623GlnGly: 2.623 ± 0.223
1.574GlnHis: 1.574 ± 0.009
3.278GlnIle: 3.278 ± 0.198
1.967GlnLys: 1.967 ± 0.309
3.409GlnLeu: 3.409 ± 0.577
1.18GlnMet: 1.18 ± 0.108
3.278GlnAsn: 3.278 ± 0.518
1.836GlnPro: 1.836 ± 0.347
1.18GlnGln: 1.18 ± 0.053
1.574GlnArg: 1.574 ± 0.009
1.705GlnSer: 1.705 ± 0.288
3.409GlnThr: 3.409 ± 0.1
1.705GlnVal: 1.705 ± 0.288
0.393GlnTrp: 0.393 ± 0.177
1.574GlnTyr: 1.574 ± 0.009
0.0GlnXaa: 0.0 ± 0.0
Arg
1.836ArgAla: 1.836 ± 0.347
0.787ArgCys: 0.787 ± 0.115
2.229ArgAsp: 2.229 ± 0.285
2.491ArgGlu: 2.491 ± 0.074
1.442ArgPhe: 1.442 ± 0.171
1.574ArgGly: 1.574 ± 0.486
1.442ArgHis: 1.442 ± 1.737
3.934ArgIle: 3.934 ± 0.574
2.36ArgLys: 2.36 ± 0.821
5.77ArgLeu: 5.77 ± 0.205
0.918ArgMet: 0.918 ± 0.174
3.278ArgAsn: 3.278 ± 1.39
1.836ArgPro: 1.836 ± 0.13
1.836ArgGln: 1.836 ± 0.347
1.967ArgArg: 1.967 ± 0.168
1.442ArgSer: 1.442 ± 0.545
2.885ArgThr: 2.885 ± 0.136
2.098ArgVal: 2.098 ± 0.703
0.262ArgTrp: 0.262 ± 0.121
1.311ArgTyr: 1.311 ± 0.127
0.0ArgXaa: 0.0 ± 0.0
Ser
2.754SerAla: 2.754 ± 0.282
0.656SerCys: 0.656 ± 0.183
3.934SerAsp: 3.934 ± 0.142
1.836SerGlu: 1.836 ± 0.347
1.311SerPhe: 1.311 ± 0.35
3.541SerGly: 3.541 ± 0.397
1.574SerHis: 1.574 ± 0.486
5.376SerIle: 5.376 ± 0.448
2.754SerLys: 2.754 ± 0.195
2.885SerLeu: 2.885 ± 0.58
0.787SerMet: 0.787 ± 0.124
4.852SerAsn: 4.852 ± 0.207
1.18SerPro: 1.18 ± 0.291
1.836SerGln: 1.836 ± 0.13
1.705SerArg: 1.705 ± 0.05
2.229SerSer: 2.229 ± 0.524
4.196SerThr: 4.196 ± 0.214
1.705SerVal: 1.705 ± 0.189
0.262SerTrp: 0.262 ± 0.121
2.754SerTyr: 2.754 ± 0.759
0.0SerXaa: 0.0 ± 0.0
Thr
4.458ThrAla: 4.458 ± 0.571
2.098ThrCys: 2.098 ± 0.703
4.458ThrAsp: 4.458 ± 0.86
4.327ThrGlu: 4.327 ± 0.035
1.967ThrPhe: 1.967 ± 0.645
4.721ThrGly: 4.721 ± 0.504
2.36ThrHis: 2.36 ± 0.583
7.737ThrIle: 7.737 ± 1.058
5.376ThrLys: 5.376 ± 0.267
5.376ThrLeu: 5.376 ± 0.267
2.098ThrMet: 2.098 ± 0.226
7.474ThrAsn: 7.474 ± 1.653
2.491ThrPro: 2.491 ± 0.403
3.016ThrGln: 3.016 ± 0.077
3.672ThrArg: 3.672 ± 0.26
2.491ThrSer: 2.491 ± 0.403
6.95ThrThr: 6.95 ± 0.735
2.623ThrVal: 2.623 ± 0.254
1.311ThrTrp: 1.311 ± 0.35
4.196ThrTyr: 4.196 ± 1.455
0.0ThrXaa: 0.0 ± 0.0
Val
2.098ValAla: 2.098 ± 0.465
0.656ValCys: 0.656 ± 0.183
2.623ValAsp: 2.623 ± 0.223
2.098ValGlu: 2.098 ± 0.465
1.311ValPhe: 1.311 ± 0.112
3.147ValGly: 3.147 ± 0.459
1.705ValHis: 1.705 ± 0.189
4.458ValIle: 4.458 ± 1.099
3.672ValLys: 3.672 ± 1.171
2.885ValLeu: 2.885 ± 0.103
0.525ValMet: 0.525 ± 0.056
4.458ValAsn: 4.458 ± 0.383
2.098ValPro: 2.098 ± 0.012
1.049ValGln: 1.049 ± 0.006
2.098ValArg: 2.098 ± 0.226
1.574ValSer: 1.574 ± 0.229
3.278ValThr: 3.278 ± 0.518
2.754ValVal: 2.754 ± 0.195
0.525ValTrp: 0.525 ± 0.003
1.967ValTyr: 1.967 ± 0.309
0.0ValXaa: 0.0 ± 0.0
Trp
1.442TrpAla: 1.442 ± 0.171
0.262TrpCys: 0.262 ± 0.359
1.442TrpAsp: 1.442 ± 0.068
0.656TrpGlu: 0.656 ± 0.294
0.262TrpPhe: 0.262 ± 0.118
0.656TrpGly: 0.656 ± 0.056
0.656TrpHis: 0.656 ± 0.294
0.656TrpIle: 0.656 ± 0.183
0.656TrpLys: 0.656 ± 0.183
1.705TrpLeu: 1.705 ± 0.427
0.656TrpMet: 0.656 ± 0.183
1.049TrpAsn: 1.049 ± 0.006
0.918TrpPro: 0.918 ± 0.542
0.525TrpGln: 0.525 ± 0.235
0.525TrpArg: 0.525 ± 0.003
1.18TrpSer: 1.18 ± 0.291
0.525TrpThr: 0.525 ± 0.003
0.525TrpVal: 0.525 ± 0.235
0.262TrpTrp: 0.262 ± 0.118
0.262TrpTyr: 0.262 ± 0.121
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.229TyrAla: 2.229 ± 0.047
0.525TyrCys: 0.525 ± 0.241
3.672TyrAsp: 3.672 ± 0.694
3.016TyrGlu: 3.016 ± 0.4
0.918TyrPhe: 0.918 ± 0.303
2.885TyrGly: 2.885 ± 1.09
1.311TyrHis: 1.311 ± 0.127
4.196TyrIle: 4.196 ± 1.216
3.147TyrLys: 3.147 ± 0.018
3.278TyrLeu: 3.278 ± 0.279
0.918TyrMet: 0.918 ± 0.303
4.458TyrAsn: 4.458 ± 0.383
1.049TyrPro: 1.049 ± 0.006
1.442TyrGln: 1.442 ± 0.171
1.967TyrArg: 1.967 ± 0.071
1.836TyrSer: 1.836 ± 0.368
3.409TyrThr: 3.409 ± 0.1
1.442TyrVal: 1.442 ± 0.171
1.311TyrTrp: 1.311 ± 0.604
1.705TyrTyr: 1.705 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (7627 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski