Amino acid dipeptide frequency for Athtab bunya-like virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
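As a minimal sketch of the calculation described above, the following Python snippet counts overlapping residue pairs in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. The function names, the one-letter alphabet, the mapping of non-standard letters to X (Xaa), and the normalization by the total number of dipeptides are assumptions made for illustration; the database may count and normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all 441 pairs.

    Hypothetical helper: overlapping pairs are counted within each protein,
    and any non-standard letter is treated as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"    # would be marked red
    if value_permille < expected:
        return "under"   # would be marked blue
    return "expected"


# Toy input, not the real proteome.
toy = ["MKLLAV", "LLKX"]
freqs = dipeptide_permille(toy)
print(freqs["LL"], classify(freqs["LL"]))
```

With real data, the input would be the protein sequences of the proteome rather than the toy strings used here.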

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
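The unit conversion can be illustrated with a short Python sketch. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 2.085  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))

# The uniform expectation of 1 in 400 dipeptides, expressed in per mille.
uniform_permille = 1000.0 / 400
print(uniform_permille)
```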

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.085 ± 1.076
AlaCys: 2.085 ± 0.763
AlaAsp: 3.277 ± 0.789
AlaGlu: 2.681 ± 0.672
AlaPhe: 1.787 ± 0.324
AlaGly: 1.787 ± 0.433
AlaHis: 0.596 ± 1.305
AlaIle: 3.873 ± 1.58
AlaLys: 3.575 ± 1.012
AlaLeu: 3.575 ± 0.649
AlaMet: 0.894 ± 0.464
AlaAsn: 1.489 ± 0.592
AlaPro: 0.894 ± 0.464
AlaGln: 2.383 ± 1.577
AlaArg: 1.787 ± 0.433
AlaSer: 2.979 ± 1.129
AlaThr: 1.787 ± 0.433
AlaVal: 3.873 ± 1.306
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.787 ± 1.082
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.085 ± 0.95
CysCys: 0.596 ± 0.25
CysAsp: 1.787 ± 1.386
CysGlu: 2.085 ± 1.236
CysPhe: 1.192 ± 1.457
CysGly: 1.192 ± 0.948
CysHis: 0.596 ± 0.711
CysIle: 1.192 ± 0.546
CysLys: 2.085 ± 0.685
CysLeu: 2.383 ± 0.195
CysMet: 0.894 ± 0.316
CysAsn: 1.489 ± 1.118
CysPro: 0.894 ± 0.464
CysGln: 1.192 ± 0.948
CysArg: 0.298 ± 0.355
CysSer: 2.085 ± 1.061
CysThr: 2.085 ± 1.717
CysVal: 0.894 ± 0.595
CysTrp: 0.596 ± 0.309
CysTyr: 1.489 ± 0.636
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.979 ± 1.129
AspCys: 1.787 ± 0.484
AspAsp: 2.979 ± 1.546
AspGlu: 4.17 ± 1.549
AspPhe: 4.766 ± 1.299
AspGly: 3.575 ± 0.529
AspHis: 0.894 ± 0.217
AspIle: 2.681 ± 0.978
AspLys: 4.766 ± 0.645
AspLeu: 6.553 ± 1.621
AspMet: 0.894 ± 0.217
AspAsn: 3.575 ± 0.95
AspPro: 2.085 ± 0.344
AspGln: 0.894 ± 0.464
AspArg: 3.873 ± 0.417
AspSer: 3.873 ± 1.09
AspThr: 2.979 ± 0.748
AspVal: 3.575 ± 1.434
AspTrp: 0.894 ± 0.464
AspTyr: 1.787 ± 1.187
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.17 ± 1.073
GluCys: 0.596 ± 0.711
GluAsp: 3.575 ± 0.843
GluGlu: 5.66 ± 1.311
GluPhe: 2.979 ± 0.311
GluGly: 4.766 ± 1.012
GluHis: 0.894 ± 0.464
GluIle: 3.873 ± 1.213
GluLys: 7.447 ± 2.283
GluLeu: 6.851 ± 1.272
GluMet: 2.979 ± 1.546
GluAsn: 2.979 ± 0.797
GluPro: 1.489 ± 1.118
GluGln: 2.383 ± 0.943
GluArg: 1.787 ± 0.535
GluSer: 4.468 ± 1.031
GluThr: 3.277 ± 0.174
GluVal: 4.468 ± 1.031
GluTrp: 1.192 ± 0.618
GluTyr: 2.383 ± 1.091
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.277 ± 1.432
PheCys: 1.192 ± 1.158
PheAsp: 1.787 ± 0.916
PheGlu: 2.979 ± 0.182
PhePhe: 2.383 ± 0.649
PheGly: 2.085 ± 0.763
PheHis: 1.489 ± 0.398
PheIle: 4.17 ± 1.245
PheLys: 5.362 ± 1.531
PheLeu: 5.064 ± 0.478
PheMet: 0.894 ± 0.464
PheAsn: 0.596 ± 0.25
PhePro: 0.894 ± 0.217
PheGln: 2.085 ± 0.344
PheArg: 2.383 ± 0.562
PheSer: 4.468 ± 0.326
PheThr: 2.979 ± 1.488
PheVal: 3.575 ± 0.866
PheTrp: 0.894 ± 0.217
PheTyr: 2.383 ± 0.562
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.489 ± 0.442
GlyCys: 1.787 ± 0.916
GlyAsp: 4.766 ± 1.328
GlyGlu: 3.873 ± 0.885
GlyPhe: 2.681 ± 0.932
GlyGly: 2.383 ± 0.195
GlyHis: 0.596 ± 0.25
GlyIle: 0.596 ± 0.579
GlyLys: 3.873 ± 1.09
GlyLeu: 6.256 ± 1.739
GlyMet: 2.383 ± 1.087
GlyAsn: 1.787 ± 0.535
GlyPro: 1.787 ± 0.916
GlyGln: 1.787 ± 0.433
GlyArg: 2.085 ± 0.335
GlySer: 2.085 ± 0.344
GlyThr: 2.383 ± 0.195
GlyVal: 3.873 ± 0.95
GlyTrp: 0.298 ± 0.155
GlyTyr: 1.192 ± 0.789
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.894 ± 0.541
HisCys: 1.192 ± 0.501
HisAsp: 0.596 ± 0.309
HisGlu: 1.489 ± 0.398
HisPhe: 2.979 ± 0.311
HisGly: 0.298 ± 0.155
HisHis: 0.596 ± 0.579
HisIle: 2.085 ± 0.68
HisLys: 1.787 ± 0.535
HisLeu: 2.383 ± 1.693
HisMet: 0.298 ± 0.355
HisAsn: 0.894 ± 0.464
HisPro: 1.192 ± 0.281
HisGln: 0.298 ± 0.155
HisArg: 1.192 ± 0.281
HisSer: 0.298 ± 0.155
HisThr: 0.894 ± 0.541
HisVal: 1.192 ± 0.471
HisTrp: 0.298 ± 0.355
HisTyr: 0.596 ± 0.711
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.787 ± 0.928
IleCys: 0.596 ± 0.579
IleAsp: 2.383 ± 0.892
IleGlu: 4.17 ± 2.165
IlePhe: 2.085 ± 0.685
IleGly: 3.575 ± 1.503
IleHis: 2.383 ± 0.562
IleIle: 2.085 ± 0.477
IleLys: 3.873 ± 0.62
IleLeu: 5.064 ± 1.363
IleMet: 2.383 ± 1.237
IleAsn: 5.064 ± 1.229
IlePro: 1.787 ± 0.324
IleGln: 3.277 ± 0.32
IleArg: 2.383 ± 0.562
IleSer: 5.66 ± 1.466
IleThr: 4.468 ± 4.695
IleVal: 2.979 ± 2.235
IleTrp: 0.894 ± 0.217
IleTyr: 3.277 ± 1.701
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.787 ± 0.535
LysCys: 2.085 ± 0.763
LysAsp: 4.468 ± 1.255
LysGlu: 4.766 ± 0.645
LysPhe: 3.277 ± 1.281
LysGly: 2.979 ± 1.252
LysHis: 1.787 ± 0.535
LysIle: 8.341 ± 0.482
LysLys: 5.362 ± 2.355
LysLeu: 8.043 ± 1.847
LysMet: 1.192 ± 0.58
LysAsn: 3.873 ± 1.213
LysPro: 2.681 ± 1.061
LysGln: 3.277 ± 1.281
LysArg: 3.873 ± 1.587
LysSer: 4.468 ± 1.679
LysThr: 4.17 ± 2.7
LysVal: 6.256 ± 2.055
LysTrp: 1.489 ± 0.442
LysTyr: 2.681 ± 0.978
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.575 ± 0.649
LeuCys: 1.787 ± 0.324
LeuAsp: 5.362 ± 1.387
LeuGlu: 7.447 ± 1.193
LeuPhe: 5.362 ± 0.602
LeuGly: 4.766 ± 1.22
LeuHis: 2.085 ± 0.68
LeuIle: 5.958 ± 1.087
LeuLys: 8.937 ± 1.779
LeuLeu: 9.83 ± 2.786
LeuMet: 3.277 ± 0.931
LeuAsn: 4.468 ± 0.326
LeuPro: 3.873 ± 0.417
LeuGln: 2.979 ± 0.182
LeuArg: 5.064 ± 2.201
LeuSer: 8.341 ± 1.083
LeuThr: 6.851 ± 1.3
LeuVal: 6.553 ± 0.348
LeuTrp: 0.894 ± 0.595
LeuTyr: 3.873 ± 0.95
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.085 ± 0.477
MetCys: 1.192 ± 0.281
MetAsp: 1.489 ± 0.398
MetGlu: 1.489 ± 0.442
MetPhe: 0.894 ± 0.464
MetGly: 1.787 ± 0.324
MetHis: 1.192 ± 0.546
MetIle: 2.383 ± 0.562
MetLys: 1.489 ± 0.773
MetLeu: 2.085 ± 0.763
MetMet: 0.596 ± 0.25
MetAsn: 2.979 ± 0.797
MetPro: 0.596 ± 0.25
MetGln: 0.596 ± 0.309
MetArg: 0.894 ± 0.217
MetSer: 3.873 ± 1.587
MetThr: 1.787 ± 0.484
MetVal: 0.894 ± 0.464
MetTrp: 0.596 ± 0.309
MetTyr: 1.192 ± 0.281
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.192 ± 0.789
AsnCys: 2.979 ± 1.272
AsnAsp: 1.787 ± 0.535
AsnGlu: 1.787 ± 0.535
AsnPhe: 3.277 ± 0.806
AsnGly: 1.489 ± 0.592
AsnHis: 0.894 ± 0.594
AsnIle: 3.575 ± 1.323
AsnLys: 2.383 ± 0.195
AsnLeu: 7.149 ± 0.936
AsnMet: 0.894 ± 0.464
AsnAsn: 2.383 ± 0.828
AsnPro: 2.383 ± 1.237
AsnGln: 0.894 ± 0.464
AsnArg: 2.979 ± 0.797
AsnSer: 2.383 ± 0.828
AsnThr: 3.277 ± 0.886
AsnVal: 5.362 ± 1.344
AsnTrp: 1.192 ± 0.281
AsnTyr: 3.277 ± 0.931
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.596 ± 1.305
ProCys: 0.298 ± 0.355
ProAsp: 1.787 ± 0.928
ProGlu: 3.873 ± 1.43
ProPhe: 2.085 ± 0.775
ProGly: 1.489 ± 0.442
ProHis: 0.298 ± 0.155
ProIle: 2.681 ± 1.913
ProLys: 1.489 ± 0.773
ProLeu: 2.383 ± 0.649
ProMet: 0.298 ± 0.355
ProAsn: 1.787 ± 0.672
ProPro: 0.596 ± 0.579
ProGln: 0.894 ± 0.595
ProArg: 2.085 ± 0.775
ProSer: 2.681 ± 1.529
ProThr: 0.894 ± 0.541
ProVal: 2.383 ± 0.195
ProTrp: 0.0 ± 0.0
ProTyr: 1.489 ± 0.592
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.085 ± 0.344
GlnCys: 0.596 ± 0.579
GlnAsp: 3.277 ± 0.753
GlnGlu: 2.383 ± 0.562
GlnPhe: 1.489 ± 0.374
GlnGly: 2.085 ± 0.95
GlnHis: 0.596 ± 0.25
GlnIle: 1.192 ± 0.501
GlnLys: 2.085 ± 1.082
GlnLeu: 1.787 ± 0.484
GlnMet: 1.787 ± 0.535
GlnAsn: 2.085 ± 0.344
GlnPro: 0.298 ± 0.155
GlnGln: 0.0 ± 0.0
GlnArg: 1.192 ± 0.618
GlnSer: 3.277 ± 0.174
GlnThr: 2.085 ± 0.68
GlnVal: 2.383 ± 1.087
GlnTrp: 0.298 ± 0.155
GlnTyr: 0.298 ± 0.355
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.979 ± 0.311
ArgCys: 0.894 ± 0.595
ArgAsp: 3.575 ± 1.855
ArgGlu: 1.787 ± 0.928
ArgPhe: 2.681 ± 0.65
ArgGly: 2.383 ± 1.237
ArgHis: 0.596 ± 0.309
ArgIle: 2.085 ± 0.68
ArgLys: 2.979 ± 0.667
ArgLeu: 4.766 ± 1.079
ArgMet: 0.894 ± 0.248
ArgAsn: 3.575 ± 1.125
ArgPro: 1.787 ± 0.535
ArgGln: 0.894 ± 0.217
ArgArg: 2.979 ± 0.797
ArgSer: 4.468 ± 1.083
ArgThr: 2.383 ± 0.828
ArgVal: 3.277 ± 0.931
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.894 ± 0.541
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.383 ± 0.424
SerCys: 2.085 ± 1.089
SerAsp: 4.468 ± 1.894
SerGlu: 4.468 ± 0.494
SerPhe: 3.575 ± 0.665
SerGly: 3.873 ± 1.43
SerHis: 1.192 ± 0.281
SerIle: 4.17 ± 1.549
SerLys: 5.064 ± 1.162
SerLeu: 8.639 ± 1.981
SerMet: 3.575 ± 0.109
SerAsn: 1.787 ± 0.324
SerPro: 1.489 ± 0.374
SerGln: 3.575 ± 0.468
SerArg: 3.575 ± 1.125
SerSer: 6.256 ± 0.759
SerThr: 2.681 ± 3.263
SerVal: 5.958 ± 1.607
SerTrp: 1.489 ± 0.398
SerTyr: 3.575 ± 0.843
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.085 ± 0.335
ThrCys: 2.681 ± 2.775
ThrAsp: 2.681 ± 0.837
ThrGlu: 2.979 ± 1.488
ThrPhe: 2.979 ± 1.653
ThrGly: 3.277 ± 1.587
ThrHis: 1.787 ± 1.082
ThrIle: 3.575 ± 2.051
ThrLys: 3.873 ± 0.885
ThrLeu: 5.66 ± 0.846
ThrMet: 2.085 ± 0.68
ThrAsn: 4.17 ± 1.97
ThrPro: 1.489 ± 0.592
ThrGln: 1.787 ± 0.324
ThrArg: 0.894 ± 0.464
ThrSer: 4.468 ± 4.749
ThrThr: 6.851 ± 11.548
ThrVal: 3.277 ± 2.886
ThrTrp: 0.894 ± 0.541
ThrTyr: 1.787 ± 1.026
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.277 ± 0.32
ValCys: 1.787 ± 1.19
ValAsp: 7.149 ± 2.004
ValGlu: 7.447 ± 1.007
ValPhe: 2.681 ± 0.108
ValGly: 2.383 ± 1.237
ValHis: 1.489 ± 0.442
ValIle: 3.575 ± 0.468
ValLys: 3.873 ± 1.43
ValLeu: 5.362 ± 1.673
ValMet: 1.787 ± 0.484
ValAsn: 3.277 ± 0.683
ValPro: 2.085 ± 1.731
ValGln: 1.192 ± 0.618
ValArg: 4.468 ± 1.507
ValSer: 4.468 ± 1.031
ValThr: 3.873 ± 2.005
ValVal: 6.256 ± 3.672
ValTrp: 1.489 ± 0.374
ValTyr: 1.787 ± 1.026
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.894 ± 0.595
TrpCys: 0.596 ± 0.25
TrpAsp: 0.298 ± 0.155
TrpGlu: 0.894 ± 0.464
TrpPhe: 0.298 ± 0.155
TrpGly: 0.596 ± 0.309
TrpHis: 0.298 ± 0.355
TrpIle: 1.489 ± 0.398
TrpLys: 1.192 ± 0.281
TrpLeu: 1.787 ± 0.535
TrpMet: 0.298 ± 0.155
TrpAsn: 0.596 ± 0.309
TrpPro: 0.596 ± 0.25
TrpGln: 0.298 ± 0.155
TrpArg: 0.298 ± 0.155
TrpSer: 0.298 ± 0.155
TrpThr: 0.894 ± 1.224
TrpVal: 0.894 ± 0.217
TrpTrp: 0.298 ± 0.355
TrpTyr: 0.894 ± 0.217
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.489 ± 0.442
TyrCys: 0.596 ± 0.25
TyrAsp: 2.085 ± 1.082
TyrGlu: 1.787 ± 0.433
TyrPhe: 1.489 ± 0.773
TyrGly: 1.192 ± 0.618
TyrHis: 1.192 ± 0.789
TyrIle: 0.596 ± 0.25
TyrLys: 5.362 ± 2.423
TyrLeu: 5.66 ± 1.105
TyrMet: 1.489 ± 0.442
TyrAsn: 2.681 ± 0.978
TyrPro: 1.192 ± 0.281
TyrGln: 0.596 ± 0.25
TyrArg: 1.787 ± 0.324
TyrSer: 2.979 ± 0.182
TyrThr: 2.681 ± 2.956
TyrVal: 1.787 ± 0.535
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.192 ± 0.546
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3358 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
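A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names are hypothetical, and using the population standard deviation over 100 replicates as the error is an assumption; the source does not describe the exact procedure.

```python
import random
import statistics


def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    pairs = [s[i:i + 2] for s in sequences for i in range(len(s) - 1)]
    return 1000.0 * pairs.count(dipeptide) / len(pairs) if pairs else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy sequences, not the real proteome.
toy = ["MKLLAVLL", "LLKAAKLL", "MKKAVA"]
print(dipeptide_permille(toy, "LL"), "±", bootstrap_error(toy, "LL"))
```

A dipeptide that is absent from every protein yields 0.0 in each replicate, which is consistent with the 0.0 ± 0.0 entries in the table. With only three proteins, the replicates can differ strongly from one another, so large errors relative to the value are to be expected.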

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski