Amino acid dipepetide frequency for Tete orthobunyavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.763AlaAla: 1.763 ± 1.292
1.008AlaCys: 1.008 ± 0.26
2.015AlaAsp: 2.015 ± 0.911
3.275AlaGlu: 3.275 ± 0.197
1.259AlaPhe: 1.259 ± 0.757
3.275AlaGly: 3.275 ± 0.96
1.763AlaHis: 1.763 ± 0.54
5.29AlaIle: 5.29 ± 0.534
4.534AlaLys: 4.534 ± 1.602
3.275AlaLeu: 3.275 ± 1.066
2.267AlaMet: 2.267 ± 0.513
3.275AlaAsn: 3.275 ± 0.252
1.511AlaPro: 1.511 ± 0.513
1.511AlaGln: 1.511 ± 0.828
2.267AlaArg: 2.267 ± 2.026
2.519AlaSer: 2.519 ± 0.655
1.763AlaThr: 1.763 ± 0.477
2.771AlaVal: 2.771 ± 1.904
0.252AlaTrp: 0.252 ± 0.156
2.267AlaTyr: 2.267 ± 1.197
0.0AlaXaa: 0.0 ± 0.0
Cys
2.015CysAla: 2.015 ± 0.521
0.0CysCys: 0.0 ± 0.0
0.756CysAsp: 0.756 ± 0.677
1.259CysGlu: 1.259 ± 0.454
1.259CysPhe: 1.259 ± 0.454
2.519CysGly: 2.519 ± 2.256
1.008CysHis: 1.008 ± 0.956
2.267CysIle: 2.267 ± 0.705
1.763CysLys: 1.763 ± 0.887
2.519CysLeu: 2.519 ± 0.908
0.504CysMet: 0.504 ± 0.726
1.259CysAsn: 1.259 ± 0.454
1.259CysPro: 1.259 ± 0.454
1.511CysGln: 1.511 ± 0.668
2.015CysArg: 2.015 ± 1.453
1.008CysSer: 1.008 ± 0.902
0.504CysThr: 0.504 ± 0.13
0.756CysVal: 0.756 ± 0.334
0.0CysTrp: 0.0 ± 0.0
1.259CysTyr: 1.259 ± 0.269
0.0CysXaa: 0.0 ± 0.0
Asp
2.015AspAla: 2.015 ± 0.706
1.008AspCys: 1.008 ± 0.26
5.038AspAsp: 5.038 ± 1.477
3.526AspGlu: 3.526 ± 1.46
5.038AspPhe: 5.038 ± 1.138
1.763AspGly: 1.763 ± 1.292
0.756AspHis: 0.756 ± 0.467
5.542AspIle: 5.542 ± 1.565
3.023AspLys: 3.023 ± 0.473
4.786AspLeu: 4.786 ± 0.758
2.267AspMet: 2.267 ± 0.532
3.275AspAsn: 3.275 ± 1.066
3.023AspPro: 3.023 ± 1.168
2.519AspGln: 2.519 ± 0.586
2.771AspArg: 2.771 ± 0.611
2.771AspSer: 2.771 ± 0.832
3.275AspThr: 3.275 ± 0.596
3.778AspVal: 3.778 ± 0.887
1.008AspTrp: 1.008 ± 0.307
2.771AspTyr: 2.771 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
2.771GluAla: 2.771 ± 1.047
1.511GluCys: 1.511 ± 1.003
4.282GluAsp: 4.282 ± 0.805
5.542GluGlu: 5.542 ± 0.876
4.534GluPhe: 4.534 ± 1.752
2.519GluGly: 2.519 ± 0.655
1.259GluHis: 1.259 ± 0.454
6.297GluIle: 6.297 ± 1.393
6.801GluLys: 6.801 ± 0.966
5.29GluLeu: 5.29 ± 0.418
3.023GluMet: 3.023 ± 1.036
3.275GluAsn: 3.275 ± 0.759
2.519GluPro: 2.519 ± 0.325
2.771GluGln: 2.771 ± 1.047
2.771GluArg: 2.771 ± 1.058
5.038GluSer: 5.038 ± 0.566
1.763GluThr: 1.763 ± 0.384
3.526GluVal: 3.526 ± 0.732
0.504GluTrp: 0.504 ± 0.311
2.267GluTyr: 2.267 ± 0.345
0.0GluXaa: 0.0 ± 0.0
Phe
3.526PheAla: 3.526 ± 1.515
1.511PheCys: 1.511 ± 0.605
3.275PheAsp: 3.275 ± 0.759
3.275PheGlu: 3.275 ± 0.252
1.763PhePhe: 1.763 ± 2.114
2.519PheGly: 2.519 ± 1.382
1.008PheHis: 1.008 ± 0.26
3.275PheIle: 3.275 ± 1.066
5.29PheLys: 5.29 ± 0.533
4.03PheLeu: 4.03 ± 1.037
2.015PheMet: 2.015 ± 2.121
3.778PheAsn: 3.778 ± 0.874
1.008PhePro: 1.008 ± 1.452
0.756PheGln: 0.756 ± 0.467
1.259PheArg: 1.259 ± 0.454
4.534PheSer: 4.534 ± 0.598
2.519PheThr: 2.519 ± 1.382
2.771PheVal: 2.771 ± 0.375
0.504PheTrp: 0.504 ± 0.13
2.015PheTyr: 2.015 ± 0.615
0.0PheXaa: 0.0 ± 0.0
Gly
1.511GlyAla: 1.511 ± 2.173
1.511GlyCys: 1.511 ± 0.391
4.786GlyAsp: 4.786 ± 0.753
4.03GlyGlu: 4.03 ± 1.229
2.771GlyPhe: 2.771 ± 1.754
1.763GlyGly: 1.763 ± 1.292
1.008GlyHis: 1.008 ± 0.26
2.519GlyIle: 2.519 ± 0.651
3.275GlyLys: 3.275 ± 1.46
4.534GlyLeu: 4.534 ± 0.783
0.252GlyMet: 0.252 ± 0.226
2.015GlyAsn: 2.015 ± 0.615
0.756GlyPro: 0.756 ± 0.177
2.267GlyGln: 2.267 ± 0.507
1.763GlyArg: 1.763 ± 1.09
3.778GlySer: 3.778 ± 2.412
3.778GlyThr: 3.778 ± 2.085
3.023GlyVal: 3.023 ± 1.041
1.008GlyTrp: 1.008 ± 0.555
1.511GlyTyr: 1.511 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
1.511HisAla: 1.511 ± 0.952
1.008HisCys: 1.008 ± 0.26
0.756HisAsp: 0.756 ± 0.467
1.259HisGlu: 1.259 ± 0.778
2.015HisPhe: 2.015 ± 1.028
2.267HisGly: 2.267 ± 0.504
1.008HisHis: 1.008 ± 0.615
1.008HisIle: 1.008 ± 0.26
3.023HisLys: 3.023 ± 1.138
3.275HisLeu: 3.275 ± 0.197
0.504HisMet: 0.504 ± 0.13
0.756HisAsn: 0.756 ± 0.177
1.008HisPro: 1.008 ± 0.716
0.0HisGln: 0.0 ± 0.0
1.511HisArg: 1.511 ± 0.513
3.023HisSer: 3.023 ± 1.031
0.756HisThr: 0.756 ± 0.334
0.252HisVal: 0.252 ± 0.156
0.0HisTrp: 0.0 ± 0.0
1.259HisTyr: 1.259 ± 1.128
0.0HisXaa: 0.0 ± 0.0
Ile
4.03IleAla: 4.03 ± 0.291
1.763IleCys: 1.763 ± 0.887
4.282IleAsp: 4.282 ± 0.444
6.801IleGlu: 6.801 ± 1.016
2.519IlePhe: 2.519 ± 0.908
3.023IleGly: 3.023 ± 0.781
3.023IleHis: 3.023 ± 0.922
4.534IleIle: 4.534 ± 0.598
7.809IleLys: 7.809 ± 1.62
8.564IleLeu: 8.564 ± 2.085
1.763IleMet: 1.763 ± 0.629
4.786IleAsn: 4.786 ± 1.495
1.511IlePro: 1.511 ± 0.355
2.267IleGln: 2.267 ± 1.066
2.267IleArg: 2.267 ± 0.501
5.542IleSer: 5.542 ± 1.188
6.045IleThr: 6.045 ± 1.312
3.023IleVal: 3.023 ± 1.209
1.008IleTrp: 1.008 ± 0.307
2.771IleTyr: 2.771 ± 1.119
0.0IleXaa: 0.0 ± 0.0
Lys
4.534LysAla: 4.534 ± 3.279
2.771LysCys: 2.771 ± 2.129
4.282LysAsp: 4.282 ± 0.307
6.549LysGlu: 6.549 ± 2.76
5.29LysPhe: 5.29 ± 0.958
4.786LysGly: 4.786 ± 0.814
2.015LysHis: 2.015 ± 0.454
5.038LysIle: 5.038 ± 0.332
6.045LysLys: 6.045 ± 1.53
5.29LysLeu: 5.29 ± 1.141
3.023LysMet: 3.023 ± 0.683
4.282LysAsn: 4.282 ± 0.775
3.526LysPro: 3.526 ± 1.1
1.763LysGln: 1.763 ± 0.477
3.275LysArg: 3.275 ± 1.318
4.282LysSer: 4.282 ± 0.906
7.557LysThr: 7.557 ± 0.817
4.282LysVal: 4.282 ± 0.943
1.008LysTrp: 1.008 ± 0.26
3.526LysTyr: 3.526 ± 0.954
0.0LysXaa: 0.0 ± 0.0
Leu
4.534LeuAla: 4.534 ± 1.538
2.015LeuCys: 2.015 ± 0.786
7.305LeuAsp: 7.305 ± 1.631
4.786LeuGlu: 4.786 ± 0.411
3.275LeuPhe: 3.275 ± 0.826
4.282LeuGly: 4.282 ± 0.665
1.259LeuHis: 1.259 ± 0.269
4.282LeuIle: 4.282 ± 1.127
7.053LeuLys: 7.053 ± 1.834
9.32LeuLeu: 9.32 ± 2.744
2.519LeuMet: 2.519 ± 0.907
4.282LeuAsn: 4.282 ± 1.392
3.526LeuPro: 3.526 ± 0.736
3.526LeuGln: 3.526 ± 1.156
3.526LeuArg: 3.526 ± 2.179
6.045LeuSer: 6.045 ± 0.463
6.297LeuThr: 6.297 ± 2.83
4.786LeuVal: 4.786 ± 1.609
0.504LeuTrp: 0.504 ± 0.13
4.534LeuTyr: 4.534 ± 0.973
0.0LeuXaa: 0.0 ± 0.0
Met
2.267MetAla: 2.267 ± 0.81
0.504MetCys: 0.504 ± 0.13
1.763MetAsp: 1.763 ± 0.92
1.259MetGlu: 1.259 ± 0.757
1.008MetPhe: 1.008 ± 0.714
1.008MetGly: 1.008 ± 0.26
1.008MetHis: 1.008 ± 0.26
3.023MetIle: 3.023 ± 0.71
0.252MetLys: 0.252 ± 0.156
3.526MetLeu: 3.526 ± 0.917
0.252MetMet: 0.252 ± 0.156
1.763MetAsn: 1.763 ± 0.459
1.259MetPro: 1.259 ± 0.579
1.008MetGln: 1.008 ± 0.307
1.763MetArg: 1.763 ± 1.301
2.519MetSer: 2.519 ± 0.539
2.771MetThr: 2.771 ± 1.152
1.008MetVal: 1.008 ± 0.623
0.252MetTrp: 0.252 ± 0.226
1.511MetTyr: 1.511 ± 0.355
0.0MetXaa: 0.0 ± 0.0
Asn
2.015AsnAla: 2.015 ± 0.615
1.008AsnCys: 1.008 ± 0.902
4.786AsnAsp: 4.786 ± 1.666
4.03AsnGlu: 4.03 ± 1.002
2.267AsnPhe: 2.267 ± 0.759
1.259AsnGly: 1.259 ± 0.454
1.008AsnHis: 1.008 ± 0.555
3.526AsnIle: 3.526 ± 0.076
5.038AsnLys: 5.038 ± 1.138
6.549AsnLeu: 6.549 ± 0.505
2.015AsnMet: 2.015 ± 0.324
4.534AsnAsn: 4.534 ± 1.409
2.267AsnPro: 2.267 ± 1.197
1.763AsnGln: 1.763 ± 0.477
1.008AsnArg: 1.008 ± 1.462
3.275AsnSer: 3.275 ± 0.826
2.519AsnThr: 2.519 ± 0.655
4.534AsnVal: 4.534 ± 0.783
1.259AsnTrp: 1.259 ± 0.454
3.023AsnTyr: 3.023 ± 0.71
0.0AsnXaa: 0.0 ± 0.0
Pro
1.259ProAla: 1.259 ± 0.454
0.252ProCys: 0.252 ± 0.226
1.511ProAsp: 1.511 ± 0.605
2.267ProGlu: 2.267 ± 0.501
1.511ProPhe: 1.511 ± 0.513
2.519ProGly: 2.519 ± 0.586
0.252ProHis: 0.252 ± 0.226
3.526ProIle: 3.526 ± 1.1
3.023ProLys: 3.023 ± 1.138
2.015ProLeu: 2.015 ± 0.454
1.259ProMet: 1.259 ± 1.421
2.267ProAsn: 2.267 ± 0.81
0.756ProPro: 0.756 ± 0.467
0.504ProGln: 0.504 ± 0.13
1.008ProArg: 1.008 ± 0.307
1.763ProSer: 1.763 ± 1.408
1.763ProThr: 1.763 ± 0.578
2.519ProVal: 2.519 ± 0.908
0.756ProTrp: 0.756 ± 0.177
1.008ProTyr: 1.008 ± 0.26
0.0ProXaa: 0.0 ± 0.0
Gln
1.259GlnAla: 1.259 ± 0.579
0.252GlnCys: 0.252 ± 0.226
2.771GlnAsp: 2.771 ± 1.07
1.763GlnGlu: 1.763 ± 0.384
1.008GlnPhe: 1.008 ± 0.26
1.511GlnGly: 1.511 ± 0.391
0.504GlnHis: 0.504 ± 0.311
2.771GlnIle: 2.771 ± 1.375
3.023GlnLys: 3.023 ± 1.53
1.259GlnLeu: 1.259 ± 0.454
0.756GlnMet: 0.756 ± 0.685
2.015GlnAsn: 2.015 ± 0.437
1.259GlnPro: 1.259 ± 0.269
0.756GlnGln: 0.756 ± 0.334
3.023GlnArg: 3.023 ± 0.71
2.015GlnSer: 2.015 ± 0.437
1.763GlnThr: 1.763 ± 0.477
3.778GlnVal: 3.778 ± 1.317
0.504GlnTrp: 0.504 ± 0.311
0.252GlnTyr: 0.252 ± 0.156
0.0GlnXaa: 0.0 ± 0.0
Arg
2.771ArgAla: 2.771 ± 1.375
1.008ArgCys: 1.008 ± 0.615
2.267ArgAsp: 2.267 ± 0.345
3.778ArgGlu: 3.778 ± 1.239
2.015ArgPhe: 2.015 ± 0.706
1.008ArgGly: 1.008 ± 0.26
2.015ArgHis: 2.015 ± 0.454
4.786ArgIle: 4.786 ± 1.396
3.526ArgLys: 3.526 ± 0.076
3.778ArgLeu: 3.778 ± 1.361
0.756ArgMet: 0.756 ± 0.177
2.771ArgAsn: 2.771 ± 0.783
1.008ArgPro: 1.008 ± 0.307
1.259ArgGln: 1.259 ± 1.393
1.763ArgArg: 1.763 ± 0.758
2.015ArgSer: 2.015 ± 0.426
1.511ArgThr: 1.511 ± 0.605
2.015ArgVal: 2.015 ± 0.706
0.0ArgTrp: 0.0 ± 0.0
2.015ArgTyr: 2.015 ± 0.521
0.0ArgXaa: 0.0 ± 0.0
Ser
2.519SerAla: 2.519 ± 0.655
2.519SerCys: 2.519 ± 1.557
4.786SerAsp: 4.786 ± 0.411
3.275SerGlu: 3.275 ± 0.96
3.275SerPhe: 3.275 ± 1.614
2.519SerGly: 2.519 ± 0.651
0.756SerHis: 0.756 ± 0.177
5.793SerIle: 5.793 ± 0.657
5.793SerLys: 5.793 ± 0.839
7.809SerLeu: 7.809 ± 1.321
1.763SerMet: 1.763 ± 0.384
4.03SerAsn: 4.03 ± 0.806
1.008SerPro: 1.008 ± 0.623
1.511SerGln: 1.511 ± 0.391
3.023SerArg: 3.023 ± 1.209
4.786SerSer: 4.786 ± 1.034
5.29SerThr: 5.29 ± 1.735
1.763SerVal: 1.763 ± 0.384
1.259SerTrp: 1.259 ± 0.778
1.259SerTyr: 1.259 ± 0.454
0.0SerXaa: 0.0 ± 0.0
Thr
3.275ThrAla: 3.275 ± 0.96
1.763ThrCys: 1.763 ± 0.887
1.763ThrAsp: 1.763 ± 0.8
4.534ThrGlu: 4.534 ± 1.009
3.023ThrPhe: 3.023 ± 1.879
4.282ThrGly: 4.282 ± 1.757
2.015ThrHis: 2.015 ± 1.65
5.542ThrIle: 5.542 ± 0.498
5.038ThrLys: 5.038 ± 0.566
4.03ThrLeu: 4.03 ± 0.655
1.511ThrMet: 1.511 ± 0.668
2.519ThrAsn: 2.519 ± 0.651
2.267ThrPro: 2.267 ± 0.705
1.511ThrGln: 1.511 ± 0.391
1.008ThrArg: 1.008 ± 0.623
3.778ThrSer: 3.778 ± 0.887
2.519ThrThr: 2.519 ± 0.908
3.023ThrVal: 3.023 ± 0.781
1.259ThrTrp: 1.259 ± 0.603
4.282ThrTyr: 4.282 ± 1.373
0.0ThrXaa: 0.0 ± 0.0
Val
3.023ValAla: 3.023 ± 0.473
2.267ValCys: 2.267 ± 0.81
2.015ValAsp: 2.015 ± 0.437
4.03ValGlu: 4.03 ± 0.801
3.526ValPhe: 3.526 ± 0.732
2.267ValGly: 2.267 ± 0.501
1.763ValHis: 1.763 ± 0.629
3.526ValIle: 3.526 ± 0.954
4.534ValLys: 4.534 ± 0.89
3.778ValLeu: 3.778 ± 0.808
1.008ValMet: 1.008 ± 0.307
2.519ValAsn: 2.519 ± 0.651
1.511ValPro: 1.511 ± 0.391
2.267ValGln: 2.267 ± 0.705
3.023ValArg: 3.023 ± 0.781
3.275ValSer: 3.275 ± 0.759
3.023ValThr: 3.023 ± 1.036
4.534ValVal: 4.534 ± 1.252
0.252ValTrp: 0.252 ± 0.78
2.519ValTyr: 2.519 ± 0.995
0.0ValXaa: 0.0 ± 0.0
Trp
0.504TrpAla: 0.504 ± 0.451
0.252TrpCys: 0.252 ± 0.226
0.252TrpAsp: 0.252 ± 0.226
1.008TrpGlu: 1.008 ± 0.26
1.259TrpPhe: 1.259 ± 0.454
1.008TrpGly: 1.008 ± 0.307
0.504TrpHis: 0.504 ± 0.451
1.259TrpIle: 1.259 ± 0.778
0.252TrpLys: 0.252 ± 0.78
0.756TrpLeu: 0.756 ± 0.703
0.504TrpMet: 0.504 ± 0.451
1.008TrpAsn: 1.008 ± 0.307
0.252TrpPro: 0.252 ± 0.226
1.008TrpGln: 1.008 ± 0.623
0.0TrpArg: 0.0 ± 0.0
0.756TrpSer: 0.756 ± 0.177
0.252TrpThr: 0.252 ± 0.156
0.756TrpVal: 0.756 ± 0.467
0.0TrpTrp: 0.0 ± 0.0
0.252TrpTyr: 0.252 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.756TyrAla: 0.756 ± 0.677
1.511TyrCys: 1.511 ± 1.353
1.259TyrAsp: 1.259 ± 0.454
2.015TyrGlu: 2.015 ± 0.911
2.015TyrPhe: 2.015 ± 1.245
1.763TyrGly: 1.763 ± 0.477
2.267TyrHis: 2.267 ± 0.81
3.526TyrIle: 3.526 ± 0.911
4.282TyrLys: 4.282 ± 1.153
3.275TyrLeu: 3.275 ± 1.029
1.511TyrMet: 1.511 ± 0.934
3.275TyrAsn: 3.275 ± 0.826
0.504TyrPro: 0.504 ± 0.13
1.763TyrGln: 1.763 ± 0.578
3.023TyrArg: 3.023 ± 0.651
2.015TyrSer: 2.015 ± 0.786
3.275TyrThr: 3.275 ± 0.704
1.763TyrVal: 1.763 ± 0.477
0.504TyrTrp: 0.504 ± 0.13
1.763TyrTyr: 1.763 ± 0.887
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3971 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski