Amino acid dipepetide frequency for West African Asystasia virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.486AlaAla: 3.486 ± 0.93
1.162AlaCys: 1.162 ± 0.686
4.648AlaAsp: 4.648 ± 0.967
1.743AlaGlu: 1.743 ± 0.699
1.743AlaPhe: 1.743 ± 1.126
1.743AlaGly: 1.743 ± 0.83
1.162AlaHis: 1.162 ± 0.757
1.162AlaIle: 1.162 ± 0.844
3.486AlaLys: 3.486 ± 0.904
6.973AlaLeu: 6.973 ± 1.347
0.581AlaMet: 0.581 ± 0.452
2.905AlaAsn: 2.905 ± 0.643
1.743AlaPro: 1.743 ± 1.286
2.324AlaGln: 2.324 ± 0.914
5.811AlaArg: 5.811 ± 1.669
7.554AlaSer: 7.554 ± 1.915
2.905AlaThr: 2.905 ± 1.5
0.581AlaVal: 0.581 ± 0.631
1.162AlaTrp: 1.162 ± 0.643
1.162AlaTyr: 1.162 ± 0.643
0.0AlaXaa: 0.0 ± 0.0
Cys
1.162CysAla: 1.162 ± 0.707
0.0CysCys: 0.0 ± 0.0
0.581CysAsp: 0.581 ± 0.573
0.581CysGlu: 0.581 ± 0.552
0.0CysPhe: 0.0 ± 0.0
1.743CysGly: 1.743 ± 0.943
0.0CysHis: 0.0 ± 0.0
1.162CysIle: 1.162 ± 1.104
1.162CysLys: 1.162 ± 1.104
2.324CysLeu: 2.324 ± 0.914
1.162CysMet: 1.162 ± 0.716
1.162CysAsn: 1.162 ± 0.502
1.162CysPro: 1.162 ± 0.714
0.581CysGln: 0.581 ± 0.573
0.581CysArg: 0.581 ± 0.452
2.905CysSer: 2.905 ± 1.287
2.324CysThr: 2.324 ± 0.79
1.162CysVal: 1.162 ± 0.659
0.581CysTrp: 0.581 ± 0.631
0.581CysTyr: 0.581 ± 0.605
0.0CysXaa: 0.0 ± 0.0
Asp
2.324AspAla: 2.324 ± 0.868
0.581AspCys: 0.581 ± 0.487
1.743AspAsp: 1.743 ± 0.697
1.162AspGlu: 1.162 ± 0.686
2.324AspPhe: 2.324 ± 0.635
2.324AspGly: 2.324 ± 1.655
2.905AspHis: 2.905 ± 0.825
4.648AspIle: 4.648 ± 1.576
1.162AspLys: 1.162 ± 0.643
2.905AspLeu: 2.905 ± 0.858
0.0AspMet: 0.0 ± 0.0
1.743AspAsn: 1.743 ± 0.843
5.811AspPro: 5.811 ± 1.561
1.743AspGln: 1.743 ± 0.847
2.905AspArg: 2.905 ± 1.157
4.648AspSer: 4.648 ± 1.527
1.743AspThr: 1.743 ± 0.759
4.648AspVal: 4.648 ± 1.941
0.0AspTrp: 0.0 ± 0.0
1.162AspTyr: 1.162 ± 0.757
0.0AspXaa: 0.0 ± 0.0
Glu
4.067GluAla: 4.067 ± 1.153
0.581GluCys: 0.581 ± 0.564
2.324GluAsp: 2.324 ± 0.871
3.486GluGlu: 3.486 ± 1.644
1.743GluPhe: 1.743 ± 1.043
4.648GluGly: 4.648 ± 0.949
0.581GluHis: 0.581 ± 0.487
2.324GluIle: 2.324 ± 1.172
1.743GluLys: 1.743 ± 0.822
3.486GluLeu: 3.486 ± 1.722
0.581GluMet: 0.581 ± 0.564
2.324GluAsn: 2.324 ± 1.722
2.324GluPro: 2.324 ± 0.865
2.905GluGln: 2.905 ± 0.856
0.0GluArg: 0.0 ± 0.0
3.486GluSer: 3.486 ± 1.392
2.905GluThr: 2.905 ± 1.089
2.324GluVal: 2.324 ± 1.044
1.162GluTrp: 1.162 ± 0.708
2.905GluTyr: 2.905 ± 1.565
0.0GluXaa: 0.0 ± 0.0
Phe
1.162PheAla: 1.162 ± 0.502
0.581PheCys: 0.581 ± 0.552
2.905PheAsp: 2.905 ± 1.512
2.324PheGlu: 2.324 ± 1.145
3.486PhePhe: 3.486 ± 0.96
1.162PheGly: 1.162 ± 0.719
0.0PheHis: 0.0 ± 0.0
2.324PheIle: 2.324 ± 1.353
2.905PheLys: 2.905 ± 0.898
4.648PheLeu: 4.648 ± 1.784
0.581PheMet: 0.581 ± 0.452
3.486PheAsn: 3.486 ± 1.259
2.905PhePro: 2.905 ± 1.116
1.743PheGln: 1.743 ± 0.79
2.324PheArg: 2.324 ± 1.212
1.162PheSer: 1.162 ± 0.974
3.486PheThr: 3.486 ± 1.27
1.162PheVal: 1.162 ± 0.72
0.581PheTrp: 0.581 ± 0.487
1.162PheTyr: 1.162 ± 1.104
0.0PheXaa: 0.0 ± 0.0
Gly
1.743GlyAla: 1.743 ± 0.958
2.905GlyCys: 2.905 ± 1.45
4.648GlyAsp: 4.648 ± 0.806
4.067GlyGlu: 4.067 ± 1.439
1.162GlyPhe: 1.162 ± 0.795
3.486GlyGly: 3.486 ± 0.762
0.581GlyHis: 0.581 ± 0.452
4.067GlyIle: 4.067 ± 1.121
2.324GlyLys: 2.324 ± 1.141
0.581GlyLeu: 0.581 ± 0.568
1.162GlyMet: 1.162 ± 0.587
3.486GlyAsn: 3.486 ± 1.083
4.648GlyPro: 4.648 ± 1.999
2.905GlyGln: 2.905 ± 1.068
1.743GlyArg: 1.743 ± 0.87
3.486GlySer: 3.486 ± 1.6
4.648GlyThr: 4.648 ± 1.607
3.486GlyVal: 3.486 ± 0.909
0.0GlyTrp: 0.0 ± 0.0
0.581GlyTyr: 0.581 ± 0.487
0.0GlyXaa: 0.0 ± 0.0
His
0.581HisAla: 0.581 ± 0.552
0.581HisCys: 0.581 ± 0.568
1.162HisAsp: 1.162 ± 0.733
1.743HisGlu: 1.743 ± 1.334
0.581HisPhe: 0.581 ± 0.452
2.324HisGly: 2.324 ± 1.049
1.162HisHis: 1.162 ± 1.135
3.486HisIle: 3.486 ± 1.112
1.162HisLys: 1.162 ± 1.209
4.648HisLeu: 4.648 ± 1.408
0.581HisMet: 0.581 ± 0.605
4.648HisAsn: 4.648 ± 1.538
0.581HisPro: 0.581 ± 0.452
2.324HisGln: 2.324 ± 0.721
2.905HisArg: 2.905 ± 1.377
1.743HisSer: 1.743 ± 1.126
2.324HisThr: 2.324 ± 1.274
2.905HisVal: 2.905 ± 0.952
0.0HisTrp: 0.0 ± 0.0
1.743HisTyr: 1.743 ± 0.583
0.0HisXaa: 0.0 ± 0.0
Ile
0.581IleAla: 0.581 ± 0.487
1.743IleCys: 1.743 ± 0.87
4.067IleAsp: 4.067 ± 1.714
2.324IleGlu: 2.324 ± 1.004
2.324IlePhe: 2.324 ± 0.865
1.743IleGly: 1.743 ± 1.054
3.486IleHis: 3.486 ± 1.399
2.905IleIle: 2.905 ± 1.051
7.554IleLys: 7.554 ± 1.386
4.067IleLeu: 4.067 ± 1.976
0.581IleMet: 0.581 ± 0.487
4.067IleAsn: 4.067 ± 1.945
1.743IlePro: 1.743 ± 1.356
2.905IleGln: 2.905 ± 1.386
6.392IleArg: 6.392 ± 1.608
7.554IleSer: 7.554 ± 1.055
3.486IleThr: 3.486 ± 1.507
3.486IleVal: 3.486 ± 1.046
2.324IleTrp: 2.324 ± 0.869
1.162IleTyr: 1.162 ± 0.757
0.0IleXaa: 0.0 ± 0.0
Lys
2.324LysAla: 2.324 ± 1.084
2.905LysCys: 2.905 ± 0.905
1.743LysAsp: 1.743 ± 0.958
5.23LysGlu: 5.23 ± 2.045
1.743LysPhe: 1.743 ± 0.748
2.905LysGly: 2.905 ± 1.072
2.905LysHis: 2.905 ± 0.512
4.067LysIle: 4.067 ± 0.861
2.905LysLys: 2.905 ± 1.635
2.905LysLeu: 2.905 ± 0.922
0.0LysMet: 0.0 ± 0.0
2.324LysAsn: 2.324 ± 0.865
1.743LysPro: 1.743 ± 0.549
0.581LysGln: 0.581 ± 0.552
3.486LysArg: 3.486 ± 1.497
2.905LysSer: 2.905 ± 0.643
1.162LysThr: 1.162 ± 0.643
2.905LysVal: 2.905 ± 0.962
0.581LysTrp: 0.581 ± 0.452
4.067LysTyr: 4.067 ± 1.128
0.0LysXaa: 0.0 ± 0.0
Leu
2.905LeuAla: 2.905 ± 1.266
0.581LeuCys: 0.581 ± 0.452
3.486LeuAsp: 3.486 ± 1.461
5.23LeuGlu: 5.23 ± 1.36
0.581LeuPhe: 0.581 ± 0.552
4.648LeuGly: 4.648 ± 1.088
5.23LeuHis: 5.23 ± 1.398
5.23LeuIle: 5.23 ± 1.079
3.486LeuLys: 3.486 ± 1.456
6.392LeuLeu: 6.392 ± 2.543
1.743LeuMet: 1.743 ± 1.022
3.486LeuAsn: 3.486 ± 1.022
2.905LeuPro: 2.905 ± 0.842
4.648LeuGln: 4.648 ± 1.626
5.23LeuArg: 5.23 ± 1.879
4.067LeuSer: 4.067 ± 1.125
2.905LeuThr: 2.905 ± 0.868
1.162LeuVal: 1.162 ± 1.104
0.0LeuTrp: 0.0 ± 0.0
4.067LeuTyr: 4.067 ± 1.359
0.0LeuXaa: 0.0 ± 0.0
Met
1.162MetAla: 1.162 ± 0.758
1.162MetCys: 1.162 ± 0.661
2.324MetAsp: 2.324 ± 0.794
0.581MetGlu: 0.581 ± 0.487
1.162MetPhe: 1.162 ± 0.828
2.324MetGly: 2.324 ± 1.106
0.581MetHis: 0.581 ± 0.564
0.0MetIle: 0.0 ± 0.0
1.743MetLys: 1.743 ± 0.549
1.743MetLeu: 1.743 ± 0.984
0.0MetMet: 0.0 ± 0.0
0.581MetAsn: 0.581 ± 0.605
1.743MetPro: 1.743 ± 1.043
0.0MetGln: 0.0 ± 0.0
1.743MetArg: 1.743 ± 1.101
1.743MetSer: 1.743 ± 0.798
1.162MetThr: 1.162 ± 0.974
0.0MetVal: 0.0 ± 0.0
0.581MetTrp: 0.581 ± 0.573
1.743MetTyr: 1.743 ± 1.168
0.0MetXaa: 0.0 ± 0.0
Asn
5.23AsnAla: 5.23 ± 1.854
0.581AsnCys: 0.581 ± 0.568
1.162AsnAsp: 1.162 ± 0.686
0.581AsnGlu: 0.581 ± 0.552
1.743AsnPhe: 1.743 ± 0.766
0.581AsnGly: 0.581 ± 0.605
4.067AsnHis: 4.067 ± 2.112
6.973AsnIle: 6.973 ± 1.967
1.162AsnLys: 1.162 ± 0.974
2.324AsnLeu: 2.324 ± 1.172
2.324AsnMet: 2.324 ± 1.022
3.486AsnAsn: 3.486 ± 1.394
2.905AsnPro: 2.905 ± 0.833
2.324AsnGln: 2.324 ± 0.809
3.486AsnArg: 3.486 ± 1.882
1.743AsnSer: 1.743 ± 1.265
2.324AsnThr: 2.324 ± 0.916
7.554AsnVal: 7.554 ± 1.851
0.581AsnTrp: 0.581 ± 0.452
3.486AsnTyr: 3.486 ± 1.34
0.0AsnXaa: 0.0 ± 0.0
Pro
3.486ProAla: 3.486 ± 1.433
0.581ProCys: 0.581 ± 0.552
1.162ProAsp: 1.162 ± 0.904
1.743ProGlu: 1.743 ± 1.218
2.324ProPhe: 2.324 ± 1.135
3.486ProGly: 3.486 ± 1.217
1.743ProHis: 1.743 ± 1.356
4.067ProIle: 4.067 ± 0.995
1.743ProLys: 1.743 ± 0.958
3.486ProLeu: 3.486 ± 1.163
1.162ProMet: 1.162 ± 0.719
1.162ProAsn: 1.162 ± 0.571
3.486ProPro: 3.486 ± 1.471
5.23ProGln: 5.23 ± 1.516
5.23ProArg: 5.23 ± 1.879
7.554ProSer: 7.554 ± 0.957
5.23ProThr: 5.23 ± 2.111
4.067ProVal: 4.067 ± 1.239
0.581ProTrp: 0.581 ± 0.487
4.067ProTyr: 4.067 ± 1.039
0.0ProXaa: 0.0 ± 0.0
Gln
5.811GlnAla: 5.811 ± 1.378
2.324GlnCys: 2.324 ± 1.449
1.743GlnAsp: 1.743 ± 0.654
2.324GlnGlu: 2.324 ± 1.137
1.743GlnPhe: 1.743 ± 0.822
1.743GlnGly: 1.743 ± 0.83
2.324GlnHis: 2.324 ± 1.37
3.486GlnIle: 3.486 ± 1.423
1.162GlnLys: 1.162 ± 0.714
0.581GlnLeu: 0.581 ± 0.452
1.162GlnMet: 1.162 ± 0.844
2.324GlnAsn: 2.324 ± 0.865
2.324GlnPro: 2.324 ± 1.258
0.0GlnGln: 0.0 ± 0.0
1.743GlnArg: 1.743 ± 0.697
4.648GlnSer: 4.648 ± 1.153
1.743GlnThr: 1.743 ± 0.732
4.067GlnVal: 4.067 ± 1.353
0.0GlnTrp: 0.0 ± 0.0
1.162GlnTyr: 1.162 ± 0.75
0.0GlnXaa: 0.0 ± 0.0
Arg
3.486ArgAla: 3.486 ± 0.945
2.324ArgCys: 2.324 ± 1.14
5.23ArgAsp: 5.23 ± 2.524
3.486ArgGlu: 3.486 ± 1.356
4.648ArgPhe: 4.648 ± 1.273
2.905ArgGly: 2.905 ± 0.962
1.162ArgHis: 1.162 ± 0.716
2.905ArgIle: 2.905 ± 1.175
2.324ArgLys: 2.324 ± 1.161
5.23ArgLeu: 5.23 ± 2.248
1.162ArgMet: 1.162 ± 1.128
4.067ArgAsn: 4.067 ± 1.316
7.554ArgPro: 7.554 ± 1.219
2.324ArgGln: 2.324 ± 1.421
10.459ArgArg: 10.459 ± 3.834
5.811ArgSer: 5.811 ± 2.349
1.743ArgThr: 1.743 ± 0.936
4.648ArgVal: 4.648 ± 1.621
0.0ArgTrp: 0.0 ± 0.0
2.324ArgTyr: 2.324 ± 0.853
0.0ArgXaa: 0.0 ± 0.0
Ser
5.811SerAla: 5.811 ± 1.821
0.581SerCys: 0.581 ± 0.573
1.743SerAsp: 1.743 ± 0.549
1.743SerGlu: 1.743 ± 1.049
6.392SerPhe: 6.392 ± 1.297
4.067SerGly: 4.067 ± 1.444
1.743SerHis: 1.743 ± 0.834
3.486SerIle: 3.486 ± 1.037
5.23SerLys: 5.23 ± 1.251
5.811SerLeu: 5.811 ± 2.028
4.648SerMet: 4.648 ± 1.781
5.811SerAsn: 5.811 ± 1.148
4.648SerPro: 4.648 ± 1.104
2.324SerGln: 2.324 ± 1.135
6.392SerArg: 6.392 ± 1.11
15.689SerSer: 15.689 ± 3.736
8.135SerThr: 8.135 ± 2.206
7.554SerVal: 7.554 ± 2.415
0.581SerTrp: 0.581 ± 0.631
2.324SerTyr: 2.324 ± 0.809
0.0SerXaa: 0.0 ± 0.0
Thr
1.743ThrAla: 1.743 ± 0.829
0.581ThrCys: 0.581 ± 0.552
0.581ThrAsp: 0.581 ± 0.487
1.162ThrGlu: 1.162 ± 0.757
2.905ThrPhe: 2.905 ± 1.286
5.811ThrGly: 5.811 ± 1.44
3.486ThrHis: 3.486 ± 1.406
5.811ThrIle: 5.811 ± 1.78
2.324ThrLys: 2.324 ± 0.72
2.905ThrLeu: 2.905 ± 1.602
0.581ThrMet: 0.581 ± 0.536
2.905ThrAsn: 2.905 ± 0.896
6.392ThrPro: 6.392 ± 1.754
1.162ThrGln: 1.162 ± 0.804
5.23ThrArg: 5.23 ± 1.59
9.297ThrSer: 9.297 ± 1.771
4.067ThrThr: 4.067 ± 1.496
1.162ThrVal: 1.162 ± 0.701
1.743ThrTrp: 1.743 ± 0.898
1.743ThrTyr: 1.743 ± 0.87
0.0ThrXaa: 0.0 ± 0.0
Val
1.743ValAla: 1.743 ± 0.775
0.581ValCys: 0.581 ± 0.452
2.324ValAsp: 2.324 ± 1.015
2.324ValGlu: 2.324 ± 1.476
1.162ValPhe: 1.162 ± 0.719
1.743ValGly: 1.743 ± 0.858
2.324ValHis: 2.324 ± 0.981
4.067ValIle: 4.067 ± 1.399
4.648ValLys: 4.648 ± 1.428
4.648ValLeu: 4.648 ± 1.62
1.162ValMet: 1.162 ± 0.645
2.324ValAsn: 2.324 ± 1.681
4.648ValPro: 4.648 ± 1.042
3.486ValGln: 3.486 ± 1.305
4.648ValArg: 4.648 ± 2.821
4.648ValSer: 4.648 ± 1.676
3.486ValThr: 3.486 ± 1.613
1.743ValVal: 1.743 ± 1.151
1.743ValTrp: 1.743 ± 1.254
4.648ValTyr: 4.648 ± 1.269
0.0ValXaa: 0.0 ± 0.0
Trp
2.905TrpAla: 2.905 ± 1.091
0.0TrpCys: 0.0 ± 0.0
0.581TrpAsp: 0.581 ± 0.573
1.743TrpGlu: 1.743 ± 1.263
0.581TrpPhe: 0.581 ± 0.631
0.0TrpGly: 0.0 ± 0.0
0.581TrpHis: 0.581 ± 0.568
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.581TrpMet: 0.581 ± 0.552
0.581TrpAsn: 0.581 ± 0.536
0.0TrpPro: 0.0 ± 0.0
0.581TrpGln: 0.581 ± 0.452
0.0TrpArg: 0.0 ± 0.0
1.162TrpSer: 1.162 ± 0.659
1.162TrpThr: 1.162 ± 0.848
0.581TrpVal: 0.581 ± 0.564
0.0TrpTrp: 0.0 ± 0.0
1.162TrpTyr: 1.162 ± 0.75
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.324TyrAla: 2.324 ± 0.532
0.581TyrCys: 0.581 ± 0.573
2.324TyrAsp: 2.324 ± 1.257
2.324TyrGlu: 2.324 ± 0.885
1.743TyrPhe: 1.743 ± 0.549
2.324TyrGly: 2.324 ± 1.144
1.162TyrHis: 1.162 ± 0.904
2.324TyrIle: 2.324 ± 0.864
1.162TyrLys: 1.162 ± 0.502
2.324TyrLeu: 2.324 ± 1.021
1.743TyrMet: 1.743 ± 1.217
1.743TyrAsn: 1.743 ± 0.869
2.324TyrPro: 2.324 ± 0.72
2.324TyrGln: 2.324 ± 1.001
3.486TyrArg: 3.486 ± 1.782
2.905TyrSer: 2.905 ± 0.92
4.648TyrThr: 4.648 ± 1.139
2.905TyrVal: 2.905 ± 1.68
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1722 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski