Amino acid dipepetide frequency for Asystasia mosaic Madagascar virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.621AlaAla: 3.621 ± 0.965
1.207AlaCys: 1.207 ± 0.902
2.414AlaAsp: 2.414 ± 1.561
4.225AlaGlu: 4.225 ± 2.001
1.811AlaPhe: 1.811 ± 1.234
2.414AlaGly: 2.414 ± 0.945
0.604AlaHis: 0.604 ± 0.536
1.207AlaIle: 1.207 ± 1.072
3.621AlaLys: 3.621 ± 1.42
3.621AlaLeu: 3.621 ± 1.205
0.604AlaMet: 0.604 ± 0.526
1.811AlaAsn: 1.811 ± 0.905
1.811AlaPro: 1.811 ± 0.862
5.432AlaGln: 5.432 ± 1.598
5.432AlaArg: 5.432 ± 1.684
4.828AlaSer: 4.828 ± 2.363
6.035AlaThr: 6.035 ± 1.528
0.0AlaVal: 0.0 ± 0.0
1.207AlaTrp: 1.207 ± 0.75
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.604CysAla: 0.604 ± 0.526
0.0CysCys: 0.0 ± 0.0
0.604CysAsp: 0.604 ± 0.582
0.604CysGlu: 0.604 ± 0.645
0.0CysPhe: 0.0 ± 0.0
1.811CysGly: 1.811 ± 0.842
0.0CysHis: 0.0 ± 0.0
1.811CysIle: 1.811 ± 1.063
0.604CysLys: 0.604 ± 0.645
0.604CysLeu: 0.604 ± 0.536
1.207CysMet: 1.207 ± 0.891
2.414CysAsn: 2.414 ± 0.905
0.604CysPro: 0.604 ± 0.771
0.0CysGln: 0.0 ± 0.0
0.604CysArg: 0.604 ± 0.526
3.621CysSer: 3.621 ± 1.772
1.207CysThr: 1.207 ± 0.75
2.414CysVal: 2.414 ± 1.382
0.0CysTrp: 0.0 ± 0.0
1.207CysTyr: 1.207 ± 1.048
0.0CysXaa: 0.0 ± 0.0
Asp
3.018AspAla: 3.018 ± 1.403
0.604AspCys: 0.604 ± 0.536
1.811AspAsp: 1.811 ± 0.749
1.811AspGlu: 1.811 ± 0.842
3.018AspPhe: 3.018 ± 0.828
3.018AspGly: 3.018 ± 1.18
1.811AspHis: 1.811 ± 0.635
3.018AspIle: 3.018 ± 1.213
2.414AspLys: 2.414 ± 1.099
4.225AspLeu: 4.225 ± 1.257
0.0AspMet: 0.0 ± 0.0
2.414AspAsn: 2.414 ± 0.845
4.225AspPro: 4.225 ± 1.406
1.207AspGln: 1.207 ± 1.062
1.811AspArg: 1.811 ± 1.316
4.225AspSer: 4.225 ± 1.822
0.604AspThr: 0.604 ± 0.536
4.225AspVal: 4.225 ± 1.747
0.0AspTrp: 0.0 ± 0.0
1.207AspTyr: 1.207 ± 0.801
0.0AspXaa: 0.0 ± 0.0
Glu
2.414GluAla: 2.414 ± 0.945
0.0GluCys: 0.0 ± 0.0
1.811GluAsp: 1.811 ± 1.17
7.242GluGlu: 7.242 ± 2.45
2.414GluPhe: 2.414 ± 1.707
3.621GluGly: 3.621 ± 1.35
1.811GluHis: 1.811 ± 0.948
3.018GluIle: 3.018 ± 1.77
1.811GluLys: 1.811 ± 0.753
3.621GluLeu: 3.621 ± 1.36
0.604GluMet: 0.604 ± 0.526
3.621GluAsn: 3.621 ± 2.094
3.621GluPro: 3.621 ± 0.891
1.811GluGln: 1.811 ± 0.933
0.604GluArg: 0.604 ± 0.526
4.828GluSer: 4.828 ± 1.764
3.018GluThr: 3.018 ± 1.42
0.0GluVal: 0.0 ± 0.0
0.604GluTrp: 0.604 ± 0.56
2.414GluTyr: 2.414 ± 2.145
0.0GluXaa: 0.0 ± 0.0
Phe
1.207PheAla: 1.207 ± 0.526
0.0PheCys: 0.0 ± 0.0
3.621PheAsp: 3.621 ± 1.02
1.207PheGlu: 1.207 ± 0.791
3.621PhePhe: 3.621 ± 1.255
1.811PheGly: 1.811 ± 1.181
1.207PheHis: 1.207 ± 0.81
2.414PheIle: 2.414 ± 1.561
3.621PheLys: 3.621 ± 0.672
3.621PheLeu: 3.621 ± 2.041
2.414PheMet: 2.414 ± 0.928
4.828PheAsn: 4.828 ± 2.297
3.018PhePro: 3.018 ± 1.451
1.811PheGln: 1.811 ± 0.749
3.018PheArg: 3.018 ± 1.417
4.225PheSer: 4.225 ± 1.946
1.811PheThr: 1.811 ± 0.648
1.207PheVal: 1.207 ± 1.165
1.207PheTrp: 1.207 ± 0.811
1.207PheTyr: 1.207 ± 1.29
0.0PheXaa: 0.0 ± 0.0
Gly
1.811GlyAla: 1.811 ± 1.124
1.811GlyCys: 1.811 ± 1.123
2.414GlyAsp: 2.414 ± 0.923
4.225GlyGlu: 4.225 ± 1.423
3.018GlyPhe: 3.018 ± 1.824
2.414GlyGly: 2.414 ± 1.053
1.207GlyHis: 1.207 ± 0.681
3.018GlyIle: 3.018 ± 1.364
3.018GlyLys: 3.018 ± 1.837
3.621GlyLeu: 3.621 ± 0.952
1.811GlyMet: 1.811 ± 0.914
1.811GlyAsn: 1.811 ± 1.481
4.225GlyPro: 4.225 ± 2.015
4.225GlyGln: 4.225 ± 1.54
1.811GlyArg: 1.811 ± 0.635
4.225GlySer: 4.225 ± 2.014
2.414GlyThr: 2.414 ± 1.162
3.018GlyVal: 3.018 ± 1.18
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.604HisAla: 0.604 ± 0.645
0.604HisCys: 0.604 ± 0.56
1.207HisAsp: 1.207 ± 0.801
0.604HisGlu: 0.604 ± 0.56
1.207HisPhe: 1.207 ± 0.81
1.207HisGly: 1.207 ± 0.81
1.207HisHis: 1.207 ± 0.901
3.018HisIle: 3.018 ± 1.465
1.207HisLys: 1.207 ± 0.811
2.414HisLeu: 2.414 ± 1.11
0.0HisMet: 0.0 ± 0.0
3.018HisAsn: 3.018 ± 1.72
1.811HisPro: 1.811 ± 1.266
1.811HisGln: 1.811 ± 0.862
3.018HisArg: 3.018 ± 1.624
1.811HisSer: 1.811 ± 1.17
2.414HisThr: 2.414 ± 1.294
3.621HisVal: 3.621 ± 1.358
0.0HisTrp: 0.0 ± 0.0
1.811HisTyr: 1.811 ± 0.635
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.207IleCys: 1.207 ± 0.753
5.432IleAsp: 5.432 ± 1.982
3.621IleGlu: 3.621 ± 1.025
1.811IlePhe: 1.811 ± 1.068
0.0IleGly: 0.0 ± 0.0
2.414IleHis: 2.414 ± 1.738
5.432IleIle: 5.432 ± 2.216
5.432IleLys: 5.432 ± 1.619
4.225IleLeu: 4.225 ± 1.976
0.604IleMet: 0.604 ± 0.644
4.828IleAsn: 4.828 ± 2.574
2.414IlePro: 2.414 ± 1.384
3.018IleGln: 3.018 ± 0.891
5.432IleArg: 5.432 ± 1.98
7.846IleSer: 7.846 ± 1.534
3.018IleThr: 3.018 ± 1.202
2.414IleVal: 2.414 ± 0.923
1.207IleTrp: 1.207 ± 0.753
3.018IleTyr: 3.018 ± 1.908
0.0IleXaa: 0.0 ± 0.0
Lys
2.414LysAla: 2.414 ± 1.068
1.207LysCys: 1.207 ± 0.811
1.207LysAsp: 1.207 ± 1.165
4.828LysGlu: 4.828 ± 3.052
2.414LysPhe: 2.414 ± 0.75
4.225LysGly: 4.225 ± 1.127
1.811LysHis: 1.811 ± 0.648
2.414LysIle: 2.414 ± 0.752
3.621LysLys: 3.621 ± 1.824
2.414LysLeu: 2.414 ± 0.599
1.207LysMet: 1.207 ± 0.81
3.621LysAsn: 3.621 ± 1.607
2.414LysPro: 2.414 ± 0.599
2.414LysGln: 2.414 ± 0.912
4.225LysArg: 4.225 ± 0.884
4.225LysSer: 4.225 ± 1.109
4.225LysThr: 4.225 ± 1.635
3.621LysVal: 3.621 ± 1.091
0.0LysTrp: 0.0 ± 0.0
4.225LysTyr: 4.225 ± 0.917
0.0LysXaa: 0.0 ± 0.0
Leu
1.811LeuAla: 1.811 ± 1.127
3.018LeuCys: 3.018 ± 1.678
3.018LeuAsp: 3.018 ± 1.143
1.811LeuGlu: 1.811 ± 1.005
2.414LeuPhe: 2.414 ± 0.934
4.225LeuGly: 4.225 ± 1.58
2.414LeuHis: 2.414 ± 0.891
3.621LeuIle: 3.621 ± 1.109
3.018LeuLys: 3.018 ± 1.487
5.432LeuLeu: 5.432 ± 1.11
2.414LeuMet: 2.414 ± 1.288
4.225LeuAsn: 4.225 ± 1.912
1.207LeuPro: 1.207 ± 0.753
3.621LeuGln: 3.621 ± 1.166
7.846LeuArg: 7.846 ± 3.715
7.846LeuSer: 7.846 ± 1.925
3.018LeuThr: 3.018 ± 1.298
1.811LeuVal: 1.811 ± 0.842
0.604LeuTrp: 0.604 ± 0.526
2.414LeuTyr: 2.414 ± 1.142
0.0LeuXaa: 0.0 ± 0.0
Met
0.604MetAla: 0.604 ± 0.645
0.604MetCys: 0.604 ± 0.787
1.811MetAsp: 1.811 ± 1.316
0.604MetGlu: 0.604 ± 0.536
1.207MetPhe: 1.207 ± 0.811
4.225MetGly: 4.225 ± 1.786
0.0MetHis: 0.0 ± 0.0
0.604MetIle: 0.604 ± 0.526
2.414MetLys: 2.414 ± 0.719
1.811MetLeu: 1.811 ± 1.022
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.207MetPro: 1.207 ± 0.526
0.0MetGln: 0.0 ± 0.0
1.811MetArg: 1.811 ± 0.791
3.018MetSer: 3.018 ± 1.145
0.604MetThr: 0.604 ± 0.536
0.604MetVal: 0.604 ± 0.771
1.811MetTrp: 1.811 ± 0.862
1.811MetTyr: 1.811 ± 1.276
0.0MetXaa: 0.0 ± 0.0
Asn
6.035AsnAla: 6.035 ± 2.707
1.207AsnCys: 1.207 ± 0.935
2.414AsnAsp: 2.414 ± 0.94
1.207AsnGlu: 1.207 ± 0.749
1.207AsnPhe: 1.207 ± 0.811
1.207AsnGly: 1.207 ± 0.681
4.828AsnHis: 4.828 ± 2.888
7.242AsnIle: 7.242 ± 2.357
3.621AsnLys: 3.621 ± 1.846
3.018AsnLeu: 3.018 ± 1.165
1.811AsnMet: 1.811 ± 1.214
2.414AsnAsn: 2.414 ± 1.096
2.414AsnPro: 2.414 ± 0.719
2.414AsnGln: 2.414 ± 1.098
3.018AsnArg: 3.018 ± 0.676
3.621AsnSer: 3.621 ± 2.013
1.811AsnThr: 1.811 ± 1.551
6.035AsnVal: 6.035 ± 1.534
0.604AsnTrp: 0.604 ± 0.526
3.621AsnTyr: 3.621 ± 1.263
0.0AsnXaa: 0.0 ± 0.0
Pro
3.018ProAla: 3.018 ± 0.842
0.604ProCys: 0.604 ± 0.645
0.604ProAsp: 0.604 ± 0.582
1.811ProGlu: 1.811 ± 0.876
3.018ProPhe: 3.018 ± 1.12
3.018ProGly: 3.018 ± 1.298
3.018ProHis: 3.018 ± 1.89
3.018ProIle: 3.018 ± 1.333
4.225ProLys: 4.225 ± 2.464
1.811ProLeu: 1.811 ± 0.953
1.811ProMet: 1.811 ± 1.276
3.018ProAsn: 3.018 ± 1.678
1.207ProPro: 1.207 ± 0.753
3.621ProGln: 3.621 ± 1.692
4.828ProArg: 4.828 ± 1.441
3.621ProSer: 3.621 ± 1.176
3.018ProThr: 3.018 ± 1.533
2.414ProVal: 2.414 ± 1.053
0.604ProTrp: 0.604 ± 0.536
3.018ProTyr: 3.018 ± 0.786
0.0ProXaa: 0.0 ± 0.0
Gln
4.225GlnAla: 4.225 ± 1.701
1.207GlnCys: 1.207 ± 1.052
0.604GlnAsp: 0.604 ± 0.645
1.811GlnGlu: 1.811 ± 0.635
1.811GlnPhe: 1.811 ± 0.635
3.621GlnGly: 3.621 ± 1.175
1.811GlnHis: 1.811 ± 1.029
3.621GlnIle: 3.621 ± 1.796
0.604GlnLys: 0.604 ± 0.771
3.621GlnLeu: 3.621 ± 1.547
1.207GlnMet: 1.207 ± 0.865
3.621GlnAsn: 3.621 ± 1.216
1.811GlnPro: 1.811 ± 1.287
0.604GlnGln: 0.604 ± 0.582
2.414GlnArg: 2.414 ± 1.202
5.432GlnSer: 5.432 ± 1.213
3.018GlnThr: 3.018 ± 0.791
4.225GlnVal: 4.225 ± 1.822
0.604GlnTrp: 0.604 ± 0.526
0.604GlnTyr: 0.604 ± 0.526
0.0GlnXaa: 0.0 ± 0.0
Arg
4.225ArgAla: 4.225 ± 1.345
2.414ArgCys: 2.414 ± 1.539
5.432ArgAsp: 5.432 ± 1.839
1.811ArgGlu: 1.811 ± 0.832
6.035ArgPhe: 6.035 ± 2.255
1.811ArgGly: 1.811 ± 0.791
2.414ArgHis: 2.414 ± 0.869
2.414ArgIle: 2.414 ± 1.253
4.225ArgLys: 4.225 ± 1.537
6.035ArgLeu: 6.035 ± 1.986
1.811ArgMet: 1.811 ± 1.45
1.811ArgAsn: 1.811 ± 1.215
5.432ArgPro: 5.432 ± 1.4
4.225ArgGln: 4.225 ± 1.843
8.449ArgArg: 8.449 ± 3.456
5.432ArgSer: 5.432 ± 1.18
4.225ArgThr: 4.225 ± 1.049
4.225ArgVal: 4.225 ± 0.738
0.0ArgTrp: 0.0 ± 0.0
1.207ArgTyr: 1.207 ± 0.935
0.0ArgXaa: 0.0 ± 0.0
Ser
6.639SerAla: 6.639 ± 1.67
3.018SerCys: 3.018 ± 1.174
3.621SerAsp: 3.621 ± 1.684
0.604SerGlu: 0.604 ± 0.536
4.828SerPhe: 4.828 ± 2.72
2.414SerGly: 2.414 ± 1.189
1.207SerHis: 1.207 ± 0.778
6.035SerIle: 6.035 ± 2.067
4.828SerLys: 4.828 ± 1.233
4.225SerLeu: 4.225 ± 0.926
1.207SerMet: 1.207 ± 0.841
6.035SerAsn: 6.035 ± 1.602
4.225SerPro: 4.225 ± 1.082
6.035SerGln: 6.035 ± 2.919
5.432SerArg: 5.432 ± 1.673
15.088SerSer: 15.088 ± 2.931
7.242SerThr: 7.242 ± 1.754
10.26SerVal: 10.26 ± 3.398
0.0SerTrp: 0.0 ± 0.0
4.828SerTyr: 4.828 ± 1.073
0.0SerXaa: 0.0 ± 0.0
Thr
2.414ThrAla: 2.414 ± 0.858
0.604ThrCys: 0.604 ± 0.644
1.207ThrAsp: 1.207 ± 0.841
3.018ThrGlu: 3.018 ± 1.208
1.811ThrPhe: 1.811 ± 0.832
4.225ThrGly: 4.225 ± 0.738
2.414ThrHis: 2.414 ± 1.219
6.035ThrIle: 6.035 ± 1.972
2.414ThrLys: 2.414 ± 0.949
3.018ThrLeu: 3.018 ± 1.279
3.018ThrMet: 3.018 ± 1.105
3.018ThrAsn: 3.018 ± 0.911
3.018ThrPro: 3.018 ± 0.786
0.0ThrGln: 0.0 ± 0.0
5.432ThrArg: 5.432 ± 2.459
3.621ThrSer: 3.621 ± 1.967
2.414ThrThr: 2.414 ± 1.16
3.018ThrVal: 3.018 ± 1.876
1.207ThrTrp: 1.207 ± 0.841
2.414ThrTyr: 2.414 ± 1.116
0.0ThrXaa: 0.0 ± 0.0
Val
3.018ValAla: 3.018 ± 0.626
0.604ValCys: 0.604 ± 0.526
2.414ValAsp: 2.414 ± 1.038
3.621ValGlu: 3.621 ± 2.27
1.811ValPhe: 1.811 ± 0.87
2.414ValGly: 2.414 ± 1.33
1.811ValHis: 1.811 ± 1.022
2.414ValIle: 2.414 ± 0.996
4.225ValLys: 4.225 ± 1.551
6.035ValLeu: 6.035 ± 1.61
0.604ValMet: 0.604 ± 0.526
3.018ValAsn: 3.018 ± 0.987
4.828ValPro: 4.828 ± 0.912
3.018ValGln: 3.018 ± 1.136
5.432ValArg: 5.432 ± 2.555
6.035ValSer: 6.035 ± 3.156
1.207ValThr: 1.207 ± 0.753
3.018ValVal: 3.018 ± 1.26
1.207ValTrp: 1.207 ± 0.801
3.621ValTyr: 3.621 ± 1.885
0.0ValXaa: 0.0 ± 0.0
Trp
1.811TrpAla: 1.811 ± 0.905
0.0TrpCys: 0.0 ± 0.0
0.604TrpAsp: 0.604 ± 0.771
1.207TrpGlu: 1.207 ± 0.811
0.604TrpPhe: 0.604 ± 0.787
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.604TrpMet: 0.604 ± 0.645
0.604TrpAsn: 0.604 ± 0.56
0.0TrpPro: 0.0 ± 0.0
0.604TrpGln: 0.604 ± 0.526
0.604TrpArg: 0.604 ± 0.645
1.207TrpSer: 1.207 ± 0.753
0.604TrpThr: 0.604 ± 0.644
1.811TrpVal: 1.811 ± 0.749
0.0TrpTrp: 0.0 ± 0.0
0.604TrpTyr: 0.604 ± 0.526
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.414TyrAla: 2.414 ± 1.182
0.0TyrCys: 0.0 ± 0.0
2.414TyrAsp: 2.414 ± 1.899
3.018TyrGlu: 3.018 ± 1.14
3.018TyrPhe: 3.018 ± 0.911
2.414TyrGly: 2.414 ± 0.813
0.604TyrHis: 0.604 ± 0.526
3.018TyrIle: 3.018 ± 0.586
1.811TyrLys: 1.811 ± 0.635
2.414TyrLeu: 2.414 ± 1.249
1.207TyrMet: 1.207 ± 0.771
3.621TyrAsn: 3.621 ± 1.314
1.811TyrPro: 1.811 ± 0.635
0.604TyrGln: 0.604 ± 0.526
3.018TyrArg: 3.018 ± 1.466
3.018TyrSer: 3.018 ± 1.359
2.414TyrThr: 2.414 ± 0.75
2.414TyrVal: 2.414 ± 1.448
0.0TyrTrp: 0.0 ± 0.0
0.604TyrTyr: 0.604 ± 0.787
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1658 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski