Amino acid dipeptide frequency for Maize white line mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
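
As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and counting pairs only within each protein (never across protein boundaries) is an assumption; the page does not state how the database handles this.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is a list of one-letter amino acid sequences.
    Assumption: pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: residues (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real viral sequences): "MAAV" gives MA, AA, AV; "AVL" gives AV, VL.
freqs = dipeptide_frequencies(["MAAV", "AVL"])
print(freqs["AV"])  # 2 of the 5 dipeptides, i.e. 400.0 per mille
```

The result maps one-letter pairs such as "AV" to values on the same per-mille scale as the table below, which uses three-letter codes (AlaVal).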

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
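
The uniform expectation can be checked with a few lines of arithmetic. This is only a sketch: the helper names `expected_per_mille` and `classify` are hypothetical, and the page does not say whether the colouring threshold is based on 400 or 441 dipeptides, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"    # would be marked red
    if value_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


print(expected_per_mille(20))  # 2.5 per mille, i.e. 0.25%
print(classify(7.923))         # AlaAla from the table: "over"
print(classify(1.132))         # AlaHis from the table: "under"
```

With 21 symbols (including Xaa) the expectation drops to 1000/441, roughly 2.27‰.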

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
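
For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical and only illustrate the unit change.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 7.923 per mille in the table.
fraction = per_mille_to_fraction(7.923)  # approximately 0.007923
percent = per_mille_to_percent(7.923)    # approximately 0.7923
```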

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.923AlaAla: 7.923 ± 2.345
3.396AlaCys: 3.396 ± 1.78
2.83AlaAsp: 2.83 ± 0.957
2.83AlaGlu: 2.83 ± 0.753
2.83AlaPhe: 2.83 ± 1.164
7.357AlaGly: 7.357 ± 4.513
1.132AlaHis: 1.132 ± 0.565
4.527AlaIle: 4.527 ± 1.441
2.264AlaLys: 2.264 ± 0.535
11.885AlaLeu: 11.885 ± 3.975
1.132AlaMet: 1.132 ± 0.593
5.093AlaAsn: 5.093 ± 1.687
3.396AlaPro: 3.396 ± 1.084
0.0AlaGln: 0.0 ± 0.0
6.225AlaArg: 6.225 ± 1.789
5.093AlaSer: 5.093 ± 1.175
7.357AlaThr: 7.357 ± 2.488
12.45AlaVal: 12.45 ± 2.12
1.132AlaTrp: 1.132 ± 0.593
1.698AlaTyr: 1.698 ± 0.715
0.0AlaXaa: 0.0 ± 0.0
Cys
1.132CysAla: 1.132 ± 0.99
1.698CysCys: 1.698 ± 0.715
2.264CysAsp: 2.264 ± 0.88
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.264CysGly: 2.264 ± 1.618
0.0CysHis: 0.0 ± 0.0
0.566CysIle: 0.566 ± 0.668
0.0CysLys: 0.0 ± 0.0
6.225CysLeu: 6.225 ± 1.958
0.566CysMet: 0.566 ± 0.589
0.0CysAsn: 0.0 ± 0.0
2.264CysPro: 2.264 ± 1.097
1.132CysGln: 1.132 ± 0.798
2.264CysArg: 2.264 ± 0.951
1.132CysSer: 1.132 ± 0.565
2.83CysThr: 2.83 ± 1.268
3.962CysVal: 3.962 ± 0.982
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.489AspAla: 8.489 ± 2.547
2.83AspCys: 2.83 ± 0.796
2.83AspAsp: 2.83 ± 0.728
1.698AspGlu: 1.698 ± 0.715
0.0AspPhe: 0.0 ± 0.0
4.527AspGly: 4.527 ± 0.821
0.566AspHis: 0.566 ± 0.668
0.566AspIle: 0.566 ± 0.343
1.132AspLys: 1.132 ± 0.687
3.962AspLeu: 3.962 ± 2.175
1.132AspMet: 1.132 ± 0.565
1.132AspAsn: 1.132 ± 0.99
2.264AspPro: 2.264 ± 0.535
2.264AspGln: 2.264 ± 0.854
3.962AspArg: 3.962 ± 1.007
1.698AspSer: 1.698 ± 0.528
2.264AspThr: 2.264 ± 1.373
3.396AspVal: 3.396 ± 0.888
0.0AspTrp: 0.0 ± 0.0
1.698AspTyr: 1.698 ± 0.715
0.0AspXaa: 0.0 ± 0.0
Glu
1.698GluAla: 1.698 ± 0.718
0.0GluCys: 0.0 ± 0.0
3.396GluAsp: 3.396 ± 0.461
2.83GluGlu: 2.83 ± 1.239
2.264GluPhe: 2.264 ± 0.887
5.093GluGly: 5.093 ± 1.269
1.132GluHis: 1.132 ± 0.687
0.0GluIle: 0.0 ± 0.0
2.83GluLys: 2.83 ± 1.174
3.396GluLeu: 3.396 ± 1.679
1.132GluMet: 1.132 ± 0.593
0.566GluAsn: 0.566 ± 0.854
2.264GluPro: 2.264 ± 1.077
0.566GluGln: 0.566 ± 0.343
3.962GluArg: 3.962 ± 1.302
2.83GluSer: 2.83 ± 0.979
1.698GluThr: 1.698 ± 1.277
4.527GluVal: 4.527 ± 0.492
0.566GluTrp: 0.566 ± 0.668
3.396GluTyr: 3.396 ± 1.026
0.0GluXaa: 0.0 ± 0.0
Phe
2.83PheAla: 2.83 ± 0.985
3.962PheCys: 3.962 ± 0.668
2.83PheAsp: 2.83 ± 1.005
1.132PheGlu: 1.132 ± 1.335
0.566PhePhe: 0.566 ± 0.589
3.962PheGly: 3.962 ± 0.846
1.132PheHis: 1.132 ± 0.565
0.0PheIle: 0.0 ± 0.0
0.0PheLys: 0.0 ± 0.0
2.83PheLeu: 2.83 ± 0.978
0.0PheMet: 0.0 ± 0.0
1.698PheAsn: 1.698 ± 0.644
0.566PhePro: 0.566 ± 0.343
0.0PheGln: 0.0 ± 0.0
3.396PheArg: 3.396 ± 1.026
2.264PheSer: 2.264 ± 1.489
1.698PheThr: 1.698 ± 0.644
3.396PheVal: 3.396 ± 1.036
0.566PheTrp: 0.566 ± 0.343
0.566PheTyr: 0.566 ± 0.343
0.0PheXaa: 0.0 ± 0.0
Gly
5.093GlyAla: 5.093 ± 3.288
3.396GlyCys: 3.396 ± 1.534
5.093GlyAsp: 5.093 ± 1.086
2.264GlyGlu: 2.264 ± 1.843
2.264GlyPhe: 2.264 ± 2.354
9.621GlyGly: 9.621 ± 1.623
1.132GlyHis: 1.132 ± 1.106
2.83GlyIle: 2.83 ± 1.256
1.698GlyLys: 1.698 ± 1.188
7.923GlyLeu: 7.923 ± 2.281
0.566GlyMet: 0.566 ± 0.343
1.132GlyAsn: 1.132 ± 0.56
5.659GlyPro: 5.659 ± 0.454
2.83GlyGln: 2.83 ± 1.005
4.527GlyArg: 4.527 ± 0.974
6.791GlySer: 6.791 ± 0.882
6.225GlyThr: 6.225 ± 1.908
7.923GlyVal: 7.923 ± 2.552
1.698GlyTrp: 1.698 ± 0.884
2.83GlyTyr: 2.83 ± 0.96
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.132HisCys: 1.132 ± 0.56
0.0HisAsp: 0.0 ± 0.0
0.566HisGlu: 0.566 ± 0.854
1.132HisPhe: 1.132 ± 1.335
1.132HisGly: 1.132 ± 1.106
0.0HisHis: 0.0 ± 0.0
1.132HisIle: 1.132 ± 0.565
1.698HisLys: 1.698 ± 0.655
0.566HisLeu: 0.566 ± 0.343
0.566HisMet: 0.566 ± 0.559
0.566HisAsn: 0.566 ± 0.343
1.132HisPro: 1.132 ± 1.708
0.566HisGln: 0.566 ± 0.668
1.132HisArg: 1.132 ± 0.99
0.566HisSer: 0.566 ± 0.343
0.566HisThr: 0.566 ± 0.343
0.0HisVal: 0.0 ± 0.0
0.566HisTrp: 0.566 ± 0.343
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.264IleAla: 2.264 ± 1.489
2.264IleCys: 2.264 ± 1.07
3.396IleAsp: 3.396 ± 0.763
1.698IleGlu: 1.698 ± 0.655
0.566IlePhe: 0.566 ± 0.343
2.264IleGly: 2.264 ± 1.664
0.566IleHis: 0.566 ± 0.668
0.566IleIle: 0.566 ± 0.668
1.698IleLys: 1.698 ± 0.715
2.264IleLeu: 2.264 ± 0.733
1.132IleMet: 1.132 ± 0.687
0.566IleAsn: 0.566 ± 0.343
1.132IlePro: 1.132 ± 1.106
2.83IleGln: 2.83 ± 0.673
1.132IleArg: 1.132 ± 0.798
3.396IleSer: 3.396 ± 1.409
4.527IleThr: 4.527 ± 1.823
3.962IleVal: 3.962 ± 1.846
0.566IleTrp: 0.566 ± 0.589
0.566IleTyr: 0.566 ± 0.343
0.0IleXaa: 0.0 ± 0.0
Lys
5.659LysAla: 5.659 ± 1.21
0.0LysCys: 0.0 ± 0.0
0.566LysAsp: 0.566 ± 0.343
1.698LysGlu: 1.698 ± 0.884
1.132LysPhe: 1.132 ± 0.687
4.527LysGly: 4.527 ± 1.598
0.0LysHis: 0.0 ± 0.0
2.83LysIle: 2.83 ± 1.268
3.962LysLys: 3.962 ± 0.886
5.093LysLeu: 5.093 ± 0.862
0.566LysMet: 0.566 ± 0.343
0.566LysAsn: 0.566 ± 0.343
3.396LysPro: 3.396 ± 1.004
0.566LysGln: 0.566 ± 0.343
2.83LysArg: 2.83 ± 0.985
1.132LysSer: 1.132 ± 1.335
1.698LysThr: 1.698 ± 1.188
3.962LysVal: 3.962 ± 1.117
1.132LysTrp: 1.132 ± 0.593
0.566LysTyr: 0.566 ± 0.589
0.0LysXaa: 0.0 ± 0.0
Leu
11.319LeuAla: 11.319 ± 2.486
1.698LeuCys: 1.698 ± 0.908
4.527LeuAsp: 4.527 ± 0.881
5.093LeuGlu: 5.093 ± 1.075
2.264LeuPhe: 2.264 ± 0.61
4.527LeuGly: 4.527 ± 1.536
1.698LeuHis: 1.698 ± 0.858
3.962LeuIle: 3.962 ± 2.088
6.791LeuLys: 6.791 ± 2.249
3.396LeuLeu: 3.396 ± 1.429
2.264LeuMet: 2.264 ± 1.136
2.83LeuAsn: 2.83 ± 1.578
6.225LeuPro: 6.225 ± 1.811
1.132LeuGln: 1.132 ± 0.56
3.396LeuArg: 3.396 ± 1.716
4.527LeuSer: 4.527 ± 1.371
7.357LeuThr: 7.357 ± 1.815
10.753LeuVal: 10.753 ± 3.049
1.132LeuTrp: 1.132 ± 0.565
2.83LeuTyr: 2.83 ± 0.957
0.566LeuXaa: 0.566 ± 0.343
Met
4.527MetAla: 4.527 ± 0.821
0.566MetCys: 0.566 ± 0.668
0.0MetAsp: 0.0 ± 0.0
0.566MetGlu: 0.566 ± 0.343
0.0MetPhe: 0.0 ± 0.0
1.698MetGly: 1.698 ± 0.718
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
2.83MetLys: 2.83 ± 0.829
0.0MetLeu: 0.0 ± 0.0
0.566MetMet: 0.566 ± 0.668
1.698MetAsn: 1.698 ± 0.718
1.132MetPro: 1.132 ± 0.593
0.0MetGln: 0.0 ± 0.0
1.132MetArg: 1.132 ± 0.687
2.264MetSer: 2.264 ± 1.13
1.698MetThr: 1.698 ± 1.096
2.83MetVal: 2.83 ± 1.268
0.0MetTrp: 0.0 ± 0.0
0.566MetTyr: 0.566 ± 0.668
0.0MetXaa: 0.0 ± 0.0
Asn
1.698AsnAla: 1.698 ± 0.528
0.566AsnCys: 0.566 ± 0.343
1.132AsnAsp: 1.132 ± 0.798
1.132AsnGlu: 1.132 ± 0.798
1.132AsnPhe: 1.132 ± 0.832
1.132AsnGly: 1.132 ± 0.56
0.0AsnHis: 0.0 ± 0.0
1.132AsnIle: 1.132 ± 0.798
1.132AsnLys: 1.132 ± 0.593
0.566AsnLeu: 0.566 ± 0.668
0.566AsnMet: 0.566 ± 0.589
0.566AsnAsn: 0.566 ± 0.343
0.566AsnPro: 0.566 ± 0.589
1.132AsnGln: 1.132 ± 0.99
3.396AsnArg: 3.396 ± 1.572
2.83AsnSer: 2.83 ± 1.638
1.698AsnThr: 1.698 ± 1.096
3.396AsnVal: 3.396 ± 0.698
0.0AsnTrp: 0.0 ± 0.0
0.566AsnTyr: 0.566 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
7.923ProAla: 7.923 ± 1.433
0.0ProCys: 0.0 ± 0.0
2.83ProAsp: 2.83 ± 1.239
2.264ProGlu: 2.264 ± 1.618
0.566ProPhe: 0.566 ± 0.668
2.264ProGly: 2.264 ± 0.671
1.698ProHis: 1.698 ± 0.655
2.83ProIle: 2.83 ± 1.297
1.132ProLys: 1.132 ± 0.56
5.093ProLeu: 5.093 ± 0.744
0.0ProMet: 0.0 ± 0.0
1.132ProAsn: 1.132 ± 1.106
2.83ProPro: 2.83 ± 1.447
1.132ProGln: 1.132 ± 0.56
3.962ProArg: 3.962 ± 1.302
3.396ProSer: 3.396 ± 1.196
5.093ProThr: 5.093 ± 2.264
5.093ProVal: 5.093 ± 0.787
1.698ProTrp: 1.698 ± 1.752
1.132ProTyr: 1.132 ± 0.565
0.0ProXaa: 0.0 ± 0.0
Gln
1.132GlnAla: 1.132 ± 0.832
0.0GlnCys: 0.0 ± 0.0
1.132GlnAsp: 1.132 ± 0.593
1.132GlnGlu: 1.132 ± 0.687
0.566GlnPhe: 0.566 ± 0.668
2.83GlnGly: 2.83 ± 1.82
0.566GlnHis: 0.566 ± 0.343
1.698GlnIle: 1.698 ± 0.528
2.264GlnLys: 2.264 ± 0.88
1.698GlnLeu: 1.698 ± 0.655
1.132GlnMet: 1.132 ± 0.565
0.0GlnAsn: 0.0 ± 0.0
3.396GlnPro: 3.396 ± 1.196
0.0GlnGln: 0.0 ± 0.0
1.698GlnArg: 1.698 ± 0.715
0.0GlnSer: 0.0 ± 0.0
1.698GlnThr: 1.698 ± 1.386
2.83GlnVal: 2.83 ± 1.638
1.132GlnTrp: 1.132 ± 0.687
1.132GlnTyr: 1.132 ± 0.56
0.0GlnXaa: 0.0 ± 0.0
Arg
7.923ArgAla: 7.923 ± 1.602
0.566ArgCys: 0.566 ± 0.854
1.698ArgAsp: 1.698 ± 0.858
3.396ArgGlu: 3.396 ± 1.192
5.093ArgPhe: 5.093 ± 1.016
7.357ArgGly: 7.357 ± 2.028
0.566ArgHis: 0.566 ± 0.854
3.396ArgIle: 3.396 ± 1.139
3.396ArgLys: 3.396 ± 0.763
3.962ArgLeu: 3.962 ± 1.647
3.396ArgMet: 3.396 ± 1.004
1.132ArgAsn: 1.132 ± 0.593
3.962ArgPro: 3.962 ± 1.313
2.264ArgGln: 2.264 ± 0.535
3.396ArgArg: 3.396 ± 0.698
3.962ArgSer: 3.962 ± 0.886
7.357ArgThr: 7.357 ± 0.81
13.016ArgVal: 13.016 ± 1.98
0.566ArgTrp: 0.566 ± 0.343
2.83ArgTyr: 2.83 ± 0.979
0.0ArgXaa: 0.0 ± 0.0
Ser
5.659SerAla: 5.659 ± 0.567
0.566SerCys: 0.566 ± 0.343
1.698SerAsp: 1.698 ± 0.789
4.527SerGlu: 4.527 ± 0.716
2.264SerPhe: 2.264 ± 0.977
4.527SerGly: 4.527 ± 1.29
0.566SerHis: 0.566 ± 0.854
3.396SerIle: 3.396 ± 1.661
0.566SerLys: 0.566 ± 0.343
5.093SerLeu: 5.093 ± 1.527
2.264SerMet: 2.264 ± 0.61
1.132SerAsn: 1.132 ± 1.177
2.264SerPro: 2.264 ± 1.424
3.396SerGln: 3.396 ± 1.167
6.791SerArg: 6.791 ± 2.042
2.83SerSer: 2.83 ± 1.206
3.396SerThr: 3.396 ± 1.288
3.962SerVal: 3.962 ± 1.137
0.566SerTrp: 0.566 ± 0.854
3.396SerTyr: 3.396 ± 1.192
0.0SerXaa: 0.0 ± 0.0
Thr
5.659ThrAla: 5.659 ± 1.022
1.698ThrCys: 1.698 ± 0.528
3.962ThrAsp: 3.962 ± 1.004
1.698ThrGlu: 1.698 ± 0.644
4.527ThrPhe: 4.527 ± 1.503
5.093ThrGly: 5.093 ± 1.797
1.698ThrHis: 1.698 ± 1.172
1.698ThrIle: 1.698 ± 1.172
6.225ThrLys: 6.225 ± 1.151
8.489ThrLeu: 8.489 ± 3.317
1.698ThrMet: 1.698 ± 0.648
0.566ThrAsn: 0.566 ± 0.589
3.962ThrPro: 3.962 ± 1.655
0.0ThrGln: 0.0 ± 0.0
7.357ThrArg: 7.357 ± 1.394
5.093ThrSer: 5.093 ± 1.563
3.396ThrThr: 3.396 ± 0.92
6.791ThrVal: 6.791 ± 1.131
3.396ThrTrp: 3.396 ± 0.982
2.83ThrTyr: 2.83 ± 1.736
0.0ThrXaa: 0.0 ± 0.0
Val
7.357ValAla: 7.357 ± 1.804
1.132ValCys: 1.132 ± 0.593
4.527ValAsp: 4.527 ± 1.532
7.357ValGlu: 7.357 ± 2.005
5.093ValPhe: 5.093 ± 0.71
7.357ValGly: 7.357 ± 1.405
0.566ValHis: 0.566 ± 0.343
4.527ValIle: 4.527 ± 0.821
1.698ValLys: 1.698 ± 0.655
11.885ValLeu: 11.885 ± 2.934
2.83ValMet: 2.83 ± 0.621
0.566ValAsn: 0.566 ± 0.343
3.396ValPro: 3.396 ± 1.969
2.264ValGln: 2.264 ± 1.236
15.28ValArg: 15.28 ± 2.38
3.962ValSer: 3.962 ± 1.533
11.319ValThr: 11.319 ± 1.462
8.489ValVal: 8.489 ± 2.113
2.264ValTrp: 2.264 ± 1.187
3.396ValTyr: 3.396 ± 1.429
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.698TrpCys: 1.698 ± 0.528
1.698TrpAsp: 1.698 ± 0.908
0.566TrpGlu: 0.566 ± 0.343
1.132TrpPhe: 1.132 ± 0.56
0.566TrpGly: 0.566 ± 0.343
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
2.264TrpLeu: 2.264 ± 0.951
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.698TrpPro: 1.698 ± 0.908
1.132TrpGln: 1.132 ± 0.565
2.264TrpArg: 2.264 ± 1.07
0.566TrpSer: 0.566 ± 0.589
0.566TrpThr: 0.566 ± 0.343
3.396TrpVal: 3.396 ± 1.271
0.0TrpTrp: 0.0 ± 0.0
0.566TrpTyr: 0.566 ± 0.343
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.264TyrAla: 2.264 ± 0.854
1.132TyrCys: 1.132 ± 0.687
0.0TyrAsp: 0.0 ± 0.0
1.698TyrGlu: 1.698 ± 1.03
0.0TyrPhe: 0.0 ± 0.0
2.83TyrGly: 2.83 ± 0.673
0.0TyrHis: 0.0 ± 0.0
1.698TyrIle: 1.698 ± 0.715
0.566TyrLys: 0.566 ± 0.343
1.698TyrLeu: 1.698 ± 0.884
0.0TyrMet: 0.0 ± 0.0
3.396TyrAsn: 3.396 ± 1.004
0.566TyrPro: 0.566 ± 0.668
2.83TyrGln: 2.83 ± 0.462
1.132TyrArg: 1.132 ± 0.56
4.527TyrSer: 4.527 ± 1.441
3.396TyrThr: 3.396 ± 0.786
1.698TyrVal: 1.698 ± 1.766
1.132TyrTrp: 1.132 ± 0.593
2.264TyrTyr: 2.264 ± 0.671
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.566XaaGly: 0.566 ± 0.343
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1768 amino acids)
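
The table rows above follow a regular pattern (cell value, dipeptide name, then value ± error), so they can be read into a dictionary with a small parser. This is a sketch based only on the text layout shown on this page; the function names `parse_row` and `parse_table` are hypothetical, and the layout of the downloadable CSV file may differ.

```python
import re

# Pattern for rows such as "7.923AlaAla: 7.923 ± 2.345".
ROW = re.compile(
    r"(\d+(?:\.\d+)?)"                # cell value shown in the table
    r"([A-Z][a-z]{2})([A-Z][a-z]{2})"  # first and second residue
    r":\s*(\d+(?:\.\d+)?)"            # value repeated after the name
    r"\s*±\s*(\d+(?:\.\d+)?)"         # bootstrap error
)


def parse_row(line):
    """Return (first, second, value, error), or None for non-data lines."""
    match = ROW.fullmatch(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    _, first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def parse_table(lines):
    """Map (first, second) residue pairs to (value, error) in per mille."""
    rows = (parse_row(line) for line in lines)
    return {(r[0], r[1]): (r[2], r[3]) for r in rows if r is not None}


sample = ["Ala", "7.923AlaAla: 7.923 ± 2.345", "12.45AlaVal: 12.45 ± 2.12"]
table = parse_table(sample)
print(table[("Ala", "Val")])  # (12.45, 2.12)
```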

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
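
A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those values is reported. The page gives only the replicate count and the resampling level, so using the sample standard deviation as the error, the fixed seed, and the function names are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. "AV")."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.stdev(values)


# Toy sequences, not the real proteome.
error = bootstrap_error(["MAAV", "AVL", "GGAV"], "AV")
```

With only 5 proteins in this proteome, each replicate draws from very few units, which is one plausible reason the errors in the table are large relative to the values.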

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski