Amino acid dipeptide frequency for Lettuce virus X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
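The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the one-letter alphabet, and the choice to count overlapping pairs within each protein are not taken from the database, and the sequences are toy strings rather than the Lettuce virus X proteins.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes; 'X' stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X'.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping pairs, never spanning two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Toy sequences, not real proteome data.
toy = ["MAALK", "AAL"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform_expectation = 1000.0 / 400
```

With the toy input, six pairs are counted in total, so `freqs["AA"]` and `freqs["AL"]` each come out at one third of 1000‰, far above the uniform expectation of 2.5‰.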

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
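As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical, and treating 1/400 as the exact threshold for over- and underrepresentation is an assumption; the page does not state the rule behind the red and blue marking in more detail.

```python
# Uniform expectation over the 400 standard dipeptides (0.25%).
EXPECTED = 1 / 400

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value_per_mille, expected=EXPECTED):
    """Label a frequency relative to the assumed uniform expectation."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"

# Values copied from the table: AlaAla (9.664) and TrpTrp (0.42).
print(classify(9.664))  # over
print(classify(0.42))   # under
```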

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as frequency ± error and grouped by the first residue. Within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 9.664 ± 2.675
AlaCys: 0.0 ± 0.0
AlaAsp: 3.782 ± 2.364
AlaGlu: 3.361 ± 1.249
AlaPhe: 3.361 ± 1.548
AlaGly: 3.782 ± 1.277
AlaHis: 2.521 ± 0.783
AlaIle: 2.941 ± 2.863
AlaLys: 5.882 ± 1.428
AlaLeu: 8.403 ± 1.447
AlaMet: 2.941 ± 1.011
AlaAsn: 5.042 ± 1.381
AlaPro: 6.723 ± 4.179
AlaGln: 1.261 ± 1.236
AlaArg: 5.042 ± 1.4
AlaSer: 7.983 ± 1.994
AlaThr: 5.882 ± 1.586
AlaVal: 6.303 ± 1.927
AlaTrp: 0.84 ± 0.429
AlaTyr: 4.622 ± 1.099
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.84 ± 0.429
CysCys: 0.42 ± 1.198
CysAsp: 0.84 ± 1.389
CysGlu: 0.84 ± 0.429
CysPhe: 0.0 ± 0.0
CysGly: 1.261 ± 0.981
CysHis: 0.0 ± 0.0
CysIle: 0.42 ± 1.577
CysLys: 0.0 ± 0.0
CysLeu: 0.42 ± 0.695
CysMet: 0.42 ± 0.686
CysAsn: 0.42 ± 0.214
CysPro: 1.681 ± 1.476
CysGln: 0.84 ± 1.481
CysArg: 1.261 ± 1.646
CysSer: 2.521 ± 2.165
CysThr: 0.42 ± 0.695
CysVal: 0.84 ± 1.074
CysTrp: 0.0 ± 0.0
CysTyr: 0.84 ± 0.429
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.202 ± 2.014
AspCys: 0.84 ± 1.79
AspAsp: 2.521 ± 1.286
AspGlu: 2.521 ± 0.849
AspPhe: 2.521 ± 0.783
AspGly: 3.782 ± 1.209
AspHis: 1.261 ± 0.643
AspIle: 1.681 ± 0.591
AspLys: 1.681 ± 0.858
AspLeu: 7.563 ± 2.052
AspMet: 1.261 ± 0.643
AspAsn: 1.261 ± 0.643
AspPro: 5.462 ± 3.581
AspGln: 1.681 ± 0.928
AspArg: 1.261 ± 0.643
AspSer: 2.941 ± 1.501
AspThr: 2.521 ± 1.286
AspVal: 4.202 ± 1.383
AspTrp: 1.261 ± 0.643
AspTyr: 0.84 ± 0.429
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.303 ± 2.58
GluCys: 0.84 ± 0.429
GluAsp: 2.941 ± 0.957
GluGlu: 1.681 ± 0.858
GluPhe: 3.782 ± 1.026
GluGly: 1.681 ± 0.529
GluHis: 0.42 ± 1.198
GluIle: 1.681 ± 1.81
GluLys: 1.681 ± 0.858
GluLeu: 4.622 ± 1.54
GluMet: 0.42 ± 0.214
GluAsn: 2.941 ± 1.022
GluPro: 5.462 ± 2.161
GluGln: 0.84 ± 0.588
GluArg: 2.101 ± 1.072
GluSer: 3.361 ± 1.715
GluThr: 2.101 ± 1.072
GluVal: 2.101 ± 1.072
GluTrp: 0.84 ± 0.562
GluTyr: 1.261 ± 0.643
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.521 ± 1.004
PheCys: 2.101 ± 1.359
PheAsp: 3.361 ± 1.059
PheGlu: 3.361 ± 1.208
PhePhe: 3.782 ± 1.342
PheGly: 2.101 ± 1.041
PheHis: 1.681 ± 0.858
PheIle: 4.202 ± 1.353
PheLys: 0.42 ± 0.214
PheLeu: 5.042 ± 0.785
PheMet: 1.681 ± 0.687
PheAsn: 2.101 ± 1.072
PhePro: 2.941 ± 1.256
PheGln: 2.941 ± 0.892
PheArg: 1.261 ± 0.549
PheSer: 2.521 ± 1.099
PheThr: 1.681 ± 0.858
PheVal: 2.521 ± 1.004
PheTrp: 0.0 ± 0.0
PheTyr: 0.84 ± 0.588
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.361 ± 1.218
GlyCys: 1.261 ± 1.411
GlyAsp: 3.361 ± 1.176
GlyGlu: 1.261 ± 0.502
GlyPhe: 2.101 ± 0.924
GlyGly: 2.941 ± 0.918
GlyHis: 3.361 ± 1.53
GlyIle: 2.941 ± 1.417
GlyLys: 1.681 ± 0.591
GlyLeu: 3.361 ± 2.706
GlyMet: 0.84 ± 0.429
GlyAsn: 2.101 ± 1.081
GlyPro: 3.361 ± 0.991
GlyGln: 2.101 ± 0.699
GlyArg: 1.681 ± 0.529
GlySer: 3.361 ± 1.208
GlyThr: 3.361 ± 2.367
GlyVal: 2.101 ± 1.412
GlyTrp: 0.84 ± 0.429
GlyTyr: 1.261 ± 0.502
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.261 ± 0.981
HisCys: 0.84 ± 1.074
HisAsp: 1.261 ± 0.643
HisGlu: 1.681 ± 0.928
HisPhe: 1.681 ± 0.591
HisGly: 2.101 ± 0.642
HisHis: 2.101 ± 1.365
HisIle: 0.0 ± 0.0
HisLys: 1.681 ± 0.591
HisLeu: 4.202 ± 2.144
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.941 ± 1.45
HisGln: 3.361 ± 1.146
HisArg: 2.101 ± 1.072
HisSer: 1.681 ± 0.928
HisThr: 4.622 ± 3.77
HisVal: 1.261 ± 0.643
HisTrp: 0.0 ± 0.0
HisTyr: 0.84 ± 1.485
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.521 ± 1.843
IleCys: 0.42 ± 0.214
IleAsp: 0.42 ± 0.214
IleGlu: 3.361 ± 1.059
IlePhe: 2.521 ± 1.099
IleGly: 0.84 ± 1.532
IleHis: 2.521 ± 0.969
IleIle: 3.361 ± 2.932
IleLys: 0.84 ± 0.429
IleLeu: 4.202 ± 3.985
IleMet: 1.261 ± 0.643
IleAsn: 2.941 ± 1.283
IlePro: 2.521 ± 1.286
IleGln: 2.101 ± 0.633
IleArg: 2.941 ± 1.022
IleSer: 3.361 ± 1.269
IleThr: 4.202 ± 2.646
IleVal: 0.84 ± 0.429
IleTrp: 0.0 ± 0.0
IleTyr: 1.681 ± 2.147
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.622 ± 2.358
LysCys: 0.0 ± 0.0
LysAsp: 2.521 ± 1.286
LysGlu: 2.101 ± 1.072
LysPhe: 1.681 ± 1.476
LysGly: 0.84 ± 0.429
LysHis: 0.84 ± 0.429
LysIle: 1.681 ± 0.858
LysLys: 1.261 ± 0.549
LysLeu: 8.403 ± 1.173
LysMet: 2.101 ± 1.072
LysAsn: 1.261 ± 0.643
LysPro: 2.521 ± 0.849
LysGln: 0.84 ± 0.588
LysArg: 4.202 ± 1.482
LysSer: 2.101 ± 0.633
LysThr: 4.622 ± 1.804
LysVal: 4.622 ± 1.407
LysTrp: 0.42 ± 0.214
LysTyr: 0.84 ± 0.562
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.983 ± 3.839
LeuCys: 0.42 ± 1.198
LeuAsp: 5.882 ± 1.795
LeuGlu: 5.042 ± 1.773
LeuPhe: 5.042 ± 2.01
LeuGly: 4.202 ± 2.84
LeuHis: 2.521 ± 1.099
LeuIle: 2.521 ± 1.286
LeuLys: 9.244 ± 4.114
LeuLeu: 10.504 ± 4.824
LeuMet: 1.681 ± 1.469
LeuAsn: 3.361 ± 0.709
LeuPro: 6.303 ± 2.736
LeuGln: 4.202 ± 1.041
LeuArg: 3.782 ± 1.506
LeuSer: 6.303 ± 1.066
LeuThr: 6.723 ± 2.089
LeuVal: 5.462 ± 2.587
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.521 ± 1.286
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.101 ± 0.699
MetCys: 0.84 ± 0.429
MetAsp: 0.84 ± 0.429
MetGlu: 0.84 ± 0.562
MetPhe: 0.0 ± 0.0
MetGly: 0.84 ± 0.562
MetHis: 1.681 ± 0.858
MetIle: 0.84 ± 0.429
MetLys: 0.42 ± 0.214
MetLeu: 1.681 ± 0.529
MetMet: 0.0 ± 0.0
MetAsn: 0.42 ± 0.214
MetPro: 1.681 ± 0.928
MetGln: 1.261 ± 0.643
MetArg: 1.261 ± 0.643
MetSer: 2.101 ± 1.365
MetThr: 0.84 ± 0.429
MetVal: 0.42 ± 0.214
MetTrp: 1.261 ± 0.502
MetTyr: 0.84 ± 1.373
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.622 ± 1.919
AsnCys: 1.681 ± 0.591
AsnAsp: 2.101 ± 1.072
AsnGlu: 2.521 ± 0.849
AsnPhe: 2.521 ± 0.783
AsnGly: 2.521 ± 2.821
AsnHis: 2.521 ± 2.165
AsnIle: 1.261 ± 0.502
AsnLys: 1.681 ± 0.858
AsnLeu: 2.521 ± 0.783
AsnMet: 0.0 ± 0.0
AsnAsn: 1.681 ± 0.529
AsnPro: 3.782 ± 1.383
AsnGln: 1.681 ± 1.168
AsnArg: 2.101 ± 0.633
AsnSer: 2.941 ± 2.147
AsnThr: 3.782 ± 0.846
AsnVal: 5.042 ± 1.754
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.261 ± 0.643
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.824 ± 4.28
ProCys: 1.681 ± 1.476
ProAsp: 4.622 ± 1.706
ProGlu: 4.622 ± 1.747
ProPhe: 2.941 ± 1.056
ProGly: 4.202 ± 1.482
ProHis: 2.101 ± 1.041
ProIle: 2.101 ± 0.699
ProLys: 4.202 ± 1.602
ProLeu: 5.882 ± 2.799
ProMet: 0.42 ± 0.214
ProAsn: 5.042 ± 4.531
ProPro: 7.563 ± 3.402
ProGln: 3.361 ± 1.269
ProArg: 3.782 ± 1.506
ProSer: 4.622 ± 2.945
ProThr: 7.563 ± 1.403
ProVal: 3.782 ± 1.147
ProTrp: 2.101 ± 2.884
ProTyr: 2.521 ± 0.849
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.042 ± 1.588
GlnCys: 0.0 ± 0.0
GlnAsp: 2.941 ± 2.103
GlnGlu: 1.261 ± 0.643
GlnPhe: 1.261 ± 0.502
GlnGly: 1.261 ± 0.643
GlnHis: 1.261 ± 1.319
GlnIle: 2.941 ± 0.918
GlnLys: 0.84 ± 0.429
GlnLeu: 2.941 ± 1.339
GlnMet: 1.681 ± 0.858
GlnAsn: 0.42 ± 0.214
GlnPro: 3.361 ± 1.269
GlnGln: 2.101 ± 0.924
GlnArg: 1.681 ± 1.49
GlnSer: 3.361 ± 1.146
GlnThr: 4.202 ± 1.602
GlnVal: 2.941 ± 1.835
GlnTrp: 0.84 ± 1.074
GlnTyr: 0.84 ± 0.588
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.782 ± 1.147
ArgCys: 0.42 ± 1.198
ArgAsp: 3.782 ± 1.929
ArgGlu: 1.261 ± 0.502
ArgPhe: 2.101 ± 0.642
ArgGly: 1.261 ± 1.411
ArgHis: 0.84 ± 0.588
ArgIle: 3.361 ± 1.3
ArgLys: 0.84 ± 0.429
ArgLeu: 2.941 ± 1.121
ArgMet: 1.261 ± 0.643
ArgAsn: 3.782 ± 1.929
ArgPro: 5.042 ± 1.735
ArgGln: 4.202 ± 2.809
ArgArg: 2.941 ± 1.693
ArgSer: 2.941 ± 2.037
ArgThr: 5.462 ± 1.462
ArgVal: 2.521 ± 1.661
ArgTrp: 0.42 ± 0.214
ArgTyr: 2.101 ± 1.072
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.462 ± 2.143
SerCys: 0.0 ± 0.0
SerAsp: 3.361 ± 1.269
SerGlu: 2.521 ± 0.783
SerPhe: 2.941 ± 1.022
SerGly: 4.202 ± 2.904
SerHis: 1.261 ± 0.643
SerIle: 2.941 ± 1.022
SerLys: 2.941 ± 1.056
SerLeu: 5.462 ± 2.961
SerMet: 1.261 ± 1.236
SerAsn: 4.622 ± 1.15
SerPro: 4.202 ± 1.021
SerGln: 2.521 ± 1.099
SerArg: 4.202 ± 1.237
SerSer: 6.723 ± 4.489
SerThr: 6.723 ± 2.188
SerVal: 3.782 ± 1.32
SerTrp: 1.261 ± 3.303
SerTyr: 2.521 ± 1.26
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.042 ± 2.198
ThrCys: 1.261 ± 1.319
ThrAsp: 2.941 ± 1.573
ThrGlu: 5.042 ± 2.573
ThrPhe: 5.462 ± 1.462
ThrGly: 4.622 ± 1.101
ThrHis: 3.782 ± 1.342
ThrIle: 3.361 ± 2.721
ThrLys: 5.042 ± 0.785
ThrLeu: 6.723 ± 1.779
ThrMet: 1.261 ± 0.643
ThrAsn: 2.101 ± 0.924
ThrPro: 9.244 ± 2.252
ThrGln: 2.101 ± 0.633
ThrArg: 5.462 ± 1.572
ThrSer: 3.782 ± 1.402
ThrThr: 4.202 ± 1.603
ThrVal: 2.941 ± 1.022
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.521 ± 1.298
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.782 ± 2.672
ValCys: 0.84 ± 1.481
ValAsp: 1.261 ± 0.643
ValGlu: 2.101 ± 1.072
ValPhe: 1.681 ± 1.518
ValGly: 3.361 ± 1.146
ValHis: 1.261 ± 0.981
ValIle: 2.521 ± 1.661
ValLys: 5.462 ± 1.425
ValLeu: 5.882 ± 1.372
ValMet: 0.84 ± 0.429
ValAsn: 2.941 ± 1.009
ValPro: 4.622 ± 0.741
ValGln: 2.941 ± 0.957
ValArg: 2.521 ± 0.98
ValSer: 2.941 ± 1.835
ValThr: 5.462 ± 1.439
ValVal: 3.361 ± 1.218
ValTrp: 0.42 ± 0.686
ValTyr: 2.521 ± 1.286
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.101 ± 1.412
TrpCys: 0.0 ± 0.0
TrpAsp: 0.42 ± 0.686
TrpGlu: 0.84 ± 0.429
TrpPhe: 0.42 ± 0.686
TrpGly: 0.0 ± 0.0
TrpHis: 0.42 ± 0.214
TrpIle: 0.42 ± 0.214
TrpLys: 0.42 ± 0.214
TrpLeu: 1.261 ± 0.643
TrpMet: 0.0 ± 0.0
TrpAsn: 1.681 ± 1.518
TrpPro: 0.84 ± 1.481
TrpGln: 0.0 ± 0.0
TrpArg: 0.42 ± 1.577
TrpSer: 0.84 ± 0.429
TrpThr: 0.0 ± 0.0
TrpVal: 0.42 ± 0.214
TrpTrp: 0.42 ± 0.214
TrpTyr: 0.42 ± 1.198
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 6.303 ± 1.425
TyrCys: 0.42 ± 0.695
TyrAsp: 2.101 ± 0.642
TyrGlu: 0.84 ± 0.429
TyrPhe: 1.681 ± 0.529
TyrGly: 0.84 ± 0.429
TyrHis: 1.261 ± 1.411
TyrIle: 1.681 ± 0.858
TyrLys: 1.261 ± 0.643
TyrLeu: 1.681 ± 0.858
TyrMet: 0.42 ± 0.214
TyrAsn: 2.101 ± 1.041
TyrPro: 1.681 ± 1.814
TyrGln: 0.84 ± 1.074
TyrArg: 1.261 ± 0.549
TyrSer: 2.521 ± 1.26
TyrThr: 2.521 ± 0.849
TyrVal: 1.261 ± 0.502
TyrTrp: 0.42 ± 0.214
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2381 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
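To illustrate what a protein-level bootstrap looks like, here is a minimal sketch. It assumes that whole proteins are resampled with replacement and that the error is the standard deviation of the frequency across replicates; the exact procedure used by the database is not described on this page, so both points are assumptions, as are the function names and the toy sequences.

```python
import random
import statistics
from collections import Counter

def pair_frequencies(proteins):
    """Per mille frequency of each overlapping residue pair in the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one pair's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)

# Toy one-letter sequences, not the five Lettuce virus X proteins.
toy = ["MAALKAA", "GAALG", "MKKLA", "AAAA", "LLGAL"]
err = bootstrap_error(toy, "AA")
```

With only five proteins, each replicate can differ a lot from the original proteome, which is one plausible reason why some errors in the table exceed the frequency they belong to.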

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski