Amino acid dipeptide frequency for Luffa yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
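To make the counting concrete, here is a minimal Python sketch of how such frequencies could be computed and compared with the 2.5‰ expectation. It is an illustration under assumptions, not the database's own code: the function names, the toy sequences, the use of one-letter codes (with X standing for Xaa), and the choice to count overlapping pairs within each protein and normalise by the total number of pairs are all hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X (Xaa) collects non-standard/unknown residues
EXPECTED_PER_MILLE = 1000.0 / 400   # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: pairs are counted with overlapping windows inside each
    protein (never across two proteins) and normalised by the total count.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard ones to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(freq):
    """Classify a frequency relative to the uniform 2.5 per mille expectation."""
    if freq > EXPECTED_PER_MILLE:
        return "over"    # would be marked red
    if freq < EXPECTED_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"


# Hypothetical toy input, not the real proteome.
toy_proteins = ["MKVLAAGA", "AAGKU"]
freqs = dipeptide_frequencies(toy_proteins)
print(len(freqs))                                  # 441
print(round(freqs["AA"], 3), label(freqs["AA"]))   # 181.818 over
```

With only 11 dipeptides in the toy input, every observed pair is far above 2.5‰; on a real proteome the values spread around the expectation as in the table.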

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 2.875‰ corresponds to 0.002875 (0.2875%).

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error. Rows: first residue; columns: second residue.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.875 ± 2.338 | 1.15 ± 0.704 | 2.3 ± 1.193 | 2.3 ± 0.748 | 0.575 ± 0.57 | 3.45 ± 1.112 | 0.575 ± 0.57 | 2.875 ± 1.025 | 2.875 ± 0.838 | 5.175 ± 1.786 | 0.575 ± 0.57 | 0.575 ± 0.468 | 2.3 ± 1.011 | 2.875 ± 0.835 | 2.875 ± 1.381 | 3.45 ± 2.183 | 2.3 ± 1.879 | 2.875 ± 0.894 | 1.15 ± 0.614 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Cys | 0.575 ± 0.61 | 0.0 ± 0.0 | 0.575 ± 0.57 | 1.15 ± 0.735 | 0.0 ± 0.0 | 2.3 ± 1.4 | 0.575 ± 0.712 | 1.725 ± 0.838 | 1.15 ± 0.728 | 0.0 ± 0.0 | 1.725 ± 1.168 | 2.875 ± 1.025 | 3.45 ± 1.83 | 0.575 ± 0.468 | 1.15 ± 0.75 | 3.45 ± 1.469 | 1.725 ± 0.796 | 0.575 ± 0.571 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 3.45 ± 1.798 | 0.0 ± 0.0 | 1.15 ± 0.645 | 3.45 ± 0.9 | 1.725 ± 0.911 | 2.875 ± 1.395 | 1.15 ± 0.853 | 2.875 ± 1.166 | 2.3 ± 0.629 | 3.45 ± 1.407 | 1.15 ± 0.684 | 2.3 ± 0.897 | 1.725 ± 0.76 | 1.725 ± 1.115 | 3.45 ± 1.064 | 3.45 ± 1.364 | 1.15 ± 0.943 | 6.325 ± 1.948 | 1.725 ± 1.078 | 1.15 ± 0.935 | 0.0 ± 0.0 |
| Glu | 2.875 ± 1.408 | 0.0 ± 0.0 | 0.575 ± 0.561 | 5.175 ± 3.156 | 2.875 ± 1.558 | 2.3 ± 0.987 | 2.3 ± 1.167 | 4.025 ± 2.617 | 2.875 ± 1.88 | 2.3 ± 0.955 | 0.0 ± 0.0 | 2.875 ± 1.591 | 1.15 ± 0.614 | 2.3 ± 1.254 | 1.725 ± 1.217 | 4.6 ± 1.628 | 1.15 ± 0.909 | 4.025 ± 1.137 | 0.575 ± 0.712 | 3.45 ± 1.478 | 0.0 ± 0.0 |
| Phe | 1.725 ± 0.795 | 1.15 ± 0.728 | 2.3 ± 1.228 | 1.725 ± 0.786 | 0.575 ± 0.571 | 2.3 ± 1.649 | 2.3 ± 1.176 | 2.875 ± 1.302 | 3.45 ± 1.435 | 3.45 ± 1.337 | 1.15 ± 0.71 | 1.725 ± 1.071 | 2.3 ± 0.934 | 3.45 ± 1.88 | 1.15 ± 0.61 | 3.45 ± 1.847 | 2.3 ± 1.195 | 2.875 ± 1.068 | 0.575 ± 0.542 | 0.575 ± 0.571 | 0.0 ± 0.0 |
| Gly | 1.15 ± 0.935 | 1.15 ± 0.837 | 2.3 ± 1.068 | 1.15 ± 0.773 | 0.575 ± 0.712 | 2.875 ± 0.783 | 2.3 ± 1.074 | 2.3 ± 1.175 | 5.75 ± 2.711 | 8.051 ± 2.467 | 0.575 ± 0.506 | 2.3 ± 0.936 | 3.45 ± 1.167 | 1.725 ± 0.942 | 1.725 ± 0.978 | 5.75 ± 1.618 | 4.6 ± 2.139 | 4.025 ± 1.863 | 0.0 ± 0.0 | 2.3 ± 1.212 | 0.0 ± 0.0 |
| His | 1.725 ± 1.089 | 1.725 ± 0.97 | 3.45 ± 1.722 | 0.575 ± 0.641 | 2.3 ± 1.0 | 2.3 ± 1.371 | 1.15 ± 0.808 | 2.3 ± 1.179 | 1.15 ± 0.927 | 2.3 ± 1.163 | 0.575 ± 0.468 | 3.45 ± 1.398 | 2.3 ± 1.43 | 1.725 ± 1.048 | 3.45 ± 1.386 | 2.875 ± 1.202 | 1.725 ± 1.186 | 3.45 ± 1.81 | 0.0 ± 0.0 | 1.15 ± 0.61 | 0.0 ± 0.0 |
| Ile | 1.15 ± 0.735 | 1.15 ± 0.677 | 5.175 ± 2.117 | 4.025 ± 1.614 | 5.175 ± 1.798 | 2.875 ± 1.115 | 1.15 ± 0.61 | 2.875 ± 1.065 | 4.025 ± 0.912 | 5.75 ± 2.354 | 0.0 ± 0.0 | 1.725 ± 0.966 | 4.025 ± 0.96 | 2.875 ± 1.221 | 4.025 ± 1.192 | 4.6 ± 1.271 | 4.6 ± 1.793 | 2.875 ± 1.328 | 2.3 ± 1.371 | 2.875 ± 1.511 | 0.0 ± 0.0 |
| Lys | 2.3 ± 0.718 | 2.3 ± 0.989 | 2.3 ± 0.926 | 3.45 ± 1.723 | 2.3 ± 1.11 | 4.025 ± 1.599 | 2.3 ± 0.989 | 4.6 ± 1.593 | 2.3 ± 1.497 | 4.025 ± 1.43 | 0.575 ± 0.542 | 5.175 ± 1.284 | 1.725 ± 0.656 | 2.3 ± 0.629 | 3.45 ± 1.87 | 4.6 ± 0.997 | 4.025 ± 1.165 | 2.875 ± 1.676 | 0.575 ± 0.571 | 2.875 ± 1.123 | 0.0 ± 0.0 |
| Leu | 1.15 ± 0.771 | 1.725 ± 1.065 | 5.175 ± 1.651 | 4.025 ± 1.756 | 2.875 ± 1.113 | 3.45 ± 1.184 | 4.6 ± 1.276 | 3.45 ± 0.944 | 4.025 ± 1.089 | 1.15 ± 0.732 | 2.3 ± 0.902 | 4.6 ± 1.414 | 3.45 ± 1.815 | 0.575 ± 0.641 | 7.476 ± 2.314 | 5.75 ± 1.689 | 4.6 ± 0.757 | 2.875 ± 1.758 | 0.575 ± 0.57 | 4.025 ± 1.614 | 0.0 ± 0.0 |
| Met | 0.575 ± 0.571 | 1.15 ± 0.735 | 0.575 ± 0.571 | 0.0 ± 0.0 | 0.575 ± 0.571 | 1.725 ± 1.056 | 1.15 ± 0.735 | 1.725 ± 0.907 | 2.3 ± 1.089 | 2.3 ± 1.009 | 0.0 ± 0.0 | 0.575 ± 0.571 | 2.3 ± 1.176 | 0.575 ± 0.712 | 1.15 ± 0.75 | 2.875 ± 1.258 | 0.575 ± 0.61 | 0.575 ± 0.542 | 1.15 ± 0.732 | 1.15 ± 1.142 | 0.0 ± 0.0 |
| Asn | 5.175 ± 1.215 | 2.3 ± 0.936 | 2.3 ± 1.563 | 1.725 ± 0.664 | 2.3 ± 0.836 | 1.15 ± 0.701 | 2.3 ± 1.286 | 5.175 ± 1.474 | 0.575 ± 0.468 | 4.6 ± 1.38 | 2.875 ± 1.844 | 3.45 ± 1.032 | 2.875 ± 0.618 | 2.3 ± 0.732 | 5.75 ± 1.979 | 4.6 ± 2.252 | 3.45 ± 1.637 | 4.025 ± 1.378 | 0.0 ± 0.0 | 2.3 ± 0.736 | 0.0 ± 0.0 |
| Pro | 1.725 ± 1.152 | 1.15 ± 0.894 | 1.15 ± 0.894 | 1.725 ± 0.951 | 2.875 ± 1.244 | 1.725 ± 1.157 | 4.025 ± 1.196 | 3.45 ± 1.135 | 3.45 ± 1.862 | 4.025 ± 1.239 | 0.575 ± 0.571 | 4.6 ± 1.56 | 1.725 ± 0.74 | 1.725 ± 1.445 | 5.75 ± 1.041 | 2.875 ± 1.099 | 2.875 ± 0.98 | 4.025 ± 1.458 | 1.15 ± 0.61 | 2.3 ± 0.774 | 0.0 ± 0.0 |
| Gln | 1.725 ± 0.944 | 0.575 ± 0.542 | 1.15 ± 0.614 | 1.725 ± 0.87 | 2.875 ± 2.338 | 1.15 ± 0.645 | 1.15 ± 1.139 | 2.3 ± 1.258 | 2.3 ± 1.026 | 1.15 ± 0.61 | 0.0 ± 0.0 | 1.15 ± 0.943 | 2.875 ± 2.224 | 1.725 ± 0.656 | 0.575 ± 0.468 | 4.025 ± 1.448 | 4.025 ± 1.284 | 5.175 ± 0.781 | 0.575 ± 0.468 | 1.15 ± 0.645 | 0.0 ± 0.0 |
| Arg | 2.3 ± 1.193 | 2.875 ± 1.325 | 4.6 ± 1.014 | 2.3 ± 0.836 | 4.025 ± 2.331 | 3.45 ± 1.009 | 2.875 ± 0.885 | 2.875 ± 0.933 | 4.025 ± 1.817 | 2.3 ± 1.079 | 1.15 ± 1.142 | 4.6 ± 1.623 | 6.325 ± 1.379 | 1.15 ± 0.943 | 2.875 ± 1.729 | 7.476 ± 1.571 | 6.325 ± 2.453 | 6.325 ± 1.97 | 0.575 ± 0.561 | 2.3 ± 1.11 | 0.0 ± 0.0 |
| Ser | 4.6 ± 1.456 | 2.875 ± 1.271 | 3.45 ± 1.237 | 3.45 ± 1.117 | 1.725 ± 0.656 | 3.45 ± 0.874 | 0.575 ± 0.571 | 6.325 ± 1.72 | 5.75 ± 1.87 | 4.6 ± 1.63 | 1.725 ± 1.112 | 6.325 ± 1.395 | 4.025 ± 1.285 | 4.025 ± 1.464 | 8.051 ± 2.388 | 8.051 ± 2.494 | 5.175 ± 1.05 | 6.901 ± 1.89 | 1.15 ± 0.704 | 3.45 ± 1.398 | 0.0 ± 0.0 |
| Thr | 2.875 ± 0.821 | 0.0 ± 0.0 | 1.725 ± 0.917 | 2.875 ± 0.618 | 2.3 ± 0.748 | 6.325 ± 2.304 | 2.875 ± 1.103 | 1.725 ± 0.74 | 3.45 ± 1.241 | 2.875 ± 1.487 | 1.725 ± 0.857 | 4.025 ± 1.594 | 2.875 ± 1.494 | 2.3 ± 1.04 | 4.025 ± 1.576 | 5.75 ± 1.851 | 1.15 ± 0.808 | 4.025 ± 1.485 | 1.15 ± 0.677 | 3.45 ± 1.034 | 0.0 ± 0.0 |
| Val | 1.15 ± 0.909 | 2.3 ± 0.826 | 2.875 ± 1.586 | 5.175 ± 2.837 | 3.45 ± 1.357 | 4.025 ± 2.672 | 4.025 ± 2.26 | 4.025 ± 0.952 | 4.6 ± 0.938 | 5.75 ± 2.485 | 3.45 ± 1.154 | 4.6 ± 1.58 | 3.45 ± 1.241 | 2.3 ± 0.986 | 6.325 ± 2.166 | 2.875 ± 1.244 | 4.025 ± 2.015 | 4.6 ± 1.762 | 1.725 ± 0.683 | 2.875 ± 1.622 | 0.0 ± 0.0 |
| Trp | 2.3 ± 1.011 | 0.0 ± 0.0 | 1.15 ± 0.943 | 0.575 ± 0.61 | 0.0 ± 0.0 | 1.15 ± 0.732 | 0.575 ± 0.571 | 0.575 ± 0.641 | 0.575 ± 0.561 | 0.575 ± 0.561 | 0.575 ± 0.571 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.575 ± 0.468 | 2.875 ± 0.835 | 0.575 ± 0.542 | 1.15 ± 0.773 | 1.15 ± 0.645 | 0.0 ± 0.0 | 0.575 ± 0.468 | 0.0 ± 0.0 |
| Tyr | 1.725 ± 1.19 | 0.0 ± 0.0 | 2.3 ± 1.193 | 0.575 ± 0.571 | 2.3 ± 0.897 | 1.15 ± 0.645 | 1.725 ± 0.948 | 5.175 ± 1.722 | 1.15 ± 0.61 | 4.025 ± 1.577 | 1.725 ± 0.95 | 2.3 ± 0.982 | 0.575 ± 0.561 | 0.575 ± 0.571 | 2.875 ± 1.385 | 4.6 ± 2.44 | 1.15 ± 0.704 | 4.025 ± 2.062 | 0.0 ± 0.0 | 2.3 ± 0.968 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 8 proteins (1740 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
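A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is only an illustration under assumptions. The page does not say which spread measure is reported, so the sample standard deviation is used here; the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 inside each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the
    replicate estimates; the source does not specify the statistic.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy input, not the real proteome.
toy_proteins = ["MKVLAAGA", "AAGKW", "GGSLL"]
print(dipeptide_per_mille(toy_proteins, "AA"), "±", bootstrap_error(toy_proteins, "AA"))
print(bootstrap_error(toy_proteins, "WW"))  # 0.0: the pair never occurs
```

A dipeptide that is absent from every protein gets 0.0 in every replicate, which matches the "0.0 ± 0.0" entries in the table. With only 8 proteins in the proteome, resampling at the protein level gives relatively wide errors.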

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski