Amino acid dipeptide frequency for Blackberry yellow vein-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
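
As an illustration of the calculation described above, the following is a minimal Python sketch, not the code used to produce this table. It assumes one-letter sequences, counts overlapping adjacent pairs within each protein, maps any non-standard letter to X (Xaa), and normalizes by the total number of counted pairs; the database may normalize differently. The names dipeptide_per_mille and classify are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands in for Xaa (non-standard or unknown)
# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(sequences):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Assumption: overlapping pairs are counted within each protein only,
    and frequencies are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        # Replace anything outside the 20 standard letters with X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy_proteins = ["MKLSLLKV", "MSLKAA"]  # made-up sequences, not from this proteome
    freqs = dipeptide_per_mille(toy_proteins)
    for pair in ("SL", "LK", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

In this sketch, "over" and "under" correspond to the red and blue markings described above.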

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 2.057‰ corresponds to 0.002057 (0.2057%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.057AlaAla: 2.057 ± 1.083
0.735AlaCys: 0.735 ± 0.261
2.645AlaAsp: 2.645 ± 0.311
2.204AlaGlu: 2.204 ± 0.609
3.086AlaPhe: 3.086 ± 0.837
3.086AlaGly: 3.086 ± 0.534
0.294AlaHis: 0.294 ± 0.114
2.792AlaIle: 2.792 ± 0.287
5.143AlaLys: 5.143 ± 0.674
4.409AlaLeu: 4.409 ± 1.127
1.176AlaMet: 1.176 ± 0.258
1.616AlaAsn: 1.616 ± 0.521
1.176AlaPro: 1.176 ± 0.37
1.616AlaGln: 1.616 ± 0.464
0.735AlaArg: 0.735 ± 0.265
2.645AlaSer: 2.645 ± 0.429
1.323AlaThr: 1.323 ± 1.208
2.351AlaVal: 2.351 ± 0.303
0.0AlaTrp: 0.0 ± 0.0
1.029AlaTyr: 1.029 ± 0.47
0.0AlaXaa: 0.0 ± 0.0
Cys
0.735CysAla: 0.735 ± 0.221
0.294CysCys: 0.294 ± 0.114
0.882CysAsp: 0.882 ± 0.343
1.176CysGlu: 1.176 ± 0.364
1.176CysPhe: 1.176 ± 0.471
1.029CysGly: 1.029 ± 0.346
0.294CysHis: 0.294 ± 0.114
1.763CysIle: 1.763 ± 0.539
2.204CysLys: 2.204 ± 0.524
2.939CysLeu: 2.939 ± 0.65
0.882CysMet: 0.882 ± 0.206
1.323CysAsn: 1.323 ± 0.764
0.882CysPro: 0.882 ± 0.341
0.441CysGln: 0.441 ± 0.194
0.588CysArg: 0.588 ± 0.238
2.498CysSer: 2.498 ± 0.541
1.323CysThr: 1.323 ± 0.185
1.029CysVal: 1.029 ± 0.198
0.588CysTrp: 0.588 ± 0.227
1.616CysTyr: 1.616 ± 0.371
0.0CysXaa: 0.0 ± 0.0
Asp
1.763AspAla: 1.763 ± 0.531
0.882AspCys: 0.882 ± 0.226
3.38AspAsp: 3.38 ± 0.571
4.115AspGlu: 4.115 ± 0.682
4.996AspPhe: 4.996 ± 1.022
3.086AspGly: 3.086 ± 0.406
0.441AspHis: 0.441 ± 0.351
3.821AspIle: 3.821 ± 1.08
4.262AspLys: 4.262 ± 0.635
6.907AspLeu: 6.907 ± 0.951
1.763AspMet: 1.763 ± 0.548
3.968AspAsn: 3.968 ± 0.731
0.588AspPro: 0.588 ± 0.362
1.323AspGln: 1.323 ± 0.385
2.792AspArg: 2.792 ± 0.63
3.674AspSer: 3.674 ± 0.734
3.233AspThr: 3.233 ± 0.506
7.494AspVal: 7.494 ± 0.702
0.441AspTrp: 0.441 ± 0.351
2.645AspTyr: 2.645 ± 0.678
0.0AspXaa: 0.0 ± 0.0
Glu
1.763GluAla: 1.763 ± 0.316
1.763GluCys: 1.763 ± 0.491
3.674GluAsp: 3.674 ± 0.702
3.968GluGlu: 3.968 ± 0.598
2.204GluPhe: 2.204 ± 0.77
2.939GluGly: 2.939 ± 0.364
1.91GluHis: 1.91 ± 0.52
4.409GluIle: 4.409 ± 0.701
4.849GluLys: 4.849 ± 1.289
3.674GluLeu: 3.674 ± 0.726
1.616GluMet: 1.616 ± 0.562
3.674GluAsn: 3.674 ± 0.508
0.882GluPro: 0.882 ± 0.37
1.323GluGln: 1.323 ± 0.422
2.057GluArg: 2.057 ± 0.718
5.143GluSer: 5.143 ± 0.569
2.792GluThr: 2.792 ± 0.52
4.262GluVal: 4.262 ± 0.247
0.147GluTrp: 0.147 ± 0.158
3.086GluTyr: 3.086 ± 1.017
0.0GluXaa: 0.0 ± 0.0
Phe
1.029PheAla: 1.029 ± 0.285
2.351PheCys: 2.351 ± 0.725
4.262PheAsp: 4.262 ± 0.759
2.792PheGlu: 2.792 ± 0.951
4.996PhePhe: 4.996 ± 1.497
3.968PheGly: 3.968 ± 0.54
0.0PheHis: 0.0 ± 0.0
2.645PheIle: 2.645 ± 0.523
5.731PheLys: 5.731 ± 1.008
4.702PheLeu: 4.702 ± 1.353
1.763PheMet: 1.763 ± 0.433
4.849PheAsn: 4.849 ± 1.085
1.763PhePro: 1.763 ± 0.92
1.763PheGln: 1.763 ± 0.598
2.498PheArg: 2.498 ± 0.667
6.319PheSer: 6.319 ± 0.857
4.702PheThr: 4.702 ± 1.16
4.555PheVal: 4.555 ± 0.98
0.588PheTrp: 0.588 ± 0.642
1.323PheTyr: 1.323 ± 0.355
0.0PheXaa: 0.0 ± 0.0
Gly
2.057GlyAla: 2.057 ± 0.592
0.441GlyCys: 0.441 ± 0.484
3.674GlyAsp: 3.674 ± 0.704
4.262GlyGlu: 4.262 ± 0.909
1.323GlyPhe: 1.323 ± 0.804
3.968GlyGly: 3.968 ± 0.914
0.147GlyHis: 0.147 ± 0.24
1.91GlyIle: 1.91 ± 0.323
5.731GlyLys: 5.731 ± 1.282
4.262GlyLeu: 4.262 ± 0.5
1.616GlyMet: 1.616 ± 0.507
2.057GlyAsn: 2.057 ± 0.693
0.0GlyPro: 0.0 ± 0.0
1.176GlyGln: 1.176 ± 0.407
1.47GlyArg: 1.47 ± 0.263
2.939GlySer: 2.939 ± 0.577
2.204GlyThr: 2.204 ± 0.49
4.996GlyVal: 4.996 ± 0.714
0.294GlyTrp: 0.294 ± 0.114
1.323GlyTyr: 1.323 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
1.176HisAla: 1.176 ± 0.317
0.441HisCys: 0.441 ± 0.191
0.735HisAsp: 0.735 ± 0.22
0.0HisGlu: 0.0 ± 0.0
1.323HisPhe: 1.323 ± 0.693
0.147HisGly: 0.147 ± 0.24
0.294HisHis: 0.294 ± 0.214
0.294HisIle: 0.294 ± 0.114
1.91HisLys: 1.91 ± 0.578
1.763HisLeu: 1.763 ± 0.402
0.735HisMet: 0.735 ± 0.221
0.735HisAsn: 0.735 ± 0.236
0.147HisPro: 0.147 ± 0.207
0.441HisGln: 0.441 ± 0.344
0.735HisArg: 0.735 ± 0.398
2.204HisSer: 2.204 ± 0.526
0.441HisThr: 0.441 ± 0.154
1.029HisVal: 1.029 ± 0.314
0.0HisTrp: 0.0 ± 0.0
1.616HisTyr: 1.616 ± 0.237
0.0HisXaa: 0.0 ± 0.0
Ile
1.616IleAla: 1.616 ± 0.502
2.204IleCys: 2.204 ± 0.626
4.555IleAsp: 4.555 ± 0.941
4.996IleGlu: 4.996 ± 0.789
3.968IlePhe: 3.968 ± 0.844
1.616IleGly: 1.616 ± 0.436
1.176IleHis: 1.176 ± 0.398
5.143IleIle: 5.143 ± 0.685
4.409IleLys: 4.409 ± 0.4
5.878IleLeu: 5.878 ± 0.671
1.176IleMet: 1.176 ± 0.318
4.115IleAsn: 4.115 ± 0.575
3.086IlePro: 3.086 ± 0.434
1.763IleGln: 1.763 ± 0.207
3.086IleArg: 3.086 ± 0.504
7.054IleSer: 7.054 ± 0.785
3.233IleThr: 3.233 ± 0.544
4.702IleVal: 4.702 ± 0.613
0.0IleTrp: 0.0 ± 0.0
1.91IleTyr: 1.91 ± 0.458
0.0IleXaa: 0.0 ± 0.0
Lys
3.674LysAla: 3.674 ± 0.597
1.323LysCys: 1.323 ± 0.28
4.262LysAsp: 4.262 ± 0.668
3.821LysGlu: 3.821 ± 0.858
7.788LysPhe: 7.788 ± 1.451
2.645LysGly: 2.645 ± 0.503
1.763LysHis: 1.763 ± 0.791
5.731LysIle: 5.731 ± 0.504
5.584LysLys: 5.584 ± 0.986
5.584LysLeu: 5.584 ± 1.206
1.763LysMet: 1.763 ± 0.696
4.702LysAsn: 4.702 ± 0.538
2.204LysPro: 2.204 ± 0.342
1.47LysGln: 1.47 ± 0.72
4.409LysArg: 4.409 ± 0.518
7.201LysSer: 7.201 ± 1.332
5.731LysThr: 5.731 ± 0.717
5.878LysVal: 5.878 ± 0.853
1.029LysTrp: 1.029 ± 0.403
3.674LysTyr: 3.674 ± 0.541
0.0LysXaa: 0.0 ± 0.0
Leu
3.086LeuAla: 3.086 ± 0.557
2.204LeuCys: 2.204 ± 0.62
4.996LeuAsp: 4.996 ± 0.66
4.702LeuGlu: 4.702 ± 0.98
3.233LeuPhe: 3.233 ± 0.94
4.409LeuGly: 4.409 ± 0.699
1.029LeuHis: 1.029 ± 0.314
5.584LeuIle: 5.584 ± 0.992
8.082LeuLys: 8.082 ± 1.26
9.111LeuLeu: 9.111 ± 1.7
3.233LeuMet: 3.233 ± 0.673
6.76LeuAsn: 6.76 ± 0.875
3.527LeuPro: 3.527 ± 0.879
1.763LeuGln: 1.763 ± 0.397
5.731LeuArg: 5.731 ± 0.876
10.58LeuSer: 10.58 ± 1.091
4.702LeuThr: 4.702 ± 0.5
4.996LeuVal: 4.996 ± 1.76
0.147LeuTrp: 0.147 ± 0.207
2.645LeuTyr: 2.645 ± 0.816
0.0LeuXaa: 0.0 ± 0.0
Met
1.91MetAla: 1.91 ± 0.385
0.441MetCys: 0.441 ± 0.173
1.323MetAsp: 1.323 ± 0.475
0.588MetGlu: 0.588 ± 0.238
1.47MetPhe: 1.47 ± 0.374
0.441MetGly: 0.441 ± 0.366
0.294MetHis: 0.294 ± 0.114
2.204MetIle: 2.204 ± 0.513
1.323MetLys: 1.323 ± 0.419
2.057MetLeu: 2.057 ± 0.705
0.441MetMet: 0.441 ± 0.191
1.616MetAsn: 1.616 ± 0.407
0.735MetPro: 0.735 ± 0.221
0.294MetGln: 0.294 ± 0.288
1.763MetArg: 1.763 ± 0.541
3.233MetSer: 3.233 ± 0.602
2.498MetThr: 2.498 ± 0.552
1.91MetVal: 1.91 ± 0.333
0.147MetTrp: 0.147 ± 0.207
1.323MetTyr: 1.323 ± 0.185
0.0MetXaa: 0.0 ± 0.0
Asn
1.763AsnAla: 1.763 ± 0.301
0.882AsnCys: 0.882 ± 0.341
3.527AsnAsp: 3.527 ± 0.985
2.939AsnGlu: 2.939 ± 0.724
4.849AsnPhe: 4.849 ± 0.279
2.498AsnGly: 2.498 ± 0.447
1.176AsnHis: 1.176 ± 0.276
4.262AsnIle: 4.262 ± 0.247
5.29AsnLys: 5.29 ± 0.452
5.731AsnLeu: 5.731 ± 0.834
0.882AsnMet: 0.882 ± 0.305
1.763AsnAsn: 1.763 ± 0.721
1.763AsnPro: 1.763 ± 0.271
2.939AsnGln: 2.939 ± 0.659
2.498AsnArg: 2.498 ± 0.895
6.613AsnSer: 6.613 ± 0.703
3.233AsnThr: 3.233 ± 0.429
4.849AsnVal: 4.849 ± 0.504
0.588AsnTrp: 0.588 ± 0.227
1.47AsnTyr: 1.47 ± 0.777
0.0AsnXaa: 0.0 ± 0.0
Pro
0.588ProAla: 0.588 ± 0.177
0.882ProCys: 0.882 ± 0.251
2.351ProAsp: 2.351 ± 0.405
2.057ProGlu: 2.057 ± 0.209
2.057ProPhe: 2.057 ± 0.851
1.323ProGly: 1.323 ± 0.387
0.0ProHis: 0.0 ± 0.0
0.882ProIle: 0.882 ± 0.275
0.735ProLys: 0.735 ± 0.25
2.351ProLeu: 2.351 ± 0.664
0.588ProMet: 0.588 ± 0.177
2.204ProAsn: 2.204 ± 0.61
1.47ProPro: 1.47 ± 0.523
0.882ProGln: 0.882 ± 0.226
1.176ProArg: 1.176 ± 0.233
2.351ProSer: 2.351 ± 0.38
1.176ProThr: 1.176 ± 0.784
4.262ProVal: 4.262 ± 1.268
0.0ProTrp: 0.0 ± 0.0
2.204ProTyr: 2.204 ± 0.448
0.0ProXaa: 0.0 ± 0.0
Gln
1.176GlnAla: 1.176 ± 0.276
0.588GlnCys: 0.588 ± 0.227
1.763GlnAsp: 1.763 ± 0.75
1.176GlnGlu: 1.176 ± 0.565
1.616GlnPhe: 1.616 ± 0.454
1.323GlnGly: 1.323 ± 0.514
0.735GlnHis: 0.735 ± 0.194
2.204GlnIle: 2.204 ± 0.788
2.498GlnLys: 2.498 ± 0.608
2.351GlnLeu: 2.351 ± 0.83
0.147GlnMet: 0.147 ± 0.158
1.176GlnAsn: 1.176 ± 0.413
0.441GlnPro: 0.441 ± 0.191
0.588GlnGln: 0.588 ± 0.427
1.616GlnArg: 1.616 ± 0.626
1.91GlnSer: 1.91 ± 0.953
1.323GlnThr: 1.323 ± 0.259
1.323GlnVal: 1.323 ± 0.565
0.441GlnTrp: 0.441 ± 0.173
1.029GlnTyr: 1.029 ± 0.49
0.0GlnXaa: 0.0 ± 0.0
Arg
2.792ArgAla: 2.792 ± 0.532
1.029ArgCys: 1.029 ± 0.314
3.38ArgAsp: 3.38 ± 0.907
1.91ArgGlu: 1.91 ± 0.64
2.057ArgPhe: 2.057 ± 0.532
2.498ArgGly: 2.498 ± 1.254
1.323ArgHis: 1.323 ± 0.369
3.086ArgIle: 3.086 ± 0.67
3.233ArgLys: 3.233 ± 0.278
4.262ArgLeu: 4.262 ± 0.769
1.616ArgMet: 1.616 ± 0.347
2.792ArgAsn: 2.792 ± 0.461
1.176ArgPro: 1.176 ± 0.452
1.029ArgGln: 1.029 ± 0.289
2.204ArgArg: 2.204 ± 0.445
4.996ArgSer: 4.996 ± 0.763
2.057ArgThr: 2.057 ± 0.31
3.821ArgVal: 3.821 ± 0.654
0.294ArgTrp: 0.294 ± 0.431
2.204ArgTyr: 2.204 ± 0.62
0.0ArgXaa: 0.0 ± 0.0
Ser
5.143SerAla: 5.143 ± 0.833
2.057SerCys: 2.057 ± 0.349
5.878SerAsp: 5.878 ± 1.101
4.409SerGlu: 4.409 ± 0.657
7.348SerPhe: 7.348 ± 0.772
3.233SerGly: 3.233 ± 0.637
2.351SerHis: 2.351 ± 0.58
6.025SerIle: 6.025 ± 1.975
7.201SerLys: 7.201 ± 0.958
9.846SerLeu: 9.846 ± 1.234
3.086SerMet: 3.086 ± 0.706
6.172SerAsn: 6.172 ± 0.971
2.792SerPro: 2.792 ± 0.37
2.351SerGln: 2.351 ± 0.656
4.409SerArg: 4.409 ± 0.553
7.054SerSer: 7.054 ± 1.296
4.262SerThr: 4.262 ± 0.663
5.878SerVal: 5.878 ± 1.795
0.294SerTrp: 0.294 ± 0.114
3.086SerTyr: 3.086 ± 0.674
0.0SerXaa: 0.0 ± 0.0
Thr
2.792ThrAla: 2.792 ± 0.796
1.029ThrCys: 1.029 ± 0.338
3.086ThrAsp: 3.086 ± 0.423
3.086ThrGlu: 3.086 ± 1.099
3.674ThrPhe: 3.674 ± 0.932
2.645ThrGly: 2.645 ± 1.06
1.176ThrHis: 1.176 ± 0.454
4.115ThrIle: 4.115 ± 0.748
0.882ThrLys: 0.882 ± 0.862
4.996ThrLeu: 4.996 ± 0.895
0.588ThrMet: 0.588 ± 0.233
3.527ThrAsn: 3.527 ± 0.818
1.616ThrPro: 1.616 ± 0.356
2.057ThrGln: 2.057 ± 0.344
1.91ThrArg: 1.91 ± 0.31
5.878ThrSer: 5.878 ± 0.99
1.91ThrThr: 1.91 ± 0.287
3.968ThrVal: 3.968 ± 0.393
0.735ThrTrp: 0.735 ± 0.28
2.645ThrTyr: 2.645 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
2.351ValAla: 2.351 ± 0.55
1.91ValCys: 1.91 ± 0.303
4.849ValAsp: 4.849 ± 0.698
5.584ValGlu: 5.584 ± 0.93
3.968ValPhe: 3.968 ± 0.971
3.821ValGly: 3.821 ± 0.762
1.176ValHis: 1.176 ± 0.454
4.996ValIle: 4.996 ± 0.678
7.201ValLys: 7.201 ± 1.193
4.702ValLeu: 4.702 ± 0.594
1.763ValMet: 1.763 ± 0.323
4.849ValAsn: 4.849 ± 0.937
2.939ValPro: 2.939 ± 0.573
1.616ValGln: 1.616 ± 0.496
4.996ValArg: 4.996 ± 0.737
6.319ValSer: 6.319 ± 1.503
3.233ValThr: 3.233 ± 0.689
6.319ValVal: 6.319 ± 0.909
0.588ValTrp: 0.588 ± 0.227
2.939ValTyr: 2.939 ± 0.763
0.0ValXaa: 0.0 ± 0.0
Trp
0.588TrpAla: 0.588 ± 0.271
0.294TrpCys: 0.294 ± 0.114
0.441TrpAsp: 0.441 ± 0.201
0.294TrpGlu: 0.294 ± 0.114
0.147TrpPhe: 0.147 ± 0.21
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.029TrpIle: 1.029 ± 0.285
0.735TrpLys: 0.735 ± 0.413
1.029TrpLeu: 1.029 ± 0.304
0.147TrpMet: 0.147 ± 0.34
0.0TrpAsn: 0.0 ± 0.0
0.147TrpPro: 0.147 ± 0.158
0.0TrpGln: 0.0 ± 0.0
0.294TrpArg: 0.294 ± 0.114
0.588TrpSer: 0.588 ± 0.407
0.294TrpThr: 0.294 ± 0.114
0.147TrpVal: 0.147 ± 0.207
0.0TrpTrp: 0.0 ± 0.0
0.294TrpTyr: 0.294 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.057TyrAla: 2.057 ± 0.335
1.91TyrCys: 1.91 ± 0.511
1.91TyrAsp: 1.91 ± 0.284
2.057TyrGlu: 2.057 ± 0.59
1.323TyrPhe: 1.323 ± 0.389
1.176TyrGly: 1.176 ± 0.37
0.588TyrHis: 0.588 ± 0.203
2.939TyrIle: 2.939 ± 0.792
3.233TyrLys: 3.233 ± 0.458
3.968TyrLeu: 3.968 ± 0.966
0.882TyrMet: 0.882 ± 0.43
1.616TyrAsn: 1.616 ± 0.416
2.204TyrPro: 2.204 ± 0.477
0.588TyrGln: 0.588 ± 0.463
2.792TyrArg: 2.792 ± 0.557
3.674TyrSer: 3.674 ± 0.542
2.645TyrThr: 2.645 ± 0.303
2.351TyrVal: 2.351 ± 0.835
0.147TyrTrp: 0.147 ± 0.209
3.38TyrTyr: 3.38 ± 0.744
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (6806 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
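
The note above can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the procedure used by the database: whole proteins are resampled with replacement 100 times, the per-mille frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the use of the population standard deviation are all hypothetical choices.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Each round draws len(sequences) whole proteins with replacement,
    so residues within a protein are never shuffled or split.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    # Assumption: the spread is reported as a (population) standard deviation.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteins = ["MKLSLLKV", "MSLKAA", "LSLSLS"]  # made-up sequences
    value = pair_per_mille(toy_proteins, "SL")
    error = bootstrap_error(toy_proteins, "SL")
    print(f"SL: {value:.3f} ± {error:.3f}")
```

Under this scheme, a dipeptide that never occurs in any protein yields 0.0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.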

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski