Amino acid dipepetide frequency for Tomato yellow mottle-associated virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.804AlaAla: 4.804 ± 2.643
1.011AlaCys: 1.011 ± 0.367
3.034AlaAsp: 3.034 ± 0.84
3.034AlaGlu: 3.034 ± 1.098
1.011AlaPhe: 1.011 ± 0.399
1.517AlaGly: 1.517 ± 0.433
0.253AlaHis: 0.253 ± 0.143
4.804AlaIle: 4.804 ± 1.288
3.287AlaLys: 3.287 ± 1.498
4.804AlaLeu: 4.804 ± 1.837
1.517AlaMet: 1.517 ± 0.483
1.517AlaAsn: 1.517 ± 0.739
1.77AlaPro: 1.77 ± 0.955
1.517AlaGln: 1.517 ± 0.885
0.759AlaArg: 0.759 ± 0.324
5.057AlaSer: 5.057 ± 0.742
5.563AlaThr: 5.563 ± 0.873
3.793AlaVal: 3.793 ± 1.055
0.759AlaTrp: 0.759 ± 0.324
2.276AlaTyr: 2.276 ± 0.716
0.0AlaXaa: 0.0 ± 0.0
Cys
1.011CysAla: 1.011 ± 0.304
0.506CysCys: 0.506 ± 0.289
2.023CysAsp: 2.023 ± 0.592
1.264CysGlu: 1.264 ± 0.347
0.506CysPhe: 0.506 ± 0.287
1.517CysGly: 1.517 ± 0.454
0.759CysHis: 0.759 ± 0.297
2.276CysIle: 2.276 ± 1.057
0.759CysLys: 0.759 ± 0.43
1.517CysLeu: 1.517 ± 0.51
0.506CysMet: 0.506 ± 0.692
0.759CysAsn: 0.759 ± 0.462
1.011CysPro: 1.011 ± 0.963
0.253CysGln: 0.253 ± 0.346
1.011CysArg: 1.011 ± 0.648
1.264CysSer: 1.264 ± 0.569
0.506CysThr: 0.506 ± 0.289
1.517CysVal: 1.517 ± 0.601
0.253CysTrp: 0.253 ± 0.143
0.506CysTyr: 0.506 ± 0.289
0.0CysXaa: 0.0 ± 0.0
Asp
2.023AspAla: 2.023 ± 0.93
1.264AspCys: 1.264 ± 0.717
3.793AspAsp: 3.793 ± 0.838
3.54AspGlu: 3.54 ± 0.642
1.517AspPhe: 1.517 ± 0.663
4.298AspGly: 4.298 ± 1.178
1.264AspHis: 1.264 ± 0.423
3.793AspIle: 3.793 ± 1.601
4.046AspLys: 4.046 ± 0.4
6.068AspLeu: 6.068 ± 0.926
2.023AspMet: 2.023 ± 1.24
4.046AspAsn: 4.046 ± 0.721
2.528AspPro: 2.528 ± 0.851
1.264AspGln: 1.264 ± 0.341
4.046AspArg: 4.046 ± 0.387
5.563AspSer: 5.563 ± 0.883
3.793AspThr: 3.793 ± 0.481
3.793AspVal: 3.793 ± 0.81
1.011AspTrp: 1.011 ± 0.436
1.77AspTyr: 1.77 ± 0.456
0.0AspXaa: 0.0 ± 0.0
Glu
2.781GluAla: 2.781 ± 0.475
1.517GluCys: 1.517 ± 0.606
3.034GluAsp: 3.034 ± 0.624
5.057GluGlu: 5.057 ± 0.38
1.264GluPhe: 1.264 ± 0.42
3.54GluGly: 3.54 ± 0.777
1.011GluHis: 1.011 ± 0.677
4.551GluIle: 4.551 ± 1.773
4.551GluLys: 4.551 ± 0.925
6.321GluLeu: 6.321 ± 1.484
1.77GluMet: 1.77 ± 0.605
3.54GluAsn: 3.54 ± 1.554
2.023GluPro: 2.023 ± 0.551
1.77GluGln: 1.77 ± 0.685
5.057GluArg: 5.057 ± 1.048
4.551GluSer: 4.551 ± 0.715
3.287GluThr: 3.287 ± 1.524
3.034GluVal: 3.034 ± 0.595
1.011GluTrp: 1.011 ± 0.356
1.517GluTyr: 1.517 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
1.264PheAla: 1.264 ± 0.504
0.759PheCys: 0.759 ± 0.338
1.011PheAsp: 1.011 ± 0.391
2.276PheGlu: 2.276 ± 0.933
0.506PhePhe: 0.506 ± 0.287
0.759PheGly: 0.759 ± 0.508
0.759PheHis: 0.759 ± 0.441
1.011PheIle: 1.011 ± 0.367
2.528PheLys: 2.528 ± 0.631
2.781PheLeu: 2.781 ± 0.833
0.253PheMet: 0.253 ± 0.143
1.517PheAsn: 1.517 ± 0.529
2.023PhePro: 2.023 ± 0.665
0.253PheGln: 0.253 ± 0.143
2.276PheArg: 2.276 ± 0.532
4.551PheSer: 4.551 ± 1.001
1.264PheThr: 1.264 ± 0.369
2.781PheVal: 2.781 ± 1.171
0.0PheTrp: 0.0 ± 0.0
0.759PheTyr: 0.759 ± 0.338
0.0PheXaa: 0.0 ± 0.0
Gly
2.023GlyAla: 2.023 ± 0.772
1.517GlyCys: 1.517 ± 1.242
2.528GlyAsp: 2.528 ± 0.689
2.023GlyGlu: 2.023 ± 0.77
2.276GlyPhe: 2.276 ± 0.771
3.034GlyGly: 3.034 ± 0.993
0.759GlyHis: 0.759 ± 0.43
1.517GlyIle: 1.517 ± 0.86
2.781GlyLys: 2.781 ± 1.146
5.057GlyLeu: 5.057 ± 1.155
2.781GlyMet: 2.781 ± 0.761
2.276GlyAsn: 2.276 ± 0.43
1.517GlyPro: 1.517 ± 1.472
1.264GlyGln: 1.264 ± 0.641
2.781GlyArg: 2.781 ± 0.884
5.057GlySer: 5.057 ± 1.817
2.781GlyThr: 2.781 ± 0.682
3.287GlyVal: 3.287 ± 0.844
0.759GlyTrp: 0.759 ± 0.43
1.77GlyTyr: 1.77 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
0.506HisAla: 0.506 ± 0.304
0.253HisCys: 0.253 ± 0.346
2.276HisAsp: 2.276 ± 0.615
1.517HisGlu: 1.517 ± 0.411
0.506HisPhe: 0.506 ± 0.282
1.517HisGly: 1.517 ± 0.886
1.264HisHis: 1.264 ± 0.606
1.517HisIle: 1.517 ± 0.591
1.517HisLys: 1.517 ± 0.438
1.77HisLeu: 1.77 ± 0.262
0.506HisMet: 0.506 ± 0.289
1.264HisAsn: 1.264 ± 0.554
1.011HisPro: 1.011 ± 0.573
1.011HisGln: 1.011 ± 0.449
1.77HisArg: 1.77 ± 0.652
1.77HisSer: 1.77 ± 0.34
0.253HisThr: 0.253 ± 0.346
0.253HisVal: 0.253 ± 0.143
0.759HisTrp: 0.759 ± 0.43
1.517HisTyr: 1.517 ± 0.595
0.0HisXaa: 0.0 ± 0.0
Ile
3.034IleAla: 3.034 ± 1.301
0.759IleCys: 0.759 ± 0.391
2.276IleAsp: 2.276 ± 0.902
4.298IleGlu: 4.298 ± 0.748
1.264IlePhe: 1.264 ± 1.056
5.563IleGly: 5.563 ± 1.208
0.759IleHis: 0.759 ± 0.356
3.793IleIle: 3.793 ± 1.01
4.804IleLys: 4.804 ± 1.086
4.046IleLeu: 4.046 ± 1.523
2.781IleMet: 2.781 ± 0.899
3.034IleAsn: 3.034 ± 0.835
3.034IlePro: 3.034 ± 1.327
1.77IleGln: 1.77 ± 0.525
5.31IleArg: 5.31 ± 1.261
9.861IleSer: 9.861 ± 1.365
4.046IleThr: 4.046 ± 0.982
3.287IleVal: 3.287 ± 0.991
1.264IleTrp: 1.264 ± 0.471
3.034IleTyr: 3.034 ± 0.659
0.0IleXaa: 0.0 ± 0.0
Lys
4.804LysAla: 4.804 ± 0.686
0.506LysCys: 0.506 ± 0.289
4.551LysAsp: 4.551 ± 1.325
4.046LysGlu: 4.046 ± 0.645
2.276LysPhe: 2.276 ± 0.8
2.528LysGly: 2.528 ± 0.553
1.517LysHis: 1.517 ± 0.584
4.046LysIle: 4.046 ± 1.117
3.034LysLys: 3.034 ± 0.815
3.793LysLeu: 3.793 ± 1.356
2.781LysMet: 2.781 ± 0.825
1.77LysAsn: 1.77 ± 0.36
1.77LysPro: 1.77 ± 1.099
1.77LysGln: 1.77 ± 0.986
4.551LysArg: 4.551 ± 0.779
5.815LysSer: 5.815 ± 1.139
4.298LysThr: 4.298 ± 1.65
4.298LysVal: 4.298 ± 1.01
1.517LysTrp: 1.517 ± 0.55
1.77LysTyr: 1.77 ± 0.62
0.0LysXaa: 0.0 ± 0.0
Leu
5.057LeuAla: 5.057 ± 1.84
1.264LeuCys: 1.264 ± 0.471
4.298LeuAsp: 4.298 ± 0.836
5.563LeuGlu: 5.563 ± 1.188
2.781LeuPhe: 2.781 ± 0.86
3.793LeuGly: 3.793 ± 0.932
3.287LeuHis: 3.287 ± 0.807
4.551LeuIle: 4.551 ± 1.017
5.31LeuLys: 5.31 ± 0.788
7.332LeuLeu: 7.332 ± 1.474
3.034LeuMet: 3.034 ± 0.735
4.046LeuAsn: 4.046 ± 0.704
2.276LeuPro: 2.276 ± 0.832
2.528LeuGln: 2.528 ± 1.471
7.08LeuArg: 7.08 ± 1.174
9.102LeuSer: 9.102 ± 3.02
4.804LeuThr: 4.804 ± 1.815
4.551LeuVal: 4.551 ± 0.474
1.011LeuTrp: 1.011 ± 0.718
5.057LeuTyr: 5.057 ± 1.506
0.0LeuXaa: 0.0 ± 0.0
Met
1.77MetAla: 1.77 ± 0.557
0.253MetCys: 0.253 ± 0.346
2.023MetAsp: 2.023 ± 0.573
2.276MetGlu: 2.276 ± 0.796
1.264MetPhe: 1.264 ± 0.471
1.011MetGly: 1.011 ± 0.433
0.506MetHis: 0.506 ± 0.47
2.528MetIle: 2.528 ± 0.718
3.034MetLys: 3.034 ± 0.737
1.011MetLeu: 1.011 ± 0.436
1.77MetMet: 1.77 ± 0.456
1.517MetAsn: 1.517 ± 0.779
1.517MetPro: 1.517 ± 0.435
1.264MetGln: 1.264 ± 0.689
2.023MetArg: 2.023 ± 0.537
4.804MetSer: 4.804 ± 1.113
3.54MetThr: 3.54 ± 0.781
1.517MetVal: 1.517 ± 0.355
0.506MetTrp: 0.506 ± 0.282
1.517MetTyr: 1.517 ± 0.867
0.0MetXaa: 0.0 ± 0.0
Asn
1.77AsnAla: 1.77 ± 0.592
0.759AsnCys: 0.759 ± 0.297
2.276AsnAsp: 2.276 ± 0.791
3.54AsnGlu: 3.54 ± 0.53
1.264AsnPhe: 1.264 ± 0.921
1.77AsnGly: 1.77 ± 0.491
0.759AsnHis: 0.759 ± 0.369
4.551AsnIle: 4.551 ± 1.128
3.54AsnLys: 3.54 ± 0.848
2.528AsnLeu: 2.528 ± 0.496
0.759AsnMet: 0.759 ± 0.474
2.023AsnAsn: 2.023 ± 0.898
2.781AsnPro: 2.781 ± 0.898
2.528AsnGln: 2.528 ± 1.628
2.528AsnArg: 2.528 ± 0.576
1.517AsnSer: 1.517 ± 0.382
2.781AsnThr: 2.781 ± 1.159
3.54AsnVal: 3.54 ± 2.162
0.506AsnTrp: 0.506 ± 0.287
2.276AsnTyr: 2.276 ± 0.541
0.0AsnXaa: 0.0 ± 0.0
Pro
1.77ProAla: 1.77 ± 0.363
0.759ProCys: 0.759 ± 0.43
2.023ProAsp: 2.023 ± 0.771
4.298ProGlu: 4.298 ± 0.578
1.264ProPhe: 1.264 ± 0.606
1.011ProGly: 1.011 ± 0.737
0.759ProHis: 0.759 ± 0.426
3.034ProIle: 3.034 ± 1.008
2.528ProLys: 2.528 ± 1.09
4.551ProLeu: 4.551 ± 1.424
0.506ProMet: 0.506 ± 0.289
1.011ProAsn: 1.011 ± 0.631
2.276ProPro: 2.276 ± 0.563
1.264ProGln: 1.264 ± 0.497
2.276ProArg: 2.276 ± 0.449
4.551ProSer: 4.551 ± 1.267
2.276ProThr: 2.276 ± 1.137
2.528ProVal: 2.528 ± 1.613
0.253ProTrp: 0.253 ± 0.143
1.264ProTyr: 1.264 ± 0.423
0.0ProXaa: 0.0 ± 0.0
Gln
1.264GlnAla: 1.264 ± 0.545
1.77GlnCys: 1.77 ± 0.525
2.276GlnAsp: 2.276 ± 0.619
1.264GlnGlu: 1.264 ± 0.808
0.759GlnPhe: 0.759 ± 0.366
2.023GlnGly: 2.023 ± 1.087
0.506GlnHis: 0.506 ± 0.287
3.034GlnIle: 3.034 ± 0.959
1.264GlnLys: 1.264 ± 0.347
2.023GlnLeu: 2.023 ± 0.934
1.264GlnMet: 1.264 ± 0.406
1.77GlnAsn: 1.77 ± 1.007
0.506GlnPro: 0.506 ± 0.734
1.011GlnGln: 1.011 ± 0.46
1.264GlnArg: 1.264 ± 0.467
2.276GlnSer: 2.276 ± 0.555
1.264GlnThr: 1.264 ± 1.161
2.781GlnVal: 2.781 ± 1.056
0.253GlnTrp: 0.253 ± 0.346
0.506GlnTyr: 0.506 ± 0.56
0.0GlnXaa: 0.0 ± 0.0
Arg
5.057ArgAla: 5.057 ± 0.668
2.023ArgCys: 2.023 ± 0.67
2.276ArgAsp: 2.276 ± 1.758
2.781ArgGlu: 2.781 ± 0.304
2.023ArgPhe: 2.023 ± 0.418
2.276ArgGly: 2.276 ± 0.674
1.264ArgHis: 1.264 ± 0.423
5.31ArgIle: 5.31 ± 1.205
2.781ArgLys: 2.781 ± 0.65
5.815ArgLeu: 5.815 ± 1.216
1.77ArgMet: 1.77 ± 0.72
2.023ArgAsn: 2.023 ± 0.695
1.77ArgPro: 1.77 ± 0.539
2.276ArgGln: 2.276 ± 0.741
4.298ArgArg: 4.298 ± 1.041
5.815ArgSer: 5.815 ± 1.051
4.046ArgThr: 4.046 ± 0.874
4.298ArgVal: 4.298 ± 0.842
1.011ArgTrp: 1.011 ± 0.441
3.287ArgTyr: 3.287 ± 0.552
0.0ArgXaa: 0.0 ± 0.0
Ser
3.793SerAla: 3.793 ± 0.703
1.77SerCys: 1.77 ± 0.881
9.355SerAsp: 9.355 ± 1.685
5.057SerGlu: 5.057 ± 1.742
2.276SerPhe: 2.276 ± 0.812
6.068SerGly: 6.068 ± 0.949
3.034SerHis: 3.034 ± 0.914
6.827SerIle: 6.827 ± 2.278
5.31SerLys: 5.31 ± 1.171
9.608SerLeu: 9.608 ± 0.936
4.551SerMet: 4.551 ± 1.505
5.31SerAsn: 5.31 ± 1.046
4.046SerPro: 4.046 ± 0.723
1.011SerGln: 1.011 ± 0.742
5.057SerArg: 5.057 ± 0.472
8.85SerSer: 8.85 ± 2.211
5.563SerThr: 5.563 ± 1.057
5.815SerVal: 5.815 ± 2.076
1.011SerTrp: 1.011 ± 0.479
3.54SerTyr: 3.54 ± 0.907
0.0SerXaa: 0.0 ± 0.0
Thr
3.287ThrAla: 3.287 ± 0.965
0.506ThrCys: 0.506 ± 0.47
3.287ThrAsp: 3.287 ± 1.121
3.793ThrGlu: 3.793 ± 1.147
1.77ThrPhe: 1.77 ± 0.596
1.264ThrGly: 1.264 ± 0.471
1.517ThrHis: 1.517 ± 0.454
4.804ThrIle: 4.804 ± 1.76
3.793ThrLys: 3.793 ± 1.212
6.068ThrLeu: 6.068 ± 1.27
2.781ThrMet: 2.781 ± 0.566
2.023ThrAsn: 2.023 ± 0.802
2.528ThrPro: 2.528 ± 0.448
2.276ThrGln: 2.276 ± 1.002
3.54ThrArg: 3.54 ± 0.88
6.827ThrSer: 6.827 ± 1.307
4.804ThrThr: 4.804 ± 1.124
5.31ThrVal: 5.31 ± 1.459
1.264ThrTrp: 1.264 ± 0.834
1.011ThrTyr: 1.011 ± 0.492
0.0ThrXaa: 0.0 ± 0.0
Val
3.54ValAla: 3.54 ± 2.01
1.517ValCys: 1.517 ± 0.503
4.298ValAsp: 4.298 ± 1.114
3.793ValGlu: 3.793 ± 2.097
2.528ValPhe: 2.528 ± 0.57
1.517ValGly: 1.517 ± 0.74
1.011ValHis: 1.011 ± 0.549
4.046ValIle: 4.046 ± 1.056
2.781ValLys: 2.781 ± 0.834
7.08ValLeu: 7.08 ± 1.276
2.023ValMet: 2.023 ± 0.72
2.781ValAsn: 2.781 ± 0.442
3.793ValPro: 3.793 ± 1.225
2.276ValGln: 2.276 ± 1.049
3.793ValArg: 3.793 ± 1.037
5.563ValSer: 5.563 ± 1.375
4.804ValThr: 4.804 ± 1.377
3.793ValVal: 3.793 ± 0.894
1.264ValTrp: 1.264 ± 0.612
2.276ValTyr: 2.276 ± 0.77
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.471
0.253TrpCys: 0.253 ± 0.143
2.276TrpAsp: 2.276 ± 0.839
0.506TrpGlu: 0.506 ± 0.287
1.011TrpPhe: 1.011 ± 0.49
0.759TrpGly: 0.759 ± 0.338
0.0TrpHis: 0.0 ± 0.0
1.011TrpIle: 1.011 ± 0.573
1.011TrpLys: 1.011 ± 0.441
0.506TrpLeu: 0.506 ± 0.287
0.759TrpMet: 0.759 ± 0.412
1.011TrpAsn: 1.011 ± 0.477
0.0TrpPro: 0.0 ± 0.0
0.253TrpGln: 0.253 ± 0.415
0.759TrpArg: 0.759 ± 0.338
1.011TrpSer: 1.011 ± 0.367
1.517TrpThr: 1.517 ± 0.454
0.759TrpVal: 0.759 ± 0.324
0.506TrpTrp: 0.506 ± 0.287
0.253TrpTyr: 0.253 ± 0.347
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.264TyrAla: 1.264 ± 0.581
0.759TyrCys: 0.759 ± 0.297
2.528TyrAsp: 2.528 ± 0.751
1.264TyrGlu: 1.264 ± 0.471
1.264TyrPhe: 1.264 ± 0.581
2.023TyrGly: 2.023 ± 0.854
1.77TyrHis: 1.77 ± 0.652
1.011TyrIle: 1.011 ± 0.354
2.781TyrLys: 2.781 ± 0.81
4.046TyrLeu: 4.046 ± 1.292
1.264TyrMet: 1.264 ± 0.649
0.759TyrAsn: 0.759 ± 0.474
2.023TyrPro: 2.023 ± 0.922
1.517TyrGln: 1.517 ± 0.355
2.023TyrArg: 2.023 ± 0.747
4.046TyrSer: 4.046 ± 0.867
1.517TyrThr: 1.517 ± 0.545
3.54TyrVal: 3.54 ± 0.992
0.506TyrTrp: 0.506 ± 0.287
2.276TyrTyr: 2.276 ± 0.836
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3956 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski