Amino acid dipepetide frequency for Tobacco rattle virus (isolate PpK20) (TRV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.443AlaAla: 6.443 ± 1.698
0.339AlaCys: 0.339 ± 0.547
4.747AlaAsp: 4.747 ± 1.157
3.391AlaGlu: 3.391 ± 1.276
2.713AlaPhe: 2.713 ± 1.244
2.035AlaGly: 2.035 ± 0.619
1.017AlaHis: 1.017 ± 0.595
2.374AlaIle: 2.374 ± 0.804
3.73AlaLys: 3.73 ± 0.632
7.799AlaLeu: 7.799 ± 2.187
1.017AlaMet: 1.017 ± 0.595
3.73AlaAsn: 3.73 ± 1.565
1.356AlaPro: 1.356 ± 0.605
4.069AlaGln: 4.069 ± 1.003
3.391AlaArg: 3.391 ± 1.126
3.391AlaSer: 3.391 ± 2.579
3.391AlaThr: 3.391 ± 1.019
4.747AlaVal: 4.747 ± 1.669
0.0AlaTrp: 0.0 ± 0.0
1.017AlaTyr: 1.017 ± 0.462
0.0AlaXaa: 0.0 ± 0.0
Cys
1.695CysAla: 1.695 ± 0.781
0.678CysCys: 0.678 ± 0.551
1.356CysAsp: 1.356 ± 0.995
0.678CysGlu: 0.678 ± 0.396
0.0CysPhe: 0.0 ± 0.0
1.695CysGly: 1.695 ± 1.145
0.0CysHis: 0.0 ± 0.0
1.017CysIle: 1.017 ± 0.524
0.678CysLys: 0.678 ± 0.396
0.339CysLeu: 0.339 ± 0.198
0.0CysMet: 0.0 ± 0.0
1.356CysAsn: 1.356 ± 0.793
1.017CysPro: 1.017 ± 0.536
0.678CysGln: 0.678 ± 0.456
1.356CysArg: 1.356 ± 0.636
1.356CysSer: 1.356 ± 0.918
0.678CysThr: 0.678 ± 0.396
2.035CysVal: 2.035 ± 1.722
0.0CysTrp: 0.0 ± 0.0
1.017CysTyr: 1.017 ± 0.448
0.0CysXaa: 0.0 ± 0.0
Asp
2.374AspAla: 2.374 ± 0.712
0.678AspCys: 0.678 ± 0.396
6.104AspAsp: 6.104 ± 1.551
4.747AspGlu: 4.747 ± 1.505
2.374AspPhe: 2.374 ± 0.668
4.747AspGly: 4.747 ± 1.98
0.339AspHis: 0.339 ± 0.509
3.391AspIle: 3.391 ± 0.964
5.086AspLys: 5.086 ± 0.779
7.46AspLeu: 7.46 ± 1.25
1.695AspMet: 1.695 ± 0.706
2.035AspAsn: 2.035 ± 1.427
1.695AspPro: 1.695 ± 0.991
2.713AspGln: 2.713 ± 1.234
2.374AspArg: 2.374 ± 1.097
5.086AspSer: 5.086 ± 1.19
3.391AspThr: 3.391 ± 1.333
8.138AspVal: 8.138 ± 1.55
2.035AspTrp: 2.035 ± 0.51
2.713AspTyr: 2.713 ± 0.92
0.0AspXaa: 0.0 ± 0.0
Glu
2.035GluAla: 2.035 ± 0.558
0.678GluCys: 0.678 ± 0.396
2.035GluAsp: 2.035 ± 1.236
2.374GluGlu: 2.374 ± 0.927
2.374GluPhe: 2.374 ± 0.756
2.374GluGly: 2.374 ± 1.256
1.356GluHis: 1.356 ± 1.07
5.426GluIle: 5.426 ± 1.925
4.747GluLys: 4.747 ± 1.005
5.765GluLeu: 5.765 ± 1.256
1.356GluMet: 1.356 ± 0.617
4.408GluAsn: 4.408 ± 0.88
0.678GluPro: 0.678 ± 0.444
2.374GluGln: 2.374 ± 0.712
5.426GluArg: 5.426 ± 1.407
3.391GluSer: 3.391 ± 1.148
4.408GluThr: 4.408 ± 1.351
4.408GluVal: 4.408 ± 1.445
0.678GluTrp: 0.678 ± 0.444
2.374GluTyr: 2.374 ± 0.804
0.0GluXaa: 0.0 ± 0.0
Phe
3.391PheAla: 3.391 ± 0.968
0.678PheCys: 0.678 ± 0.396
4.069PheAsp: 4.069 ± 0.919
3.391PheGlu: 3.391 ± 0.736
1.356PhePhe: 1.356 ± 0.521
3.052PheGly: 3.052 ± 1.61
1.017PheHis: 1.017 ± 0.683
1.017PheIle: 1.017 ± 0.609
3.391PheLys: 3.391 ± 0.989
6.782PheLeu: 6.782 ± 1.615
1.695PheMet: 1.695 ± 0.695
1.356PheAsn: 1.356 ± 0.493
2.374PhePro: 2.374 ± 0.959
0.678PheGln: 0.678 ± 0.444
2.035PheArg: 2.035 ± 0.51
3.391PheSer: 3.391 ± 1.084
1.356PheThr: 1.356 ± 0.888
3.73PheVal: 3.73 ± 0.632
0.339PheTrp: 0.339 ± 0.198
1.356PheTyr: 1.356 ± 0.555
0.0PheXaa: 0.0 ± 0.0
Gly
3.391GlyAla: 3.391 ± 1.309
1.356GlyCys: 1.356 ± 0.751
4.408GlyAsp: 4.408 ± 0.765
2.713GlyGlu: 2.713 ± 0.468
4.069GlyPhe: 4.069 ± 0.738
4.069GlyGly: 4.069 ± 1.522
2.374GlyHis: 2.374 ± 1.794
1.695GlyIle: 1.695 ± 1.065
5.765GlyLys: 5.765 ± 1.375
4.747GlyLeu: 4.747 ± 1.598
1.356GlyMet: 1.356 ± 0.793
2.035GlyAsn: 2.035 ± 0.857
3.73GlyPro: 3.73 ± 1.097
1.356GlyGln: 1.356 ± 0.521
2.374GlyArg: 2.374 ± 1.004
2.713GlySer: 2.713 ± 1.377
3.052GlyThr: 3.052 ± 0.867
4.408GlyVal: 4.408 ± 1.387
1.017GlyTrp: 1.017 ± 0.977
2.374GlyTyr: 2.374 ± 1.074
0.0GlyXaa: 0.0 ± 0.0
His
2.374HisAla: 2.374 ± 0.755
0.678HisCys: 0.678 ± 0.456
1.356HisAsp: 1.356 ± 1.048
2.713HisGlu: 2.713 ± 0.574
1.356HisPhe: 1.356 ± 0.555
1.356HisGly: 1.356 ± 0.555
0.339HisHis: 0.339 ± 0.198
1.017HisIle: 1.017 ± 0.448
1.356HisLys: 1.356 ± 0.856
2.035HisLeu: 2.035 ± 1.141
0.0HisMet: 0.0 ± 0.0
0.678HisAsn: 0.678 ± 0.551
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.678HisArg: 0.678 ± 0.74
1.017HisSer: 1.017 ± 0.536
0.678HisThr: 0.678 ± 0.396
1.356HisVal: 1.356 ± 1.041
0.0HisTrp: 0.0 ± 0.0
0.339HisTyr: 0.339 ± 0.509
0.0HisXaa: 0.0 ± 0.0
Ile
3.391IleAla: 3.391 ± 1.532
0.339IleCys: 0.339 ± 0.198
2.035IleAsp: 2.035 ± 0.816
2.713IleGlu: 2.713 ± 2.127
1.356IlePhe: 1.356 ± 0.793
3.73IleGly: 3.73 ± 1.286
1.356IleHis: 1.356 ± 0.913
0.0IleIle: 0.0 ± 0.0
3.391IleLys: 3.391 ± 0.873
1.695IleLeu: 1.695 ± 0.929
0.678IleMet: 0.678 ± 0.396
2.035IleAsn: 2.035 ± 0.661
0.339IlePro: 0.339 ± 0.604
1.695IleGln: 1.695 ± 1.184
4.408IleArg: 4.408 ± 0.72
3.391IleSer: 3.391 ± 0.82
1.695IleThr: 1.695 ± 0.724
5.086IleVal: 5.086 ± 0.991
1.017IleTrp: 1.017 ± 0.794
2.374IleTyr: 2.374 ± 1.032
0.0IleXaa: 0.0 ± 0.0
Lys
2.374LysAla: 2.374 ± 0.627
0.678LysCys: 0.678 ± 0.551
3.73LysAsp: 3.73 ± 1.318
4.408LysGlu: 4.408 ± 1.292
5.086LysPhe: 5.086 ± 1.82
2.713LysGly: 2.713 ± 0.823
1.356LysHis: 1.356 ± 1.048
2.374LysIle: 2.374 ± 0.67
8.138LysLys: 8.138 ± 2.3
7.121LysLeu: 7.121 ± 1.164
0.678LysMet: 0.678 ± 0.498
2.713LysAsn: 2.713 ± 0.528
3.73LysPro: 3.73 ± 1.547
2.035LysGln: 2.035 ± 1.219
6.443LysArg: 6.443 ± 1.953
7.46LysSer: 7.46 ± 2.021
4.069LysThr: 4.069 ± 1.73
7.121LysVal: 7.121 ± 2.334
0.678LysTrp: 0.678 ± 0.456
2.035LysTyr: 2.035 ± 1.012
0.0LysXaa: 0.0 ± 0.0
Leu
3.391LeuAla: 3.391 ± 1.306
2.713LeuCys: 2.713 ± 1.158
7.799LeuAsp: 7.799 ± 1.6
4.747LeuGlu: 4.747 ± 1.544
4.069LeuPhe: 4.069 ± 0.979
5.765LeuGly: 5.765 ± 1.699
1.356LeuHis: 1.356 ± 0.605
4.408LeuIle: 4.408 ± 0.918
5.086LeuLys: 5.086 ± 1.596
9.156LeuLeu: 9.156 ± 2.073
3.052LeuMet: 3.052 ± 0.652
4.069LeuAsn: 4.069 ± 1.487
2.035LeuPro: 2.035 ± 0.91
5.086LeuGln: 5.086 ± 1.235
2.713LeuArg: 2.713 ± 1.099
6.782LeuSer: 6.782 ± 1.744
6.443LeuThr: 6.443 ± 0.876
6.104LeuVal: 6.104 ± 1.478
1.356LeuTrp: 1.356 ± 0.521
3.391LeuTyr: 3.391 ± 1.248
0.339LeuXaa: 0.339 ± 0.198
Met
1.356MetAla: 1.356 ± 0.793
0.0MetCys: 0.0 ± 0.0
1.017MetAsp: 1.017 ± 0.536
1.695MetGlu: 1.695 ± 0.604
1.695MetPhe: 1.695 ± 0.461
1.017MetGly: 1.017 ± 0.722
0.678MetHis: 0.678 ± 0.456
1.356MetIle: 1.356 ± 0.617
1.695MetLys: 1.695 ± 0.503
2.374MetLeu: 2.374 ± 0.951
0.678MetMet: 0.678 ± 0.396
0.0MetAsn: 0.0 ± 0.0
0.339MetPro: 0.339 ± 0.198
0.678MetGln: 0.678 ± 0.396
0.678MetArg: 0.678 ± 0.396
3.391MetSer: 3.391 ± 0.769
1.017MetThr: 1.017 ± 0.572
3.052MetVal: 3.052 ± 1.092
0.678MetTrp: 0.678 ± 0.396
1.017MetTyr: 1.017 ± 0.722
0.0MetXaa: 0.0 ± 0.0
Asn
2.713AsnAla: 2.713 ± 1.392
1.356AsnCys: 1.356 ± 1.316
2.713AsnAsp: 2.713 ± 0.884
1.356AsnGlu: 1.356 ± 0.636
1.695AsnPhe: 1.695 ± 0.631
3.052AsnGly: 3.052 ± 1.209
0.678AsnHis: 0.678 ± 0.768
1.695AsnIle: 1.695 ± 0.728
2.374AsnLys: 2.374 ± 0.668
3.391AsnLeu: 3.391 ± 0.831
1.356AsnMet: 1.356 ± 0.717
1.356AsnAsn: 1.356 ± 0.655
2.713AsnPro: 2.713 ± 1.296
2.374AsnGln: 2.374 ± 0.755
2.035AsnArg: 2.035 ± 0.749
4.069AsnSer: 4.069 ± 0.759
1.695AsnThr: 1.695 ± 1.053
4.747AsnVal: 4.747 ± 1.497
1.017AsnTrp: 1.017 ± 0.595
1.695AsnTyr: 1.695 ± 0.884
0.0AsnXaa: 0.0 ± 0.0
Pro
3.73ProAla: 3.73 ± 1.633
1.695ProCys: 1.695 ± 0.991
3.73ProAsp: 3.73 ± 0.805
1.017ProGlu: 1.017 ± 0.595
1.017ProPhe: 1.017 ± 1.044
2.374ProGly: 2.374 ± 0.924
1.017ProHis: 1.017 ± 0.448
2.374ProIle: 2.374 ± 0.56
3.052ProLys: 3.052 ± 1.174
1.017ProLeu: 1.017 ± 0.595
1.356ProMet: 1.356 ± 1.097
2.035ProAsn: 2.035 ± 1.103
2.035ProPro: 2.035 ± 1.444
0.339ProGln: 0.339 ± 0.198
1.695ProArg: 1.695 ± 0.461
1.017ProSer: 1.017 ± 0.595
0.339ProThr: 0.339 ± 0.198
3.052ProVal: 3.052 ± 1.445
0.678ProTrp: 0.678 ± 0.444
1.356ProTyr: 1.356 ± 1.465
0.0ProXaa: 0.0 ± 0.0
Gln
2.374GlnAla: 2.374 ± 0.755
0.339GlnCys: 0.339 ± 0.604
0.339GlnAsp: 0.339 ± 0.543
2.035GlnGlu: 2.035 ± 1.012
2.713GlnPhe: 2.713 ± 0.835
1.695GlnGly: 1.695 ± 0.631
0.339GlnHis: 0.339 ± 0.198
2.035GlnIle: 2.035 ± 0.764
2.035GlnLys: 2.035 ± 0.51
3.73GlnLeu: 3.73 ± 1.032
1.017GlnMet: 1.017 ± 0.595
2.374GlnAsn: 2.374 ± 0.712
0.339GlnPro: 0.339 ± 0.198
1.017GlnGln: 1.017 ± 0.595
2.035GlnArg: 2.035 ± 1.049
2.035GlnSer: 2.035 ± 0.63
1.695GlnThr: 1.695 ± 0.649
2.713GlnVal: 2.713 ± 0.7
0.339GlnTrp: 0.339 ± 0.198
1.695GlnTyr: 1.695 ± 0.985
0.0GlnXaa: 0.0 ± 0.0
Arg
3.052ArgAla: 3.052 ± 1.201
1.017ArgCys: 1.017 ± 0.572
4.408ArgAsp: 4.408 ± 0.968
5.765ArgGlu: 5.765 ± 1.28
3.052ArgPhe: 3.052 ± 0.782
4.069ArgGly: 4.069 ± 1.221
0.678ArgHis: 0.678 ± 0.396
1.356ArgIle: 1.356 ± 0.521
5.086ArgLys: 5.086 ± 1.354
3.73ArgLeu: 3.73 ± 1.083
2.035ArgMet: 2.035 ± 0.76
2.374ArgAsn: 2.374 ± 1.56
3.052ArgPro: 3.052 ± 1.711
1.356ArgGln: 1.356 ± 0.648
5.765ArgArg: 5.765 ± 2.513
3.052ArgSer: 3.052 ± 0.663
2.713ArgThr: 2.713 ± 0.769
4.747ArgVal: 4.747 ± 1.538
0.678ArgTrp: 0.678 ± 0.396
1.017ArgTyr: 1.017 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
5.426SerAla: 5.426 ± 1.45
0.678SerCys: 0.678 ± 0.396
5.086SerAsp: 5.086 ± 1.208
3.391SerGlu: 3.391 ± 1.49
3.052SerPhe: 3.052 ± 0.64
7.121SerGly: 7.121 ± 1.958
2.035SerHis: 2.035 ± 1.219
2.374SerIle: 2.374 ± 1.154
4.747SerLys: 4.747 ± 1.782
7.799SerLeu: 7.799 ± 1.813
0.678SerMet: 0.678 ± 0.48
4.069SerAsn: 4.069 ± 0.945
1.356SerPro: 1.356 ± 0.648
2.035SerGln: 2.035 ± 0.857
5.765SerArg: 5.765 ± 1.052
7.121SerSer: 7.121 ± 2.118
4.408SerThr: 4.408 ± 1.738
5.086SerVal: 5.086 ± 0.851
1.017SerTrp: 1.017 ± 1.368
2.035SerTyr: 2.035 ± 0.657
0.0SerXaa: 0.0 ± 0.0
Thr
3.391ThrAla: 3.391 ± 1.344
1.695ThrCys: 1.695 ± 1.308
3.052ThrAsp: 3.052 ± 0.986
2.374ThrGlu: 2.374 ± 1.082
1.695ThrPhe: 1.695 ± 0.758
2.374ThrGly: 2.374 ± 1.05
0.339ThrHis: 0.339 ± 0.198
2.035ThrIle: 2.035 ± 0.912
4.408ThrLys: 4.408 ± 1.244
3.73ThrLeu: 3.73 ± 0.869
1.695ThrMet: 1.695 ± 0.649
2.035ThrAsn: 2.035 ± 2.229
1.695ThrPro: 1.695 ± 0.758
2.374ThrGln: 2.374 ± 0.782
2.713ThrArg: 2.713 ± 1.553
3.391ThrSer: 3.391 ± 0.494
3.052ThrThr: 3.052 ± 2.756
4.408ThrVal: 4.408 ± 1.44
0.678ThrTrp: 0.678 ± 0.456
3.052ThrTyr: 3.052 ± 0.754
0.0ThrXaa: 0.0 ± 0.0
Val
4.408ValAla: 4.408 ± 1.342
1.695ValCys: 1.695 ± 0.994
6.782ValAsp: 6.782 ± 2.148
5.426ValGlu: 5.426 ± 1.841
4.408ValPhe: 4.408 ± 1.729
4.408ValGly: 4.408 ± 1.599
2.035ValHis: 2.035 ± 0.857
3.391ValIle: 3.391 ± 0.824
5.765ValLys: 5.765 ± 1.677
7.46ValLeu: 7.46 ± 2.088
2.035ValMet: 2.035 ± 0.84
3.73ValAsn: 3.73 ± 1.314
5.086ValPro: 5.086 ± 2.024
1.017ValGln: 1.017 ± 0.524
3.391ValArg: 3.391 ± 1.297
8.477ValSer: 8.477 ± 1.497
5.086ValThr: 5.086 ± 1.515
8.817ValVal: 8.817 ± 1.654
1.017ValTrp: 1.017 ± 0.524
3.052ValTyr: 3.052 ± 0.567
0.0ValXaa: 0.0 ± 0.0
Trp
1.356TrpAla: 1.356 ± 0.913
0.339TrpCys: 0.339 ± 0.198
0.339TrpAsp: 0.339 ± 0.198
1.356TrpGlu: 1.356 ± 0.793
0.339TrpPhe: 0.339 ± 0.599
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.339TrpIle: 0.339 ± 0.543
2.374TrpLys: 2.374 ± 0.767
0.678TrpLeu: 0.678 ± 1.086
0.678TrpMet: 0.678 ± 0.38
0.678TrpAsn: 0.678 ± 0.444
0.0TrpPro: 0.0 ± 0.0
0.339TrpGln: 0.339 ± 0.198
0.678TrpArg: 0.678 ± 0.444
1.695TrpSer: 1.695 ± 1.371
0.0TrpThr: 0.0 ± 0.0
1.356TrpVal: 1.356 ± 0.613
0.339TrpTrp: 0.339 ± 0.198
0.678TrpTyr: 0.678 ± 0.396
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.035TyrAla: 2.035 ± 0.97
0.0TyrCys: 0.0 ± 0.0
4.069TyrAsp: 4.069 ± 1.056
2.374TyrGlu: 2.374 ± 0.978
2.035TyrPhe: 2.035 ± 0.657
1.356TyrGly: 1.356 ± 1.048
1.017TyrHis: 1.017 ± 0.448
3.052TyrIle: 3.052 ± 0.645
2.035TyrLys: 2.035 ± 1.122
3.052TyrLeu: 3.052 ± 1.401
1.017TyrMet: 1.017 ± 0.595
1.017TyrAsn: 1.017 ± 0.694
1.356TyrPro: 1.356 ± 0.617
0.678TyrGln: 0.678 ± 0.396
2.713TyrArg: 2.713 ± 0.662
3.391TyrSer: 3.391 ± 0.701
1.017TyrThr: 1.017 ± 0.732
2.374TyrVal: 2.374 ± 1.574
0.0TyrTrp: 0.0 ± 0.0
4.747TyrTyr: 4.747 ± 2.467
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.339XaaArg: 0.339 ± 0.198
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2950 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski