Amino acid dipeptide frequency for Tobacco virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
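As a rough illustration of what such a calculation might look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the mapping of every non-standard letter to Xaa, and the normalization by the total number of pairs are assumptions made for this example; the page does not state how Proteome-pI computes or normalizes its values.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and the
    counts are divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Any letter outside the 20 standard codes is treated as Xaa.
        residues = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        # zip() pairs each residue with its successor; pairs never span
        # two different proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome.
freqs = dipeptide_per_mille(["ACA", "AC"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

A `Counter` keeps the sketch short; for whole proteomes the same loop applies unchanged, only the input list grows.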

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
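The arithmetic behind the uniform expectation and the per mille unit can be sketched as follows. The helper names are hypothetical, and using exactly 2.5‰ as the cut-off between over- and underrepresented dipeptides is an assumption for this example; the page does not say which threshold drives the red and blue marking. The three sample values are copied from the table.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.0025 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
sample = {"LeuSer": 10.537, "AlaAla": 3.445, "TrpTrp": 0.0}
for pair, value in sample.items():
    print(pair, per_mille_to_fraction(value), classify(value))
```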

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.445 ± 0.921
AlaCys: 1.013 ± 0.443
AlaAsp: 3.647 ± 1.169
AlaGlu: 3.242 ± 0.429
AlaPhe: 2.026 ± 0.924
AlaGly: 3.04 ± 0.829
AlaHis: 0.0 ± 0.0
AlaIle: 3.85 ± 0.843
AlaLys: 3.242 ± 0.984
AlaLeu: 6.687 ± 1.759
AlaMet: 1.013 ± 0.472
AlaAsn: 3.04 ± 0.708
AlaPro: 1.621 ± 0.567
AlaGln: 1.418 ± 0.381
AlaArg: 4.255 ± 0.947
AlaSer: 5.471 ± 0.776
AlaThr: 2.634 ± 0.502
AlaVal: 4.053 ± 1.103
AlaTrp: 0.405 ± 0.209
AlaTyr: 1.824 ± 0.581
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.621 ± 0.779
CysCys: 0.608 ± 0.377
CysAsp: 1.824 ± 0.542
CysGlu: 1.824 ± 0.643
CysPhe: 2.026 ± 0.716
CysGly: 2.229 ± 0.486
CysHis: 0.405 ± 0.251
CysIle: 1.216 ± 0.524
CysLys: 1.216 ± 0.754
CysLeu: 2.229 ± 0.601
CysMet: 0.203 ± 0.126
CysAsn: 0.405 ± 0.219
CysPro: 0.811 ± 0.503
CysGln: 0.203 ± 0.244
CysArg: 1.418 ± 0.762
CysSer: 2.026 ± 0.889
CysThr: 1.621 ± 0.476
CysVal: 1.621 ± 0.415
CysTrp: 0.203 ± 0.126
CysTyr: 0.608 ± 0.266
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.04 ± 0.53
AspCys: 1.013 ± 0.453
AspAsp: 2.634 ± 0.572
AspGlu: 5.268 ± 1.021
AspPhe: 4.255 ± 1.586
AspGly: 2.634 ± 1.148
AspHis: 1.824 ± 0.683
AspIle: 4.458 ± 0.9
AspLys: 2.634 ± 0.824
AspLeu: 7.497 ± 1.033
AspMet: 1.216 ± 0.76
AspAsn: 2.026 ± 0.849
AspPro: 1.216 ± 0.481
AspGln: 1.013 ± 0.379
AspArg: 2.229 ± 0.573
AspSer: 7.092 ± 0.972
AspThr: 2.432 ± 0.681
AspVal: 4.053 ± 0.861
AspTrp: 0.0 ± 0.0
AspTyr: 1.418 ± 0.453
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.647 ± 1.253
GluCys: 1.621 ± 0.763
GluAsp: 2.432 ± 0.812
GluGlu: 3.04 ± 0.951
GluPhe: 3.04 ± 0.936
GluGly: 1.621 ± 0.386
GluHis: 1.013 ± 0.555
GluIle: 3.647 ± 0.868
GluLys: 5.674 ± 0.898
GluLeu: 5.674 ± 1.046
GluMet: 1.824 ± 0.622
GluAsn: 3.04 ± 1.061
GluPro: 2.229 ± 0.785
GluGln: 1.216 ± 0.436
GluArg: 4.661 ± 0.845
GluSer: 4.863 ± 0.744
GluThr: 3.445 ± 1.018
GluVal: 5.268 ± 1.151
GluTrp: 0.811 ± 0.414
GluTyr: 3.445 ± 0.612
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.418 ± 0.487
PheCys: 1.216 ± 0.499
PheAsp: 3.242 ± 0.799
PheGlu: 5.471 ± 1.263
PhePhe: 4.053 ± 0.637
PheGly: 3.242 ± 0.654
PheHis: 1.418 ± 0.71
PheIle: 3.445 ± 1.374
PheLys: 4.053 ± 1.102
PheLeu: 4.458 ± 1.15
PheMet: 0.811 ± 0.411
PheAsn: 1.418 ± 0.524
PhePro: 1.621 ± 0.617
PheGln: 1.013 ± 0.371
PheArg: 2.432 ± 0.712
PheSer: 6.89 ± 1.348
PheThr: 3.647 ± 1.002
PheVal: 5.268 ± 0.849
PheTrp: 0.203 ± 0.404
PheTyr: 1.621 ± 0.354
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.837 ± 0.821
GlyCys: 1.824 ± 0.571
GlyAsp: 3.242 ± 0.726
GlyGlu: 5.066 ± 0.828
GlyPhe: 2.634 ± 0.575
GlyGly: 4.458 ± 1.463
GlyHis: 0.811 ± 0.414
GlyIle: 1.418 ± 0.332
GlyLys: 3.242 ± 1.153
GlyLeu: 3.85 ± 0.81
GlyMet: 0.608 ± 0.543
GlyAsn: 1.013 ± 0.284
GlyPro: 0.405 ± 0.366
GlyGln: 0.203 ± 0.126
GlyArg: 2.432 ± 0.662
GlySer: 4.661 ± 0.802
GlyThr: 2.837 ± 1.676
GlyVal: 4.661 ± 1.425
GlyTrp: 0.405 ± 0.397
GlyTyr: 2.026 ± 0.662
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.216 ± 0.406
HisCys: 1.621 ± 0.41
HisAsp: 1.824 ± 0.534
HisGlu: 1.216 ± 0.504
HisPhe: 0.608 ± 0.317
HisGly: 0.811 ± 0.354
HisHis: 1.216 ± 0.524
HisIle: 1.418 ± 0.298
HisLys: 0.405 ± 0.199
HisLeu: 1.013 ± 0.47
HisMet: 0.203 ± 0.404
HisAsn: 1.013 ± 0.471
HisPro: 0.811 ± 0.38
HisGln: 0.811 ± 0.411
HisArg: 1.216 ± 0.451
HisSer: 2.432 ± 0.728
HisThr: 1.216 ± 0.786
HisVal: 1.621 ± 0.553
HisTrp: 0.0 ± 0.0
HisTyr: 1.013 ± 0.744
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.242 ± 0.522
IleCys: 0.811 ± 0.339
IleAsp: 2.634 ± 0.735
IleGlu: 2.634 ± 0.848
IlePhe: 1.621 ± 0.874
IleGly: 2.026 ± 0.32
IleHis: 1.013 ± 0.374
IleIle: 2.837 ± 0.755
IleLys: 4.255 ± 0.435
IleLeu: 4.255 ± 1.396
IleMet: 1.418 ± 0.642
IleAsn: 4.053 ± 0.909
IlePro: 3.445 ± 0.808
IleGln: 1.013 ± 0.521
IleArg: 3.647 ± 0.892
IleSer: 6.079 ± 1.679
IleThr: 2.634 ± 0.58
IleVal: 4.053 ± 0.98
IleTrp: 0.203 ± 0.126
IleTyr: 1.824 ± 0.646
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.04 ± 1.014
LysCys: 2.837 ± 0.778
LysAsp: 3.445 ± 0.472
LysGlu: 2.634 ± 0.613
LysPhe: 5.066 ± 0.61
LysGly: 3.445 ± 1.108
LysHis: 1.216 ± 0.725
LysIle: 3.647 ± 0.776
LysLys: 3.445 ± 0.634
LysLeu: 8.713 ± 2.055
LysMet: 1.013 ± 0.355
LysAsn: 2.432 ± 0.67
LysPro: 1.824 ± 0.401
LysGln: 1.824 ± 0.589
LysArg: 4.053 ± 0.953
LysSer: 6.282 ± 1.515
LysThr: 2.634 ± 0.988
LysVal: 4.458 ± 0.921
LysTrp: 0.203 ± 0.239
LysTyr: 1.621 ± 0.661
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.471 ± 0.999
LeuCys: 1.418 ± 0.314
LeuAsp: 5.876 ± 0.972
LeuGlu: 4.053 ± 0.395
LeuPhe: 4.863 ± 0.79
LeuGly: 5.876 ± 0.859
LeuHis: 2.432 ± 0.615
LeuIle: 4.255 ± 0.874
LeuLys: 6.282 ± 1.138
LeuLeu: 8.713 ± 1.191
LeuMet: 2.837 ± 1.334
LeuAsn: 5.066 ± 0.568
LeuPro: 3.647 ± 0.806
LeuGln: 2.432 ± 0.755
LeuArg: 5.066 ± 1.529
LeuSer: 10.537 ± 1.267
LeuThr: 6.484 ± 0.879
LeuVal: 5.066 ± 0.664
LeuTrp: 0.608 ± 0.303
LeuTyr: 4.458 ± 0.627
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.216 ± 0.489
MetCys: 0.405 ± 0.219
MetAsp: 1.621 ± 0.497
MetGlu: 1.621 ± 0.393
MetPhe: 1.621 ± 0.758
MetGly: 0.0 ± 0.0
MetHis: 0.608 ± 0.226
MetIle: 1.216 ± 0.497
MetLys: 0.608 ± 0.352
MetLeu: 2.634 ± 0.845
MetMet: 0.811 ± 0.395
MetAsn: 1.621 ± 0.562
MetPro: 0.405 ± 0.319
MetGln: 0.608 ± 0.501
MetArg: 0.811 ± 0.456
MetSer: 0.405 ± 0.219
MetThr: 0.811 ± 0.503
MetVal: 2.026 ± 0.532
MetTrp: 0.0 ± 0.0
MetTyr: 1.216 ± 0.458
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.229 ± 1.166
AsnCys: 0.608 ± 0.368
AsnAsp: 2.026 ± 0.391
AsnGlu: 2.432 ± 0.561
AsnPhe: 3.647 ± 0.673
AsnGly: 1.216 ± 0.691
AsnHis: 1.418 ± 0.559
AsnIle: 3.445 ± 0.912
AsnLys: 2.229 ± 0.53
AsnLeu: 4.863 ± 1.074
AsnMet: 0.203 ± 0.239
AsnAsn: 3.647 ± 1.177
AsnPro: 0.608 ± 0.328
AsnGln: 1.621 ± 0.395
AsnArg: 3.445 ± 0.515
AsnSer: 5.674 ± 0.984
AsnThr: 2.837 ± 0.687
AsnVal: 3.242 ± 0.672
AsnTrp: 0.608 ± 0.441
AsnTyr: 1.824 ± 0.893
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.432 ± 0.622
ProCys: 0.608 ± 0.248
ProAsp: 1.418 ± 0.577
ProGlu: 1.621 ± 0.377
ProPhe: 2.026 ± 0.602
ProGly: 1.216 ± 0.356
ProHis: 0.405 ± 0.251
ProIle: 2.026 ± 0.616
ProLys: 2.229 ± 0.569
ProLeu: 3.445 ± 0.685
ProMet: 1.013 ± 0.628
ProAsn: 1.621 ± 1.269
ProPro: 1.824 ± 0.643
ProGln: 1.013 ± 0.681
ProArg: 2.026 ± 0.842
ProSer: 3.04 ± 0.58
ProThr: 1.621 ± 0.603
ProVal: 2.229 ± 0.678
ProTrp: 0.203 ± 0.126
ProTyr: 1.824 ± 0.418
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.026 ± 0.736
GlnCys: 0.405 ± 0.328
GlnAsp: 1.013 ± 0.566
GlnGlu: 1.621 ± 0.447
GlnPhe: 1.418 ± 0.691
GlnGly: 1.013 ± 0.373
GlnHis: 0.405 ± 0.366
GlnIle: 0.811 ± 0.322
GlnLys: 1.418 ± 0.297
GlnLeu: 1.418 ± 0.644
GlnMet: 0.0 ± 0.0
GlnAsn: 0.608 ± 0.273
GlnPro: 0.811 ± 0.503
GlnGln: 1.013 ± 0.411
GlnArg: 1.621 ± 0.423
GlnSer: 2.634 ± 0.415
GlnThr: 1.013 ± 0.471
GlnVal: 1.824 ± 0.635
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.013 ± 0.704
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.04 ± 1.27
ArgCys: 2.432 ± 0.784
ArgAsp: 4.255 ± 0.824
ArgGlu: 5.268 ± 0.753
ArgPhe: 3.04 ± 0.707
ArgGly: 3.04 ± 0.733
ArgHis: 0.811 ± 0.307
ArgIle: 3.04 ± 0.92
ArgLys: 3.85 ± 0.822
ArgLeu: 3.85 ± 1.236
ArgMet: 0.811 ± 0.321
ArgAsn: 2.026 ± 0.61
ArgPro: 1.621 ± 0.844
ArgGln: 1.216 ± 0.532
ArgArg: 4.863 ± 1.395
ArgSer: 5.471 ± 1.289
ArgThr: 3.04 ± 0.784
ArgVal: 5.674 ± 1.528
ArgTrp: 0.405 ± 0.341
ArgTyr: 1.418 ± 0.575
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.282 ± 1.125
SerCys: 2.026 ± 0.624
SerAsp: 5.066 ± 1.078
SerGlu: 5.674 ± 0.985
SerPhe: 6.282 ± 0.727
SerGly: 5.268 ± 0.646
SerHis: 2.634 ± 0.607
SerIle: 5.674 ± 1.109
SerLys: 6.079 ± 0.501
SerLeu: 7.903 ± 1.374
SerMet: 2.634 ± 0.498
SerAsn: 5.471 ± 2.376
SerPro: 3.85 ± 1.192
SerGln: 3.04 ± 0.722
SerArg: 5.268 ± 1.72
SerSer: 9.726 ± 2.128
SerThr: 5.471 ± 1.522
SerVal: 6.89 ± 0.927
SerTrp: 0.203 ± 0.239
SerTyr: 4.661 ± 1.336
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.432 ± 0.436
ThrCys: 1.216 ± 0.402
ThrAsp: 2.634 ± 0.996
ThrGlu: 2.026 ± 0.597
ThrPhe: 3.445 ± 0.978
ThrGly: 1.824 ± 0.54
ThrHis: 1.216 ± 0.398
ThrIle: 1.418 ± 0.362
ThrLys: 4.053 ± 0.812
ThrLeu: 7.092 ± 0.784
ThrMet: 1.013 ± 0.656
ThrAsn: 1.824 ± 0.967
ThrPro: 3.04 ± 0.889
ThrGln: 1.216 ± 0.433
ThrArg: 3.647 ± 0.569
ThrSer: 6.282 ± 1.534
ThrThr: 3.242 ± 1.222
ThrVal: 4.255 ± 0.625
ThrTrp: 0.405 ± 0.209
ThrTyr: 1.824 ± 0.534
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.066 ± 1.157
ValCys: 2.026 ± 0.631
ValAsp: 6.079 ± 0.837
ValGlu: 5.674 ± 1.601
ValPhe: 3.04 ± 0.763
ValGly: 3.647 ± 1.011
ValHis: 1.824 ± 0.459
ValIle: 4.053 ± 0.834
ValLys: 5.674 ± 0.738
ValLeu: 5.471 ± 0.986
ValMet: 1.013 ± 0.51
ValAsn: 4.863 ± 0.456
ValPro: 3.242 ± 0.624
ValGln: 0.811 ± 0.503
ValArg: 4.863 ± 0.841
ValSer: 5.268 ± 1.251
ValThr: 3.85 ± 0.814
ValVal: 8.105 ± 1.169
ValTrp: 0.405 ± 0.404
ValTyr: 2.837 ± 0.763
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.203 ± 0.239
TrpCys: 0.0 ± 0.0
TrpAsp: 0.203 ± 0.126
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.203 ± 0.126
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.608 ± 0.273
TrpLys: 0.608 ± 0.563
TrpLeu: 0.608 ± 0.266
TrpMet: 0.811 ± 0.768
TrpAsn: 0.405 ± 0.366
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.203 ± 0.126
TrpSer: 0.608 ± 0.266
TrpThr: 0.203 ± 0.126
TrpVal: 0.608 ± 0.597
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.026 ± 0.989
TyrCys: 0.608 ± 0.266
TyrAsp: 2.837 ± 0.813
TyrGlu: 1.824 ± 0.929
TyrPhe: 2.026 ± 0.636
TyrGly: 2.026 ± 0.51
TyrHis: 1.013 ± 0.411
TyrIle: 1.216 ± 0.604
TyrLys: 2.634 ± 0.69
TyrLeu: 5.066 ± 1.844
TyrMet: 0.608 ± 0.377
TyrAsn: 2.229 ± 0.582
TyrPro: 0.811 ± 0.307
TyrGln: 0.405 ± 0.219
TyrArg: 1.013 ± 0.459
TyrSer: 4.661 ± 0.826
TyrThr: 2.634 ± 0.787
TyrVal: 2.837 ± 0.716
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.432 ± 0.752
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4936 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
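A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. Only the number of rounds (100) and the resampling unit (proteins) come from the note above. Using the standard deviation as the reported error, the function names, the one-letter dipeptide codes, and the toy sequences are all assumptions made for this example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes, e.g. 'LS')."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside each protein only.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
toy = ["MLSLSA", "MKLSV", "MAAAL", "MLLSK"]
print(pair_per_mille(toy, "LS"), "±", bootstrap_error(toy, "LS"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so pairs are never formed across protein boundaries in a resample.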

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski