Amino acid dipepetide frequency for Tobacco streak virus (strain WC) (TSV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.098AlaAla: 6.098 ± 2.057
1.524AlaCys: 1.524 ± 0.747
3.811AlaAsp: 3.811 ± 0.954
4.192AlaGlu: 4.192 ± 0.925
3.049AlaPhe: 3.049 ± 1.47
3.049AlaGly: 3.049 ± 1.219
1.143AlaHis: 1.143 ± 0.616
4.573AlaIle: 4.573 ± 1.189
2.668AlaLys: 2.668 ± 1.668
6.479AlaLeu: 6.479 ± 1.283
2.668AlaMet: 2.668 ± 1.008
2.287AlaAsn: 2.287 ± 0.789
3.43AlaPro: 3.43 ± 0.784
1.143AlaGln: 1.143 ± 0.514
3.43AlaArg: 3.43 ± 1.175
4.192AlaSer: 4.192 ± 0.876
3.049AlaThr: 3.049 ± 0.469
4.192AlaVal: 4.192 ± 1.156
0.762AlaTrp: 0.762 ± 0.514
0.762AlaTyr: 0.762 ± 0.486
0.0AlaXaa: 0.0 ± 0.0
Cys
1.905CysAla: 1.905 ± 0.716
0.762CysCys: 0.762 ± 0.514
2.668CysAsp: 2.668 ± 1.204
1.524CysGlu: 1.524 ± 0.37
0.381CysPhe: 0.381 ± 0.257
1.524CysGly: 1.524 ± 0.747
1.524CysHis: 1.524 ± 1.027
1.905CysIle: 1.905 ± 0.506
0.381CysLys: 0.381 ± 0.257
1.905CysLeu: 1.905 ± 0.506
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.905CysPro: 1.905 ± 1.029
0.0CysGln: 0.0 ± 0.0
1.143CysArg: 1.143 ± 0.461
1.905CysSer: 1.905 ± 0.395
0.762CysThr: 0.762 ± 0.221
2.287CysVal: 2.287 ± 0.481
0.0CysTrp: 0.0 ± 0.0
0.381CysTyr: 0.381 ± 0.301
0.0CysXaa: 0.0 ± 0.0
Asp
4.573AspAla: 4.573 ± 0.491
1.905AspCys: 1.905 ± 0.635
7.241AspAsp: 7.241 ± 2.453
4.954AspGlu: 4.954 ± 0.717
4.192AspPhe: 4.192 ± 0.63
4.573AspGly: 4.573 ± 0.996
1.905AspHis: 1.905 ± 0.422
3.811AspIle: 3.811 ± 1.214
3.43AspLys: 3.43 ± 1.214
5.716AspLeu: 5.716 ± 1.395
0.762AspMet: 0.762 ± 0.493
1.143AspAsn: 1.143 ± 0.372
1.905AspPro: 1.905 ± 0.359
1.524AspGln: 1.524 ± 0.37
3.049AspArg: 3.049 ± 0.668
4.573AspSer: 4.573 ± 1.802
2.668AspThr: 2.668 ± 0.446
8.003AspVal: 8.003 ± 0.97
1.143AspTrp: 1.143 ± 0.461
2.287AspTyr: 2.287 ± 0.922
0.0AspXaa: 0.0 ± 0.0
Glu
4.954GluAla: 4.954 ± 1.377
1.905GluCys: 1.905 ± 0.556
3.049GluAsp: 3.049 ± 0.668
3.43GluGlu: 3.43 ± 0.97
1.143GluPhe: 1.143 ± 0.372
0.762GluGly: 0.762 ± 0.554
1.524GluHis: 1.524 ± 0.747
4.192GluIle: 4.192 ± 1.129
5.716GluLys: 5.716 ± 1.178
6.479GluLeu: 6.479 ± 0.952
1.143GluMet: 1.143 ± 0.547
1.905GluAsn: 1.905 ± 0.544
1.905GluPro: 1.905 ± 0.521
1.905GluGln: 1.905 ± 0.689
3.811GluArg: 3.811 ± 0.831
4.192GluSer: 4.192 ± 1.728
6.098GluThr: 6.098 ± 0.655
4.573GluVal: 4.573 ± 1.516
1.143GluTrp: 1.143 ± 0.394
1.143GluTyr: 1.143 ± 0.461
0.0GluXaa: 0.0 ± 0.0
Phe
2.287PheAla: 2.287 ± 0.35
0.762PheCys: 0.762 ± 0.514
3.811PheAsp: 3.811 ± 0.952
1.905PheGlu: 1.905 ± 0.537
0.762PhePhe: 0.762 ± 0.554
2.287PheGly: 2.287 ± 0.549
0.381PheHis: 0.381 ± 0.257
4.192PheIle: 4.192 ± 1.63
3.049PheLys: 3.049 ± 0.739
4.192PheLeu: 4.192 ± 0.843
0.0PheMet: 0.0 ± 0.0
3.049PheAsn: 3.049 ± 1.194
2.668PhePro: 2.668 ± 0.601
1.143PheGln: 1.143 ± 0.461
3.43PheArg: 3.43 ± 1.421
3.049PheSer: 3.049 ± 0.551
1.524PheThr: 1.524 ± 0.813
3.811PheVal: 3.811 ± 1.218
0.0PheTrp: 0.0 ± 0.0
0.762PheTyr: 0.762 ± 0.221
0.0PheXaa: 0.0 ± 0.0
Gly
1.524GlyAla: 1.524 ± 0.51
1.143GlyCys: 1.143 ± 0.771
3.811GlyAsp: 3.811 ± 0.641
2.287GlyGlu: 2.287 ± 0.646
2.668GlyPhe: 2.668 ± 0.928
1.905GlyGly: 1.905 ± 0.831
0.762GlyHis: 0.762 ± 0.514
3.049GlyIle: 3.049 ± 0.668
4.192GlyLys: 4.192 ± 0.219
4.192GlyLeu: 4.192 ± 1.303
1.143GlyMet: 1.143 ± 0.616
1.143GlyAsn: 1.143 ± 0.372
2.287GlyPro: 2.287 ± 0.365
0.381GlyGln: 0.381 ± 0.257
3.049GlyArg: 3.049 ± 0.727
3.049GlySer: 3.049 ± 0.438
2.668GlyThr: 2.668 ± 1.47
4.192GlyVal: 4.192 ± 0.219
0.762GlyTrp: 0.762 ± 0.367
1.143GlyTyr: 1.143 ± 0.394
0.0GlyXaa: 0.0 ± 0.0
His
2.668HisAla: 2.668 ± 1.069
0.381HisCys: 0.381 ± 0.257
1.905HisAsp: 1.905 ± 0.657
0.762HisGlu: 0.762 ± 0.221
1.905HisPhe: 1.905 ± 0.556
0.0HisGly: 0.0 ± 0.0
1.524HisHis: 1.524 ± 0.747
1.143HisIle: 1.143 ± 0.461
1.905HisLys: 1.905 ± 0.679
1.905HisLeu: 1.905 ± 0.556
1.905HisMet: 1.905 ± 0.413
1.524HisAsn: 1.524 ± 0.51
0.381HisPro: 0.381 ± 0.529
0.381HisGln: 0.381 ± 0.301
1.524HisArg: 1.524 ± 0.601
3.43HisSer: 3.43 ± 1.444
1.905HisThr: 1.905 ± 0.395
1.524HisVal: 1.524 ± 0.601
0.0HisTrp: 0.0 ± 0.0
0.762HisTyr: 0.762 ± 0.221
0.0HisXaa: 0.0 ± 0.0
Ile
4.192IleAla: 4.192 ± 1.225
0.762IleCys: 0.762 ± 0.465
5.335IleAsp: 5.335 ± 0.923
4.954IleGlu: 4.954 ± 1.036
1.905IlePhe: 1.905 ± 0.776
2.668IleGly: 2.668 ± 1.466
2.287IleHis: 2.287 ± 0.746
0.762IleIle: 0.762 ± 0.221
5.716IleLys: 5.716 ± 1.47
4.192IleLeu: 4.192 ± 0.933
0.381IleMet: 0.381 ± 0.259
1.905IleAsn: 1.905 ± 0.635
7.622IlePro: 7.622 ± 2.396
2.287IleGln: 2.287 ± 0.549
1.905IleArg: 1.905 ± 0.657
5.716IleSer: 5.716 ± 0.771
2.287IleThr: 2.287 ± 0.745
2.287IleVal: 2.287 ± 0.703
0.381IleTrp: 0.381 ± 0.403
1.143IleTyr: 1.143 ± 0.771
0.0IleXaa: 0.0 ± 0.0
Lys
4.954LysAla: 4.954 ± 1.403
1.143LysCys: 1.143 ± 0.461
2.668LysAsp: 2.668 ± 1.004
5.716LysGlu: 5.716 ± 1.826
4.192LysPhe: 4.192 ± 1.355
3.811LysGly: 3.811 ± 0.941
0.762LysHis: 0.762 ± 0.602
2.287LysIle: 2.287 ± 0.897
3.049LysLys: 3.049 ± 0.478
6.098LysLeu: 6.098 ± 0.85
0.762LysMet: 0.762 ± 0.406
3.43LysAsn: 3.43 ± 0.262
3.811LysPro: 3.811 ± 1.01
1.905LysGln: 1.905 ± 0.689
1.524LysArg: 1.524 ± 0.586
3.811LysSer: 3.811 ± 0.954
6.479LysThr: 6.479 ± 1.436
3.811LysVal: 3.811 ± 1.275
0.762LysTrp: 0.762 ± 0.221
1.524LysTyr: 1.524 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
6.098LeuAla: 6.098 ± 0.424
2.287LeuCys: 2.287 ± 0.597
4.954LeuAsp: 4.954 ± 0.939
4.192LeuGlu: 4.192 ± 1.241
3.049LeuPhe: 3.049 ± 0.833
5.335LeuGly: 5.335 ± 0.571
1.905LeuHis: 1.905 ± 1.041
4.954LeuIle: 4.954 ± 1.222
6.098LeuLys: 6.098 ± 0.748
8.765LeuLeu: 8.765 ± 0.977
3.049LeuMet: 3.049 ± 1.362
5.716LeuAsn: 5.716 ± 1.007
5.335LeuPro: 5.335 ± 1.051
2.287LeuGln: 2.287 ± 0.662
4.573LeuArg: 4.573 ± 0.468
8.765LeuSer: 8.765 ± 1.392
4.573LeuThr: 4.573 ± 1.73
6.479LeuVal: 6.479 ± 1.419
0.381LeuTrp: 0.381 ± 0.301
2.287LeuTyr: 2.287 ± 1.054
0.0LeuXaa: 0.0 ± 0.0
Met
1.524MetAla: 1.524 ± 0.636
0.0MetCys: 0.0 ± 0.0
1.524MetAsp: 1.524 ± 0.601
2.287MetGlu: 2.287 ± 0.758
1.143MetPhe: 1.143 ± 0.69
0.381MetGly: 0.381 ± 0.403
0.381MetHis: 0.381 ± 0.301
1.905MetIle: 1.905 ± 0.544
1.905MetLys: 1.905 ± 0.544
1.524MetLeu: 1.524 ± 0.601
1.905MetMet: 1.905 ± 0.544
1.143MetAsn: 1.143 ± 0.717
1.143MetPro: 1.143 ± 0.655
0.381MetGln: 0.381 ± 0.369
1.524MetArg: 1.524 ± 0.752
1.524MetSer: 1.524 ± 0.506
3.049MetThr: 3.049 ± 0.881
1.143MetVal: 1.143 ± 0.689
0.381MetTrp: 0.381 ± 0.403
0.381MetTyr: 0.381 ± 0.403
0.0MetXaa: 0.0 ± 0.0
Asn
3.049AsnAla: 3.049 ± 1.809
0.762AsnCys: 0.762 ± 0.735
2.287AsnAsp: 2.287 ± 1.0
1.524AsnGlu: 1.524 ± 1.027
1.905AsnPhe: 1.905 ± 0.43
0.762AsnGly: 0.762 ± 0.514
1.143AsnHis: 1.143 ± 0.547
1.524AsnIle: 1.524 ± 0.521
1.524AsnLys: 1.524 ± 0.37
5.335AsnLeu: 5.335 ± 0.864
1.143AsnMet: 1.143 ± 0.671
1.905AsnAsn: 1.905 ± 1.458
3.43AsnPro: 3.43 ± 1.74
2.287AsnGln: 2.287 ± 1.421
2.668AsnArg: 2.668 ± 0.446
3.049AsnSer: 3.049 ± 0.891
3.811AsnThr: 3.811 ± 2.449
3.811AsnVal: 3.811 ± 2.113
0.0AsnTrp: 0.0 ± 0.0
1.143AsnTyr: 1.143 ± 0.372
0.0AsnXaa: 0.0 ± 0.0
Pro
3.43ProAla: 3.43 ± 1.038
0.381ProCys: 0.381 ± 0.257
2.668ProAsp: 2.668 ± 0.913
4.954ProGlu: 4.954 ± 0.99
2.668ProPhe: 2.668 ± 0.642
2.668ProGly: 2.668 ± 1.212
1.905ProHis: 1.905 ± 0.657
4.954ProIle: 4.954 ± 0.72
4.192ProLys: 4.192 ± 1.558
3.811ProLeu: 3.811 ± 1.232
0.762ProMet: 0.762 ± 0.514
1.905ProAsn: 1.905 ± 1.214
1.905ProPro: 1.905 ± 1.454
0.762ProGln: 0.762 ± 0.735
1.524ProArg: 1.524 ± 0.487
4.573ProSer: 4.573 ± 2.118
3.43ProThr: 3.43 ± 0.576
3.811ProVal: 3.811 ± 1.456
0.0ProTrp: 0.0 ± 0.0
1.524ProTyr: 1.524 ± 0.441
0.0ProXaa: 0.0 ± 0.0
Gln
0.762GlnAla: 0.762 ± 0.486
0.0GlnCys: 0.0 ± 0.0
1.143GlnAsp: 1.143 ± 0.394
0.381GlnGlu: 0.381 ± 0.301
1.524GlnPhe: 1.524 ± 0.367
2.287GlnGly: 2.287 ± 0.477
0.381GlnHis: 0.381 ± 0.403
1.524GlnIle: 1.524 ± 0.367
1.143GlnLys: 1.143 ± 0.372
1.524GlnLeu: 1.524 ± 1.315
0.762GlnMet: 0.762 ± 0.221
0.381GlnAsn: 0.381 ± 0.403
1.143GlnPro: 1.143 ± 0.335
0.381GlnGln: 0.381 ± 0.257
2.287GlnArg: 2.287 ± 0.945
1.905GlnSer: 1.905 ± 1.041
1.143GlnThr: 1.143 ± 0.771
2.668GlnVal: 2.668 ± 0.64
0.0GlnTrp: 0.0 ± 0.0
1.905GlnTyr: 1.905 ± 0.556
0.0GlnXaa: 0.0 ± 0.0
Arg
4.192ArgAla: 4.192 ± 1.244
2.668ArgCys: 2.668 ± 0.519
4.192ArgAsp: 4.192 ± 1.052
2.668ArgGlu: 2.668 ± 0.273
2.287ArgPhe: 2.287 ± 0.922
1.524ArgGly: 1.524 ± 0.37
1.524ArgHis: 1.524 ± 1.027
3.43ArgIle: 3.43 ± 0.793
3.049ArgLys: 3.049 ± 0.668
5.716ArgLeu: 5.716 ± 1.243
1.524ArgMet: 1.524 ± 0.367
3.43ArgAsn: 3.43 ± 1.332
1.143ArgPro: 1.143 ± 0.75
1.143ArgGln: 1.143 ± 0.571
5.335ArgArg: 5.335 ± 2.226
4.573ArgSer: 4.573 ± 1.793
2.668ArgThr: 2.668 ± 0.865
5.335ArgVal: 5.335 ± 0.946
0.0ArgTrp: 0.0 ± 0.0
1.524ArgTyr: 1.524 ± 1.027
0.0ArgXaa: 0.0 ± 0.0
Ser
3.049SerAla: 3.049 ± 0.607
4.192SerCys: 4.192 ± 0.78
6.098SerAsp: 6.098 ± 1.233
4.192SerGlu: 4.192 ± 1.206
2.668SerPhe: 2.668 ± 0.324
5.716SerGly: 5.716 ± 0.866
2.287SerHis: 2.287 ± 0.662
3.43SerIle: 3.43 ± 1.061
4.573SerLys: 4.573 ± 1.368
10.29SerLeu: 10.29 ± 1.569
1.143SerMet: 1.143 ± 0.633
5.716SerAsn: 5.716 ± 0.891
1.143SerPro: 1.143 ± 0.372
1.905SerGln: 1.905 ± 0.556
6.479SerArg: 6.479 ± 2.337
8.003SerSer: 8.003 ± 0.831
1.524SerThr: 1.524 ± 0.521
6.098SerVal: 6.098 ± 0.635
0.762SerTrp: 0.762 ± 0.221
1.905SerTyr: 1.905 ± 0.556
0.0SerXaa: 0.0 ± 0.0
Thr
1.905ThrAla: 1.905 ± 0.395
0.381ThrCys: 0.381 ± 0.529
2.287ThrAsp: 2.287 ± 1.541
3.811ThrGlu: 3.811 ± 1.074
2.668ThrPhe: 2.668 ± 0.68
2.668ThrGly: 2.668 ± 0.961
3.811ThrHis: 3.811 ± 1.315
4.192ThrIle: 4.192 ± 0.4
4.192ThrLys: 4.192 ± 1.011
5.716ThrLeu: 5.716 ± 1.351
2.287ThrMet: 2.287 ± 1.054
1.143ThrAsn: 1.143 ± 0.571
1.905ThrPro: 1.905 ± 0.395
0.762ThrGln: 0.762 ± 0.465
3.811ThrArg: 3.811 ± 0.408
4.573ThrSer: 4.573 ± 1.586
3.811ThrThr: 3.811 ± 0.545
5.335ThrVal: 5.335 ± 1.62
1.143ThrTrp: 1.143 ± 0.333
1.524ThrTyr: 1.524 ± 0.506
0.0ThrXaa: 0.0 ± 0.0
Val
4.192ValAla: 4.192 ± 0.629
1.524ValCys: 1.524 ± 0.441
8.003ValAsp: 8.003 ± 0.749
6.098ValGlu: 6.098 ± 1.492
2.287ValPhe: 2.287 ± 0.455
2.287ValGly: 2.287 ± 0.805
0.762ValHis: 0.762 ± 0.406
4.573ValIle: 4.573 ± 1.473
4.573ValLys: 4.573 ± 1.414
4.192ValLeu: 4.192 ± 1.25
2.287ValMet: 2.287 ± 0.758
4.573ValAsn: 4.573 ± 0.857
7.241ValPro: 7.241 ± 0.782
1.524ValGln: 1.524 ± 0.506
4.954ValArg: 4.954 ± 1.651
7.241ValSer: 7.241 ± 1.733
4.573ValThr: 4.573 ± 1.366
5.716ValVal: 5.716 ± 1.038
1.524ValTrp: 1.524 ± 0.906
2.287ValTyr: 2.287 ± 0.662
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.381TrpCys: 0.381 ± 0.529
0.381TrpAsp: 0.381 ± 0.257
0.381TrpGlu: 0.381 ± 0.257
0.762TrpPhe: 0.762 ± 0.221
0.381TrpGly: 0.381 ± 0.301
0.0TrpHis: 0.0 ± 0.0
0.381TrpIle: 0.381 ± 0.529
0.381TrpLys: 0.381 ± 0.369
0.381TrpLeu: 0.381 ± 0.257
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.381TrpPro: 0.381 ± 0.403
0.0TrpGln: 0.0 ± 0.0
0.381TrpArg: 0.381 ± 0.257
1.524TrpSer: 1.524 ± 0.636
0.762TrpThr: 0.762 ± 0.602
1.905TrpVal: 1.905 ± 1.214
0.0TrpTrp: 0.0 ± 0.0
1.143TrpTyr: 1.143 ± 0.547
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.143TyrAla: 1.143 ± 0.771
0.0TyrCys: 0.0 ± 0.0
1.524TyrAsp: 1.524 ± 0.441
0.762TyrGlu: 0.762 ± 0.514
1.905TyrPhe: 1.905 ± 0.556
0.762TyrGly: 0.762 ± 0.465
1.524TyrHis: 1.524 ± 0.747
2.287TyrIle: 2.287 ± 0.455
0.762TyrLys: 0.762 ± 0.221
2.668TyrLeu: 2.668 ± 0.547
1.143TyrMet: 1.143 ± 0.372
0.762TyrAsn: 0.762 ± 0.514
1.143TyrPro: 1.143 ± 0.333
1.143TyrGln: 1.143 ± 0.671
1.524TyrArg: 1.524 ± 0.906
1.524TyrSer: 1.524 ± 0.441
1.143TyrThr: 1.143 ± 0.372
3.43TyrVal: 3.43 ± 1.117
0.381TyrTrp: 0.381 ± 0.529
0.381TyrTyr: 0.381 ± 0.301
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2625 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski