Amino acid dipepetide frequency for Polygonum ringspot tospovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.383AlaAla: 2.383 ± 2.297
1.589AlaCys: 1.589 ± 0.416
1.986AlaAsp: 1.986 ± 0.647
3.376AlaGlu: 3.376 ± 0.234
1.787AlaPhe: 1.787 ± 0.409
3.376AlaGly: 3.376 ± 2.352
1.192AlaHis: 1.192 ± 0.742
4.171AlaIle: 4.171 ± 0.877
2.582AlaLys: 2.582 ± 0.671
4.369AlaLeu: 4.369 ± 0.899
0.794AlaMet: 0.794 ± 0.25
2.582AlaAsn: 2.582 ± 0.47
1.589AlaPro: 1.589 ± 0.281
0.596AlaGln: 0.596 ± 0.327
0.794AlaArg: 0.794 ± 1.238
4.171AlaSer: 4.171 ± 2.006
2.582AlaThr: 2.582 ± 0.802
4.171AlaVal: 4.171 ± 1.619
0.199AlaTrp: 0.199 ± 0.109
0.993AlaTyr: 0.993 ± 0.742
0.0AlaXaa: 0.0 ± 0.0
Cys
0.993CysAla: 0.993 ± 0.332
0.397CysCys: 0.397 ± 0.218
1.192CysAsp: 1.192 ± 0.559
1.589CysGlu: 1.589 ± 0.626
1.589CysPhe: 1.589 ± 0.488
0.794CysGly: 0.794 ± 0.788
0.199CysHis: 0.199 ± 0.109
2.781CysIle: 2.781 ± 0.692
1.589CysLys: 1.589 ± 0.387
2.185CysLeu: 2.185 ± 0.703
0.397CysMet: 0.397 ± 0.289
0.794CysAsn: 0.794 ± 0.398
0.596CysPro: 0.596 ± 0.327
0.596CysGln: 0.596 ± 0.327
0.993CysArg: 0.993 ± 0.327
1.986CysSer: 1.986 ± 0.99
1.192CysThr: 1.192 ± 0.523
1.589CysVal: 1.589 ± 1.318
0.397CysTrp: 0.397 ± 0.282
0.794CysTyr: 0.794 ± 0.398
0.0CysXaa: 0.0 ± 0.0
Asp
1.39AspAla: 1.39 ± 1.157
2.185AspCys: 2.185 ± 0.73
3.774AspAsp: 3.774 ± 1.419
2.979AspGlu: 2.979 ± 0.748
3.575AspPhe: 3.575 ± 0.921
3.178AspGly: 3.178 ± 0.652
0.794AspHis: 0.794 ± 0.398
4.965AspIle: 4.965 ± 0.975
4.369AspLys: 4.369 ± 0.587
6.554AspLeu: 6.554 ± 1.559
3.575AspMet: 3.575 ± 1.056
2.582AspAsn: 2.582 ± 0.751
2.185AspPro: 2.185 ± 0.673
1.787AspGln: 1.787 ± 0.552
2.582AspArg: 2.582 ± 0.501
4.568AspSer: 4.568 ± 0.824
3.575AspThr: 3.575 ± 1.012
3.972AspVal: 3.972 ± 0.29
0.794AspTrp: 0.794 ± 0.538
2.781AspTyr: 2.781 ± 0.558
0.0AspXaa: 0.0 ± 0.0
Glu
2.582GluAla: 2.582 ± 1.586
1.589GluCys: 1.589 ± 0.528
4.171GluAsp: 4.171 ± 0.607
4.369GluGlu: 4.369 ± 0.788
4.369GluPhe: 4.369 ± 1.095
2.383GluGly: 2.383 ± 0.356
1.192GluHis: 1.192 ± 0.491
4.965GluIle: 4.965 ± 1.058
5.561GluLys: 5.561 ± 1.126
5.362GluLeu: 5.362 ± 0.68
2.979GluMet: 2.979 ± 0.774
6.157GluAsn: 6.157 ± 0.82
1.192GluPro: 1.192 ± 0.378
1.39GluGln: 1.39 ± 1.117
2.185GluArg: 2.185 ± 0.766
2.781GluSer: 2.781 ± 0.892
3.575GluThr: 3.575 ± 0.426
3.376GluVal: 3.376 ± 0.936
0.397GluTrp: 0.397 ± 0.159
2.781GluTyr: 2.781 ± 0.845
0.0GluXaa: 0.0 ± 0.0
Phe
2.383PheAla: 2.383 ± 0.752
1.192PheCys: 1.192 ± 0.354
4.369PheAsp: 4.369 ± 1.024
2.185PheGlu: 2.185 ± 0.553
2.979PhePhe: 2.979 ± 1.196
1.986PheGly: 1.986 ± 0.505
0.794PheHis: 0.794 ± 0.357
2.781PheIle: 2.781 ± 0.728
4.171PheLys: 4.171 ± 0.748
5.164PheLeu: 5.164 ± 1.223
2.582PheMet: 2.582 ± 0.684
3.178PheAsn: 3.178 ± 1.421
1.986PhePro: 1.986 ± 0.363
1.589PheGln: 1.589 ± 0.996
1.787PheArg: 1.787 ± 0.335
5.958PheSer: 5.958 ± 0.959
2.979PheThr: 2.979 ± 1.047
2.781PheVal: 2.781 ± 0.626
0.199PheTrp: 0.199 ± 0.197
1.192PheTyr: 1.192 ± 0.654
0.0PheXaa: 0.0 ± 0.0
Gly
1.787GlyAla: 1.787 ± 0.841
1.39GlyCys: 1.39 ± 0.874
2.979GlyAsp: 2.979 ± 0.588
2.781GlyGlu: 2.781 ± 0.834
3.376GlyPhe: 3.376 ± 1.607
1.589GlyGly: 1.589 ± 0.785
0.993GlyHis: 0.993 ± 0.595
2.979GlyIle: 2.979 ± 0.638
3.972GlyLys: 3.972 ± 0.926
3.774GlyLeu: 3.774 ± 1.04
0.993GlyMet: 0.993 ± 0.327
3.178GlyAsn: 3.178 ± 0.57
0.993GlyPro: 0.993 ± 0.391
1.192GlyGln: 1.192 ± 0.448
0.993GlyArg: 0.993 ± 0.327
3.972GlySer: 3.972 ± 0.349
1.986GlyThr: 1.986 ± 0.923
1.787GlyVal: 1.787 ± 0.429
0.199GlyTrp: 0.199 ± 0.109
1.192GlyTyr: 1.192 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
0.596HisAla: 0.596 ± 0.483
0.199HisCys: 0.199 ± 0.197
1.39HisAsp: 1.39 ± 0.446
0.993HisGlu: 0.993 ± 0.529
1.986HisPhe: 1.986 ± 0.505
0.794HisGly: 0.794 ± 0.319
0.397HisHis: 0.397 ± 0.36
0.993HisIle: 0.993 ± 0.43
1.192HisLys: 1.192 ± 0.654
0.794HisLeu: 0.794 ± 0.578
0.397HisMet: 0.397 ± 0.218
1.589HisAsn: 1.589 ± 0.485
0.794HisPro: 0.794 ± 0.319
0.199HisGln: 0.199 ± 0.109
0.596HisArg: 0.596 ± 0.281
1.589HisSer: 1.589 ± 0.433
1.192HisThr: 1.192 ± 0.281
0.993HisVal: 0.993 ± 0.332
0.199HisTrp: 0.199 ± 0.109
0.397HisTyr: 0.397 ± 0.394
0.0HisXaa: 0.0 ± 0.0
Ile
4.369IleAla: 4.369 ± 1.374
1.787IleCys: 1.787 ± 0.387
4.369IleAsp: 4.369 ± 1.029
4.965IleGlu: 4.965 ± 0.634
2.979IlePhe: 2.979 ± 0.638
2.582IleGly: 2.582 ± 0.452
0.993IleHis: 0.993 ± 0.357
4.171IleIle: 4.171 ± 0.652
9.533IleLys: 9.533 ± 0.556
4.568IleLeu: 4.568 ± 0.603
0.993IleMet: 0.993 ± 0.385
5.561IleAsn: 5.561 ± 0.958
2.383IlePro: 2.383 ± 1.034
1.986IleGln: 1.986 ± 0.879
1.589IleArg: 1.589 ± 0.283
8.143IleSer: 8.143 ± 1.312
3.774IleThr: 3.774 ± 1.036
2.979IleVal: 2.979 ± 1.147
0.199IleTrp: 0.199 ± 0.109
3.774IleTyr: 3.774 ± 0.432
0.0IleXaa: 0.0 ± 0.0
Lys
4.965LysAla: 4.965 ± 1.647
0.794LysCys: 0.794 ± 0.264
6.356LysAsp: 6.356 ± 1.421
7.349LysGlu: 7.349 ± 1.012
4.568LysPhe: 4.568 ± 1.676
3.178LysGly: 3.178 ± 0.925
1.39LysHis: 1.39 ± 0.561
6.554LysIle: 6.554 ± 1.2
5.76LysLys: 5.76 ± 0.661
7.547LysLeu: 7.547 ± 1.324
3.376LysMet: 3.376 ± 1.635
4.965LysAsn: 4.965 ± 0.842
2.383LysPro: 2.383 ± 1.312
2.185LysGln: 2.185 ± 0.582
3.376LysArg: 3.376 ± 0.512
6.157LysSer: 6.157 ± 0.824
8.342LysThr: 8.342 ± 1.992
6.157LysVal: 6.157 ± 1.387
0.397LysTrp: 0.397 ± 0.159
2.383LysTyr: 2.383 ± 0.759
0.0LysXaa: 0.0 ± 0.0
Leu
5.958LeuAla: 5.958 ± 0.797
1.39LeuCys: 1.39 ± 0.907
3.178LeuAsp: 3.178 ± 1.293
7.349LeuGlu: 7.349 ± 0.96
2.979LeuPhe: 2.979 ± 0.944
3.774LeuGly: 3.774 ± 0.635
1.589LeuHis: 1.589 ± 0.666
7.349LeuIle: 7.349 ± 0.636
9.136LeuLys: 9.136 ± 2.569
6.951LeuLeu: 6.951 ± 1.638
4.767LeuMet: 4.767 ± 1.389
5.76LeuAsn: 5.76 ± 1.714
1.986LeuPro: 1.986 ± 1.188
2.383LeuGln: 2.383 ± 0.891
2.582LeuArg: 2.582 ± 0.392
10.526LeuSer: 10.526 ± 0.95
5.76LeuThr: 5.76 ± 0.893
3.376LeuVal: 3.376 ± 0.88
0.596LeuTrp: 0.596 ± 0.341
2.781LeuTyr: 2.781 ± 0.975
0.0LeuXaa: 0.0 ± 0.0
Met
1.192MetAla: 1.192 ± 0.862
0.397MetCys: 0.397 ± 0.159
1.192MetAsp: 1.192 ± 0.27
2.185MetGlu: 2.185 ± 0.655
0.794MetPhe: 0.794 ± 0.474
1.192MetGly: 1.192 ± 0.491
0.596MetHis: 0.596 ± 0.731
2.383MetIle: 2.383 ± 0.699
2.781MetLys: 2.781 ± 1.059
3.774MetLeu: 3.774 ± 1.262
1.787MetMet: 1.787 ± 0.699
2.185MetAsn: 2.185 ± 0.813
0.993MetPro: 0.993 ± 0.332
0.993MetGln: 0.993 ± 0.266
1.787MetArg: 1.787 ± 0.981
4.171MetSer: 4.171 ± 0.753
2.185MetThr: 2.185 ± 0.47
2.582MetVal: 2.582 ± 0.6
0.199MetTrp: 0.199 ± 0.109
0.794MetTyr: 0.794 ± 0.534
0.0MetXaa: 0.0 ± 0.0
Asn
3.178AsnAla: 3.178 ± 1.1
1.39AsnCys: 1.39 ± 0.274
4.767AsnAsp: 4.767 ± 0.581
4.369AsnGlu: 4.369 ± 1.489
4.965AsnPhe: 4.965 ± 0.797
1.589AsnGly: 1.589 ± 0.7
0.993AsnHis: 0.993 ± 0.638
3.972AsnIle: 3.972 ± 0.759
3.774AsnLys: 3.774 ± 0.707
6.356AsnLeu: 6.356 ± 1.633
1.39AsnMet: 1.39 ± 0.322
1.39AsnAsn: 1.39 ± 0.705
1.192AsnPro: 1.192 ± 0.392
2.383AsnGln: 2.383 ± 0.848
2.781AsnArg: 2.781 ± 0.953
4.965AsnSer: 4.965 ± 1.945
2.781AsnThr: 2.781 ± 1.102
3.376AsnVal: 3.376 ± 0.747
0.794AsnTrp: 0.794 ± 0.571
4.171AsnTyr: 4.171 ± 1.187
0.0AsnXaa: 0.0 ± 0.0
Pro
1.39ProAla: 1.39 ± 0.614
0.397ProCys: 0.397 ± 0.282
1.192ProAsp: 1.192 ± 0.516
0.993ProGlu: 0.993 ± 0.357
1.39ProPhe: 1.39 ± 0.446
1.39ProGly: 1.39 ± 1.04
0.397ProHis: 0.397 ± 0.218
2.582ProIle: 2.582 ± 0.452
2.582ProLys: 2.582 ± 0.665
2.582ProLeu: 2.582 ± 1.064
0.794ProMet: 0.794 ± 0.317
1.787ProAsn: 1.787 ± 0.44
0.993ProPro: 0.993 ± 0.56
0.596ProGln: 0.596 ± 0.189
1.192ProArg: 1.192 ± 1.007
3.178ProSer: 3.178 ± 0.795
1.787ProThr: 1.787 ± 0.753
2.383ProVal: 2.383 ± 0.974
0.0ProTrp: 0.0 ± 0.0
0.993ProTyr: 0.993 ± 0.357
0.0ProXaa: 0.0 ± 0.0
Gln
0.993GlnAla: 0.993 ± 0.332
1.39GlnCys: 1.39 ± 0.485
1.192GlnAsp: 1.192 ± 0.384
0.993GlnGlu: 0.993 ± 0.357
0.993GlnPhe: 0.993 ± 0.818
1.39GlnGly: 1.39 ± 1.247
0.199GlnHis: 0.199 ± 0.109
1.986GlnIle: 1.986 ± 0.344
3.376GlnLys: 3.376 ± 0.253
1.986GlnLeu: 1.986 ± 0.348
0.993GlnMet: 0.993 ± 0.513
1.39GlnAsn: 1.39 ± 0.535
0.794GlnPro: 0.794 ± 0.232
0.397GlnGln: 0.397 ± 0.467
0.794GlnArg: 0.794 ± 0.436
3.575GlnSer: 3.575 ± 1.154
0.993GlnThr: 0.993 ± 0.332
1.192GlnVal: 1.192 ± 0.378
0.0GlnTrp: 0.0 ± 0.0
0.993GlnTyr: 0.993 ± 0.364
0.0GlnXaa: 0.0 ± 0.0
Arg
1.39ArgAla: 1.39 ± 0.835
0.596ArgCys: 0.596 ± 0.463
2.781ArgAsp: 2.781 ± 0.666
1.986ArgGlu: 1.986 ± 0.435
0.596ArgPhe: 0.596 ± 0.281
1.589ArgGly: 1.589 ± 0.394
1.192ArgHis: 1.192 ± 0.378
2.582ArgIle: 2.582 ± 0.859
3.178ArgLys: 3.178 ± 0.613
3.972ArgLeu: 3.972 ± 1.214
0.596ArgMet: 0.596 ± 0.327
2.979ArgAsn: 2.979 ± 0.739
0.993ArgPro: 0.993 ± 0.756
0.993ArgGln: 0.993 ± 0.332
1.192ArgArg: 1.192 ± 0.705
1.986ArgSer: 1.986 ± 0.41
3.178ArgThr: 3.178 ± 1.427
1.986ArgVal: 1.986 ± 0.558
0.596ArgTrp: 0.596 ± 0.327
1.589ArgTyr: 1.589 ± 0.516
0.0ArgXaa: 0.0 ± 0.0
Ser
3.774SerAla: 3.774 ± 0.777
1.192SerCys: 1.192 ± 0.304
5.362SerAsp: 5.362 ± 0.683
6.356SerGlu: 6.356 ± 1.809
3.178SerPhe: 3.178 ± 0.57
3.774SerGly: 3.774 ± 0.79
1.986SerHis: 1.986 ± 0.714
5.561SerIle: 5.561 ± 0.79
10.129SerLys: 10.129 ± 1.409
9.732SerLeu: 9.732 ± 1.349
2.582SerMet: 2.582 ± 0.882
5.362SerAsn: 5.362 ± 1.249
2.185SerPro: 2.185 ± 0.473
2.582SerGln: 2.582 ± 0.661
5.164SerArg: 5.164 ± 0.766
9.93SerSer: 9.93 ± 1.591
4.568SerThr: 4.568 ± 1.016
5.362SerVal: 5.362 ± 0.82
0.794SerTrp: 0.794 ± 0.387
4.171SerTyr: 4.171 ± 0.842
0.0SerXaa: 0.0 ± 0.0
Thr
2.185ThrAla: 2.185 ± 1.363
1.787ThrCys: 1.787 ± 0.569
4.171ThrAsp: 4.171 ± 0.482
2.582ThrGlu: 2.582 ± 0.661
4.369ThrPhe: 4.369 ± 1.122
3.774ThrGly: 3.774 ± 0.699
0.993ThrHis: 0.993 ± 0.266
4.767ThrIle: 4.767 ± 0.99
3.575ThrLys: 3.575 ± 1.028
5.362ThrLeu: 5.362 ± 0.683
0.993ThrMet: 0.993 ± 0.545
3.178ThrAsn: 3.178 ± 1.263
1.192ThrPro: 1.192 ± 0.438
1.39ThrGln: 1.39 ± 0.661
1.787ThrArg: 1.787 ± 0.981
6.157ThrSer: 6.157 ± 0.479
4.965ThrThr: 4.965 ± 0.881
4.171ThrVal: 4.171 ± 0.855
0.596ThrTrp: 0.596 ± 0.279
1.787ThrTyr: 1.787 ± 0.474
0.0ThrXaa: 0.0 ± 0.0
Val
1.986ValAla: 1.986 ± 0.551
1.39ValCys: 1.39 ± 0.485
4.568ValAsp: 4.568 ± 1.146
4.369ValGlu: 4.369 ± 1.362
3.178ValPhe: 3.178 ± 0.593
1.986ValGly: 1.986 ± 0.75
0.993ValHis: 0.993 ± 0.385
2.383ValIle: 2.383 ± 0.523
5.362ValLys: 5.362 ± 1.115
5.362ValLeu: 5.362 ± 1.045
1.986ValMet: 1.986 ± 0.595
3.178ValAsn: 3.178 ± 0.942
3.376ValPro: 3.376 ± 1.245
1.39ValGln: 1.39 ± 0.274
1.986ValArg: 1.986 ± 0.796
4.767ValSer: 4.767 ± 1.158
3.178ValThr: 3.178 ± 0.721
4.767ValVal: 4.767 ± 0.275
0.596ValTrp: 0.596 ± 0.279
3.178ValTyr: 3.178 ± 1.025
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.199TrpCys: 0.199 ± 0.109
0.596TrpAsp: 0.596 ± 0.279
0.199TrpGlu: 0.199 ± 0.109
0.397TrpPhe: 0.397 ± 0.282
0.199TrpGly: 0.199 ± 0.197
0.0TrpHis: 0.0 ± 0.0
0.596TrpIle: 0.596 ± 0.341
1.192TrpLys: 1.192 ± 0.252
0.596TrpLeu: 0.596 ± 0.189
0.397TrpMet: 0.397 ± 0.218
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.397TrpArg: 0.397 ± 0.36
1.589TrpSer: 1.589 ± 0.488
0.199TrpThr: 0.199 ± 0.438
0.794TrpVal: 0.794 ± 0.685
0.199TrpTrp: 0.199 ± 0.197
0.199TrpTyr: 0.199 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.787TyrAla: 1.787 ± 0.191
1.39TyrCys: 1.39 ± 0.446
2.582TyrAsp: 2.582 ± 0.545
1.39TyrGlu: 1.39 ± 1.225
2.383TyrPhe: 2.383 ± 0.4
1.589TyrGly: 1.589 ± 0.657
0.397TyrHis: 0.397 ± 0.218
3.178TyrIle: 3.178 ± 0.509
4.369TyrLys: 4.369 ± 1.1
3.178TyrLeu: 3.178 ± 0.772
1.589TyrMet: 1.589 ± 0.416
2.781TyrAsn: 2.781 ± 0.883
0.794TyrPro: 0.794 ± 0.232
0.993TyrGln: 0.993 ± 0.495
1.39TyrArg: 1.39 ± 0.429
3.575TyrSer: 3.575 ± 0.924
0.993TyrThr: 0.993 ± 0.357
2.185TyrVal: 2.185 ± 0.813
0.397TyrTrp: 0.397 ± 0.282
1.787TyrTyr: 1.787 ± 0.567
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5036 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski