Amino acid dipepetide frequency for Potato yellow vein virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.023AlaAla: 2.023 ± 0.971
0.405AlaCys: 0.405 ± 0.407
2.63AlaAsp: 2.63 ± 0.575
1.821AlaGlu: 1.821 ± 0.676
1.416AlaPhe: 1.416 ± 0.82
2.63AlaGly: 2.63 ± 0.675
1.012AlaHis: 1.012 ± 0.279
2.225AlaIle: 2.225 ± 1.212
2.225AlaLys: 2.225 ± 0.731
3.439AlaLeu: 3.439 ± 0.778
0.405AlaMet: 0.405 ± 0.228
1.416AlaAsn: 1.416 ± 0.417
1.012AlaPro: 1.012 ± 0.402
1.618AlaGln: 1.618 ± 0.613
1.012AlaArg: 1.012 ± 0.396
1.821AlaSer: 1.821 ± 0.239
1.214AlaThr: 1.214 ± 0.559
2.832AlaVal: 2.832 ± 0.78
0.0AlaTrp: 0.0 ± 0.0
1.214AlaTyr: 1.214 ± 0.299
0.0AlaXaa: 0.0 ± 0.0
Cys
0.202CysAla: 0.202 ± 0.114
0.0CysCys: 0.0 ± 0.0
1.618CysAsp: 1.618 ± 0.268
1.012CysGlu: 1.012 ± 0.339
0.607CysPhe: 0.607 ± 0.216
0.809CysGly: 0.809 ± 0.398
0.0CysHis: 0.0 ± 0.0
0.607CysIle: 0.607 ± 0.422
1.821CysLys: 1.821 ± 0.416
2.428CysLeu: 2.428 ± 0.374
0.809CysMet: 0.809 ± 0.315
1.214CysAsn: 1.214 ± 0.373
0.607CysPro: 0.607 ± 0.25
0.202CysGln: 0.202 ± 0.114
1.012CysArg: 1.012 ± 0.439
1.821CysSer: 1.821 ± 0.93
1.214CysThr: 1.214 ± 0.906
1.012CysVal: 1.012 ± 0.445
0.202CysTrp: 0.202 ± 0.114
1.012CysTyr: 1.012 ± 0.381
0.0CysXaa: 0.0 ± 0.0
Asp
1.821AspAla: 1.821 ± 0.41
1.214AspCys: 1.214 ± 0.45
5.26AspAsp: 5.26 ± 1.368
4.855AspGlu: 4.855 ± 0.839
4.653AspPhe: 4.653 ± 0.573
4.451AspGly: 4.451 ± 0.647
1.012AspHis: 1.012 ± 0.399
5.058AspIle: 5.058 ± 0.535
4.855AspLys: 4.855 ± 0.826
7.89AspLeu: 7.89 ± 1.229
1.416AspMet: 1.416 ± 0.461
2.63AspAsn: 2.63 ± 0.495
3.035AspPro: 3.035 ± 1.066
0.607AspGln: 0.607 ± 0.673
3.035AspArg: 3.035 ± 0.768
4.248AspSer: 4.248 ± 0.687
2.023AspThr: 2.023 ± 0.598
5.26AspVal: 5.26 ± 0.745
0.607AspTrp: 0.607 ± 0.225
2.225AspTyr: 2.225 ± 0.533
0.0AspXaa: 0.0 ± 0.0
Glu
1.416GluAla: 1.416 ± 0.457
1.012GluCys: 1.012 ± 0.588
4.046GluAsp: 4.046 ± 0.642
2.832GluGlu: 2.832 ± 0.678
3.237GluPhe: 3.237 ± 0.765
2.023GluGly: 2.023 ± 0.566
0.405GluHis: 0.405 ± 0.438
4.248GluIle: 4.248 ± 1.072
6.878GluLys: 6.878 ± 1.243
6.271GluLeu: 6.271 ± 0.991
1.416GluMet: 1.416 ± 0.579
3.439GluAsn: 3.439 ± 0.305
1.012GluPro: 1.012 ± 0.397
1.214GluGln: 1.214 ± 0.485
2.63GluArg: 2.63 ± 0.458
4.046GluSer: 4.046 ± 0.512
1.821GluThr: 1.821 ± 0.652
5.058GluVal: 5.058 ± 0.777
0.405GluTrp: 0.405 ± 0.188
3.035GluTyr: 3.035 ± 0.641
0.0GluXaa: 0.0 ± 0.0
Phe
1.214PheAla: 1.214 ± 0.378
2.225PheCys: 2.225 ± 0.537
5.26PheAsp: 5.26 ± 1.025
3.035PheGlu: 3.035 ± 0.572
1.416PhePhe: 1.416 ± 0.464
3.439PheGly: 3.439 ± 0.954
0.809PheHis: 0.809 ± 0.348
3.439PheIle: 3.439 ± 0.753
5.058PheLys: 5.058 ± 0.88
6.069PheLeu: 6.069 ± 1.398
1.821PheMet: 1.821 ± 0.297
2.63PheAsn: 2.63 ± 0.345
1.012PhePro: 1.012 ± 0.326
1.214PheGln: 1.214 ± 0.579
2.63PheArg: 2.63 ± 1.069
7.081PheSer: 7.081 ± 0.874
2.023PheThr: 2.023 ± 0.39
2.832PheVal: 2.832 ± 0.518
0.0PheTrp: 0.0 ± 0.0
2.023PheTyr: 2.023 ± 0.691
0.0PheXaa: 0.0 ± 0.0
Gly
1.214GlyAla: 1.214 ± 0.345
1.012GlyCys: 1.012 ± 0.32
4.046GlyAsp: 4.046 ± 1.407
4.248GlyGlu: 4.248 ± 0.869
2.225GlyPhe: 2.225 ± 0.493
3.035GlyGly: 3.035 ± 0.873
0.809GlyHis: 0.809 ± 0.322
2.63GlyIle: 2.63 ± 0.393
4.451GlyLys: 4.451 ± 1.388
3.035GlyLeu: 3.035 ± 0.439
1.618GlyMet: 1.618 ± 0.528
3.035GlyAsn: 3.035 ± 0.642
0.202GlyPro: 0.202 ± 0.224
0.809GlyGln: 0.809 ± 0.494
2.428GlyArg: 2.428 ± 0.601
3.844GlySer: 3.844 ± 0.745
2.225GlyThr: 2.225 ± 0.599
4.046GlyVal: 4.046 ± 0.644
0.202GlyTrp: 0.202 ± 0.224
1.214GlyTyr: 1.214 ± 0.432
0.0GlyXaa: 0.0 ± 0.0
His
0.607HisAla: 0.607 ± 0.235
0.607HisCys: 0.607 ± 0.222
1.416HisAsp: 1.416 ± 0.574
0.405HisGlu: 0.405 ± 0.228
1.012HisPhe: 1.012 ± 0.439
1.012HisGly: 1.012 ± 0.648
0.0HisHis: 0.0 ± 0.0
1.214HisIle: 1.214 ± 0.585
1.416HisLys: 1.416 ± 0.617
1.214HisLeu: 1.214 ± 0.39
0.202HisMet: 0.202 ± 0.207
0.202HisAsn: 0.202 ± 0.26
0.405HisPro: 0.405 ± 0.407
0.405HisGln: 0.405 ± 0.326
0.607HisArg: 0.607 ± 0.318
1.214HisSer: 1.214 ± 0.565
0.405HisThr: 0.405 ± 0.228
1.214HisVal: 1.214 ± 0.565
0.0HisTrp: 0.0 ± 0.0
1.618HisTyr: 1.618 ± 0.426
0.0HisXaa: 0.0 ± 0.0
Ile
1.416IleAla: 1.416 ± 0.435
0.809IleCys: 0.809 ± 0.307
4.248IleAsp: 4.248 ± 0.868
3.439IleGlu: 3.439 ± 0.778
5.26IlePhe: 5.26 ± 1.25
2.832IleGly: 2.832 ± 0.429
1.012IleHis: 1.012 ± 0.569
4.855IleIle: 4.855 ± 0.712
5.867IleLys: 5.867 ± 0.9
8.092IleLeu: 8.092 ± 1.419
1.618IleMet: 1.618 ± 0.599
3.844IleAsn: 3.844 ± 0.631
3.642IlePro: 3.642 ± 0.617
2.023IleGln: 2.023 ± 0.252
3.439IleArg: 3.439 ± 0.614
9.508IleSer: 9.508 ± 1.408
2.832IleThr: 2.832 ± 0.759
4.248IleVal: 4.248 ± 0.899
0.607IleTrp: 0.607 ± 0.568
2.428IleTyr: 2.428 ± 0.44
0.0IleXaa: 0.0 ± 0.0
Lys
2.023LysAla: 2.023 ± 0.94
1.012LysCys: 1.012 ± 0.374
4.248LysAsp: 4.248 ± 0.596
5.058LysGlu: 5.058 ± 0.506
5.867LysPhe: 5.867 ± 1.562
3.035LysGly: 3.035 ± 0.53
1.214LysHis: 1.214 ± 0.487
7.283LysIle: 7.283 ± 0.881
4.046LysLys: 4.046 ± 0.526
9.913LysLeu: 9.913 ± 1.178
0.607LysMet: 0.607 ± 0.342
7.283LysAsn: 7.283 ± 1.452
2.428LysPro: 2.428 ± 0.611
2.63LysGln: 2.63 ± 0.955
3.844LysArg: 3.844 ± 0.455
5.665LysSer: 5.665 ± 1.341
5.058LysThr: 5.058 ± 0.761
6.271LysVal: 6.271 ± 0.707
0.405LysTrp: 0.405 ± 0.228
4.248LysTyr: 4.248 ± 0.65
0.0LysXaa: 0.0 ± 0.0
Leu
2.63LeuAla: 2.63 ± 1.39
1.618LeuCys: 1.618 ± 0.679
5.665LeuAsp: 5.665 ± 0.577
4.248LeuGlu: 4.248 ± 0.548
4.855LeuPhe: 4.855 ± 0.55
5.058LeuGly: 5.058 ± 0.632
1.012LeuHis: 1.012 ± 0.471
7.485LeuIle: 7.485 ± 0.994
8.497LeuLys: 8.497 ± 1.232
7.081LeuLeu: 7.081 ± 0.926
2.023LeuMet: 2.023 ± 0.708
8.295LeuAsn: 8.295 ± 1.508
2.428LeuPro: 2.428 ± 0.964
1.821LeuGln: 1.821 ± 0.536
7.89LeuArg: 7.89 ± 0.798
10.925LeuSer: 10.925 ± 0.948
5.867LeuThr: 5.867 ± 0.962
5.058LeuVal: 5.058 ± 0.624
0.202LeuTrp: 0.202 ± 0.26
5.26LeuTyr: 5.26 ± 1.22
0.0LeuXaa: 0.0 ± 0.0
Met
1.416MetAla: 1.416 ± 0.415
0.607MetCys: 0.607 ± 0.348
1.214MetAsp: 1.214 ± 0.457
1.416MetGlu: 1.416 ± 0.607
1.416MetPhe: 1.416 ± 0.482
0.202MetGly: 0.202 ± 0.114
0.0MetHis: 0.0 ± 0.0
2.225MetIle: 2.225 ± 0.664
2.63MetLys: 2.63 ± 0.592
1.012MetLeu: 1.012 ± 0.334
0.0MetMet: 0.0 ± 0.0
2.428MetAsn: 2.428 ± 0.535
0.809MetPro: 0.809 ± 0.315
0.607MetGln: 0.607 ± 0.225
2.225MetArg: 2.225 ± 0.569
2.428MetSer: 2.428 ± 0.783
1.012MetThr: 1.012 ± 0.431
2.023MetVal: 2.023 ± 0.654
0.0MetTrp: 0.0 ± 0.0
1.214MetTyr: 1.214 ± 0.345
0.0MetXaa: 0.0 ± 0.0
Asn
3.237AsnAla: 3.237 ± 0.508
0.809AsnCys: 0.809 ± 0.202
4.046AsnAsp: 4.046 ± 0.409
3.237AsnGlu: 3.237 ± 0.864
3.844AsnPhe: 3.844 ± 0.82
1.821AsnGly: 1.821 ± 0.627
0.809AsnHis: 0.809 ± 0.289
3.844AsnIle: 3.844 ± 0.428
6.676AsnLys: 6.676 ± 0.942
5.462AsnLeu: 5.462 ± 0.834
3.035AsnMet: 3.035 ± 0.686
3.844AsnAsn: 3.844 ± 0.888
3.237AsnPro: 3.237 ± 1.033
1.821AsnGln: 1.821 ± 0.573
3.035AsnArg: 3.035 ± 0.415
7.485AsnSer: 7.485 ± 1.723
2.832AsnThr: 2.832 ± 0.723
3.844AsnVal: 3.844 ± 1.149
0.405AsnTrp: 0.405 ± 0.228
1.416AsnTyr: 1.416 ± 1.129
0.0AsnXaa: 0.0 ± 0.0
Pro
0.809ProAla: 0.809 ± 0.421
0.607ProCys: 0.607 ± 0.373
2.428ProAsp: 2.428 ± 0.685
2.832ProGlu: 2.832 ± 1.253
0.809ProPhe: 0.809 ± 0.307
1.821ProGly: 1.821 ± 0.713
0.607ProHis: 0.607 ± 0.454
2.428ProIle: 2.428 ± 0.584
2.428ProLys: 2.428 ± 0.346
3.237ProLeu: 3.237 ± 0.858
0.405ProMet: 0.405 ± 0.182
1.618ProAsn: 1.618 ± 0.385
1.618ProPro: 1.618 ± 0.709
0.607ProGln: 0.607 ± 0.25
1.416ProArg: 1.416 ± 0.328
2.832ProSer: 2.832 ± 0.651
1.416ProThr: 1.416 ± 0.375
2.428ProVal: 2.428 ± 1.044
0.0ProTrp: 0.0 ± 0.0
1.618ProTyr: 1.618 ± 0.513
0.0ProXaa: 0.0 ± 0.0
Gln
0.809GlnAla: 0.809 ± 0.338
0.405GlnCys: 0.405 ± 0.228
2.023GlnAsp: 2.023 ± 0.564
1.012GlnGlu: 1.012 ± 0.634
2.225GlnPhe: 2.225 ± 0.546
1.214GlnGly: 1.214 ± 0.318
0.202GlnHis: 0.202 ± 0.203
2.023GlnIle: 2.023 ± 0.48
1.821GlnLys: 1.821 ± 0.746
1.416GlnLeu: 1.416 ± 0.812
0.607GlnMet: 0.607 ± 0.31
1.821GlnAsn: 1.821 ± 0.434
1.214GlnPro: 1.214 ± 0.797
0.607GlnGln: 0.607 ± 0.398
1.012GlnArg: 1.012 ± 0.404
1.416GlnSer: 1.416 ± 0.829
2.225GlnThr: 2.225 ± 0.866
1.214GlnVal: 1.214 ± 0.394
0.0GlnTrp: 0.0 ± 0.0
1.214GlnTyr: 1.214 ± 0.581
0.0GlnXaa: 0.0 ± 0.0
Arg
1.618ArgAla: 1.618 ± 0.411
0.809ArgCys: 0.809 ± 0.377
3.237ArgAsp: 3.237 ± 1.296
2.023ArgGlu: 2.023 ± 0.762
3.439ArgPhe: 3.439 ± 0.882
2.63ArgGly: 2.63 ± 0.565
1.618ArgHis: 1.618 ± 0.347
4.046ArgIle: 4.046 ± 0.684
3.642ArgLys: 3.642 ± 0.606
4.855ArgLeu: 4.855 ± 0.775
1.416ArgMet: 1.416 ± 0.412
4.248ArgAsn: 4.248 ± 0.749
0.607ArgPro: 0.607 ± 0.241
0.809ArgGln: 0.809 ± 0.367
3.844ArgArg: 3.844 ± 1.045
5.26ArgSer: 5.26 ± 1.037
2.832ArgThr: 2.832 ± 0.92
3.439ArgVal: 3.439 ± 1.069
0.202ArgTrp: 0.202 ± 0.203
2.832ArgTyr: 2.832 ± 0.765
0.0ArgXaa: 0.0 ± 0.0
Ser
4.451SerAla: 4.451 ± 0.325
1.618SerCys: 1.618 ± 0.205
4.046SerAsp: 4.046 ± 0.834
5.26SerGlu: 5.26 ± 0.828
5.058SerPhe: 5.058 ± 1.465
4.046SerGly: 4.046 ± 1.05
2.023SerHis: 2.023 ± 0.492
5.867SerIle: 5.867 ± 1.0
8.699SerLys: 8.699 ± 0.849
9.711SerLeu: 9.711 ± 0.928
2.832SerMet: 2.832 ± 0.619
6.069SerAsn: 6.069 ± 1.017
2.023SerPro: 2.023 ± 0.391
2.023SerGln: 2.023 ± 0.666
4.653SerArg: 4.653 ± 0.629
7.081SerSer: 7.081 ± 1.062
4.046SerThr: 4.046 ± 1.071
7.081SerVal: 7.081 ± 0.809
0.202SerTrp: 0.202 ± 0.224
5.665SerTyr: 5.665 ± 0.784
0.0SerXaa: 0.0 ± 0.0
Thr
1.821ThrAla: 1.821 ± 0.564
0.607ThrCys: 0.607 ± 0.368
2.023ThrAsp: 2.023 ± 0.663
3.035ThrGlu: 3.035 ± 0.691
2.023ThrPhe: 2.023 ± 0.435
2.023ThrGly: 2.023 ± 0.439
0.809ThrHis: 0.809 ± 0.281
2.428ThrIle: 2.428 ± 0.591
3.642ThrLys: 3.642 ± 1.008
4.653ThrLeu: 4.653 ± 0.67
0.809ThrMet: 0.809 ± 0.307
2.428ThrAsn: 2.428 ± 0.577
2.225ThrPro: 2.225 ± 1.039
1.618ThrGln: 1.618 ± 0.54
1.618ThrArg: 1.618 ± 0.471
5.26ThrSer: 5.26 ± 0.591
3.237ThrThr: 3.237 ± 0.677
4.046ThrVal: 4.046 ± 1.344
0.405ThrTrp: 0.405 ± 0.228
2.225ThrTyr: 2.225 ± 0.399
0.0ThrXaa: 0.0 ± 0.0
Val
2.023ValAla: 2.023 ± 0.689
1.416ValCys: 1.416 ± 0.431
6.271ValAsp: 6.271 ± 0.715
3.439ValGlu: 3.439 ± 0.758
3.642ValPhe: 3.642 ± 0.634
3.035ValGly: 3.035 ± 0.722
1.012ValHis: 1.012 ± 0.279
6.271ValIle: 6.271 ± 1.027
3.642ValLys: 3.642 ± 0.524
5.26ValLeu: 5.26 ± 0.851
2.023ValMet: 2.023 ± 0.722
5.26ValAsn: 5.26 ± 0.709
1.821ValPro: 1.821 ± 0.538
2.428ValGln: 2.428 ± 0.404
4.451ValArg: 4.451 ± 0.389
7.081ValSer: 7.081 ± 0.996
2.63ValThr: 2.63 ± 0.42
5.462ValVal: 5.462 ± 0.898
0.607ValTrp: 0.607 ± 0.352
2.428ValTyr: 2.428 ± 0.555
0.0ValXaa: 0.0 ± 0.0
Trp
0.405TrpAla: 0.405 ± 0.228
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.405TrpPhe: 0.405 ± 0.188
0.0TrpGly: 0.0 ± 0.0
0.405TrpHis: 0.405 ± 0.228
0.405TrpIle: 0.405 ± 0.228
0.607TrpLys: 0.607 ± 0.344
0.809TrpLeu: 0.809 ± 0.212
0.405TrpMet: 0.405 ± 0.274
0.202TrpAsn: 0.202 ± 0.114
0.202TrpPro: 0.202 ± 0.224
0.0TrpGln: 0.0 ± 0.0
0.202TrpArg: 0.202 ± 0.224
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.202TrpVal: 0.202 ± 0.203
0.0TrpTrp: 0.0 ± 0.0
0.202TrpTyr: 0.202 ± 0.224
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.214TyrAla: 1.214 ± 0.393
1.416TyrCys: 1.416 ± 0.393
2.428TyrAsp: 2.428 ± 0.804
3.237TyrGlu: 3.237 ± 0.811
2.023TyrPhe: 2.023 ± 0.528
1.214TyrGly: 1.214 ± 0.432
0.405TyrHis: 0.405 ± 0.228
3.439TyrIle: 3.439 ± 0.618
3.237TyrLys: 3.237 ± 0.446
6.069TyrLeu: 6.069 ± 1.051
1.214TyrMet: 1.214 ± 0.649
2.832TyrAsn: 2.832 ± 1.197
2.428TyrPro: 2.428 ± 0.431
1.416TyrGln: 1.416 ± 0.831
2.428TyrArg: 2.428 ± 0.731
3.439TyrSer: 3.439 ± 0.826
2.023TyrThr: 2.023 ± 0.612
2.63TyrVal: 2.63 ± 0.551
0.0TyrTrp: 0.0 ± 0.0
1.416TyrTyr: 1.416 ± 0.617
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4944 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski