Amino acid dipeptide frequency for Potato yellow dwarf nucleorhabdovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are overrepresented are marked in red, and those that are underrepresented are marked in blue in the table.
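As a rough illustration of how such a table could be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs without crossing protein boundaries are assumptions made for this example; the exact procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the standard set to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not taken from the real proteome.
toy = ["MKLLSA", "MLLB"]
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / 400  # 2.5 per mille if all 400 standard pairs were equally likely
print(len(freqs), freqs["LL"], uniform)
```

With real data, `proteins` would be the list of protein sequences of the proteome; a dipeptide whose value exceeds `uniform` would count as overrepresented under the equal-likelihood assumption.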

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
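The unit conversion and the comparison with the uniform expectation can be sketched as follows. The values are copied from the table on this page; the helper names and the simple "above or below 2.5‰" rule are assumptions for illustration, since the page does not state the exact threshold behind the red and blue marking.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 1/400 expressed in per mille = 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected value (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# A few entries taken from the table (per mille).
table_values = {"LeuLeu": 8.945, "AlaAsp": 3.322, "HisHis": 0.256, "CysCys": 0.0}

for name, value in table_values.items():
    print(name, per_mille_to_fraction(value), classify(value))
```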

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.856AlaAla: 4.856 ± 2.179
1.022AlaCys: 1.022 ± 0.408
3.322AlaAsp: 3.322 ± 0.906
2.3AlaGlu: 2.3 ± 0.812
0.767AlaPhe: 0.767 ± 0.419
1.533AlaGly: 1.533 ± 1.014
1.022AlaHis: 1.022 ± 0.292
2.556AlaIle: 2.556 ± 1.28
2.556AlaLys: 2.556 ± 0.893
5.878AlaLeu: 5.878 ± 0.976
2.044AlaMet: 2.044 ± 0.92
1.789AlaAsn: 1.789 ± 0.897
4.344AlaPro: 4.344 ± 2.44
2.044AlaGln: 2.044 ± 1.776
2.044AlaArg: 2.044 ± 0.683
3.067AlaSer: 3.067 ± 0.49
4.089AlaThr: 4.089 ± 1.208
4.856AlaVal: 4.856 ± 1.039
0.511AlaTrp: 0.511 ± 0.667
1.533AlaTyr: 1.533 ± 0.482
0.0AlaXaa: 0.0 ± 0.0
Cys
0.511CysAla: 0.511 ± 0.274
0.0CysCys: 0.0 ± 0.0
1.022CysAsp: 1.022 ± 0.548
0.511CysGlu: 0.511 ± 0.522
1.278CysPhe: 1.278 ± 0.524
1.022CysGly: 1.022 ± 0.374
0.0CysHis: 0.0 ± 0.0
1.533CysIle: 1.533 ± 0.426
0.511CysLys: 0.511 ± 0.274
1.789CysLeu: 1.789 ± 0.71
0.767CysMet: 0.767 ± 0.338
1.789CysAsn: 1.789 ± 0.443
1.022CysPro: 1.022 ± 0.674
0.256CysGln: 0.256 ± 0.342
0.256CysArg: 0.256 ± 0.158
1.278CysSer: 1.278 ± 1.209
0.767CysThr: 0.767 ± 0.473
1.022CysVal: 1.022 ± 0.548
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.322AspAla: 3.322 ± 1.727
1.278AspCys: 1.278 ± 0.411
3.833AspAsp: 3.833 ± 0.841
3.833AspGlu: 3.833 ± 1.076
2.3AspPhe: 2.3 ± 0.659
2.3AspGly: 2.3 ± 0.562
1.022AspHis: 1.022 ± 0.654
4.344AspIle: 4.344 ± 0.974
3.067AspLys: 3.067 ± 0.603
4.856AspLeu: 4.856 ± 0.871
2.3AspMet: 2.3 ± 0.518
3.067AspAsn: 3.067 ± 0.589
4.6AspPro: 4.6 ± 1.382
2.556AspGln: 2.556 ± 1.214
2.556AspArg: 2.556 ± 0.786
4.089AspSer: 4.089 ± 1.851
4.344AspThr: 4.344 ± 2.076
2.556AspVal: 2.556 ± 1.017
0.767AspTrp: 0.767 ± 0.509
1.278AspTyr: 1.278 ± 0.565
0.0AspXaa: 0.0 ± 0.0
Glu
1.789GluAla: 1.789 ± 0.541
0.767GluCys: 0.767 ± 0.472
4.089GluAsp: 4.089 ± 0.656
4.089GluGlu: 4.089 ± 1.28
1.789GluPhe: 1.789 ± 0.612
3.067GluGly: 3.067 ± 0.934
1.278GluHis: 1.278 ± 0.577
4.6GluIle: 4.6 ± 0.92
4.856GluLys: 4.856 ± 1.55
4.856GluLeu: 4.856 ± 1.516
3.578GluMet: 3.578 ± 1.228
2.044GluAsn: 2.044 ± 0.999
1.278GluPro: 1.278 ± 0.489
0.767GluGln: 0.767 ± 0.383
3.067GluArg: 3.067 ± 1.357
4.6GluSer: 4.6 ± 1.687
3.322GluThr: 3.322 ± 0.638
2.811GluVal: 2.811 ± 1.171
0.767GluTrp: 0.767 ± 0.94
2.044GluTyr: 2.044 ± 0.889
0.0GluXaa: 0.0 ± 0.0
Phe
2.044PheAla: 2.044 ± 1.398
0.0PheCys: 0.0 ± 0.0
1.278PheAsp: 1.278 ± 0.669
1.278PheGlu: 1.278 ± 0.788
1.278PhePhe: 1.278 ± 0.843
1.789PheGly: 1.789 ± 0.341
0.511PheHis: 0.511 ± 0.315
1.789PheIle: 1.789 ± 0.585
1.533PheLys: 1.533 ± 0.593
3.833PheLeu: 3.833 ± 0.644
1.022PheMet: 1.022 ± 0.344
2.044PheAsn: 2.044 ± 0.592
2.811PhePro: 2.811 ± 1.19
1.789PheGln: 1.789 ± 0.551
2.556PheArg: 2.556 ± 0.781
2.811PheSer: 2.811 ± 0.399
1.278PheThr: 1.278 ± 0.416
2.811PheVal: 2.811 ± 0.663
1.533PheTrp: 1.533 ± 0.482
2.044PheTyr: 2.044 ± 0.71
0.0PheXaa: 0.0 ± 0.0
Gly
2.3GlyAla: 2.3 ± 0.589
0.767GlyCys: 0.767 ± 0.473
3.833GlyAsp: 3.833 ± 0.838
4.856GlyGlu: 4.856 ± 1.457
2.044GlyPhe: 2.044 ± 0.662
3.578GlyGly: 3.578 ± 1.06
1.022GlyHis: 1.022 ± 0.478
4.344GlyIle: 4.344 ± 1.202
4.089GlyLys: 4.089 ± 0.926
5.878GlyLeu: 5.878 ± 0.792
2.3GlyMet: 2.3 ± 0.756
2.044GlyAsn: 2.044 ± 0.719
1.533GlyPro: 1.533 ± 0.668
1.533GlyGln: 1.533 ± 0.951
3.833GlyArg: 3.833 ± 1.154
4.344GlySer: 4.344 ± 0.903
4.6GlyThr: 4.6 ± 1.985
2.556GlyVal: 2.556 ± 0.869
1.022GlyTrp: 1.022 ± 0.486
1.533GlyTyr: 1.533 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
1.278HisAla: 1.278 ± 0.497
0.256HisCys: 0.256 ± 0.158
1.533HisAsp: 1.533 ± 0.395
1.022HisGlu: 1.022 ± 0.41
1.022HisPhe: 1.022 ± 0.631
2.3HisGly: 2.3 ± 1.048
0.256HisHis: 0.256 ± 0.158
2.044HisIle: 2.044 ± 0.334
0.511HisLys: 0.511 ± 0.38
0.767HisLeu: 0.767 ± 0.419
0.767HisMet: 0.767 ± 0.381
1.278HisAsn: 1.278 ± 0.613
1.278HisPro: 1.278 ± 0.613
0.767HisGln: 0.767 ± 0.381
0.511HisArg: 0.511 ± 0.315
1.022HisSer: 1.022 ± 0.548
1.278HisThr: 1.278 ± 0.576
1.278HisVal: 1.278 ± 0.767
0.256HisTrp: 0.256 ± 0.321
0.511HisTyr: 0.511 ± 0.327
0.0HisXaa: 0.0 ± 0.0
Ile
2.044IleAla: 2.044 ± 0.436
1.533IleCys: 1.533 ± 0.622
4.6IleAsp: 4.6 ± 1.416
4.089IleGlu: 4.089 ± 1.209
2.3IlePhe: 2.3 ± 0.482
3.833IleGly: 3.833 ± 1.332
1.278IleHis: 1.278 ± 0.525
4.856IleIle: 4.856 ± 0.904
4.344IleLys: 4.344 ± 0.784
5.367IleLeu: 5.367 ± 1.09
1.789IleMet: 1.789 ± 0.433
4.6IleAsn: 4.6 ± 1.13
4.6IlePro: 4.6 ± 0.925
2.811IleGln: 2.811 ± 1.329
3.578IleArg: 3.578 ± 0.609
7.411IleSer: 7.411 ± 2.727
5.367IleThr: 5.367 ± 0.837
2.556IleVal: 2.556 ± 0.722
1.278IleTrp: 1.278 ± 0.843
3.322IleTyr: 3.322 ± 0.773
0.0IleXaa: 0.0 ± 0.0
Lys
2.044LysAla: 2.044 ± 0.835
0.511LysCys: 0.511 ± 0.395
1.789LysAsp: 1.789 ± 0.544
4.856LysGlu: 4.856 ± 1.26
1.533LysPhe: 1.533 ± 0.478
2.811LysGly: 2.811 ± 0.831
1.278LysHis: 1.278 ± 0.598
4.6LysIle: 4.6 ± 0.831
1.789LysLys: 1.789 ± 0.541
3.833LysLeu: 3.833 ± 0.972
2.044LysMet: 2.044 ± 0.767
4.344LysAsn: 4.344 ± 1.221
3.322LysPro: 3.322 ± 1.088
1.533LysGln: 1.533 ± 0.692
4.089LysArg: 4.089 ± 0.822
2.556LysSer: 2.556 ± 0.893
4.856LysThr: 4.856 ± 1.808
1.789LysVal: 1.789 ± 0.646
1.022LysTrp: 1.022 ± 0.645
3.833LysTyr: 3.833 ± 1.199
0.0LysXaa: 0.0 ± 0.0
Leu
5.622LeuAla: 5.622 ± 0.973
2.3LeuCys: 2.3 ± 0.612
3.322LeuAsp: 3.322 ± 1.297
5.622LeuGlu: 5.622 ± 1.399
2.556LeuPhe: 2.556 ± 0.811
4.6LeuGly: 4.6 ± 1.031
2.044LeuHis: 2.044 ± 0.675
5.367LeuIle: 5.367 ± 1.227
6.645LeuLys: 6.645 ± 1.943
8.945LeuLeu: 8.945 ± 2.13
3.067LeuMet: 3.067 ± 0.806
4.856LeuAsn: 4.856 ± 1.053
5.622LeuPro: 5.622 ± 1.056
2.556LeuGln: 2.556 ± 1.034
5.622LeuArg: 5.622 ± 1.056
8.433LeuSer: 8.433 ± 1.088
6.133LeuThr: 6.133 ± 1.323
5.622LeuVal: 5.622 ± 0.749
0.767LeuTrp: 0.767 ± 0.383
3.067LeuTyr: 3.067 ± 0.825
0.0LeuXaa: 0.0 ± 0.0
Met
2.044MetAla: 2.044 ± 1.132
0.767MetCys: 0.767 ± 0.473
1.022MetAsp: 1.022 ± 0.374
1.533MetGlu: 1.533 ± 0.612
0.767MetPhe: 0.767 ± 0.435
2.044MetGly: 2.044 ± 0.873
0.767MetHis: 0.767 ± 0.419
2.556MetIle: 2.556 ± 0.808
2.3MetLys: 2.3 ± 1.147
3.322MetLeu: 3.322 ± 1.306
2.3MetMet: 2.3 ± 0.747
1.789MetAsn: 1.789 ± 0.544
0.511MetPro: 0.511 ± 0.337
2.044MetGln: 2.044 ± 0.949
3.067MetArg: 3.067 ± 1.197
3.578MetSer: 3.578 ± 0.965
2.3MetThr: 2.3 ± 0.815
1.022MetVal: 1.022 ± 0.499
0.256MetTrp: 0.256 ± 0.158
1.533MetTyr: 1.533 ± 0.678
0.0MetXaa: 0.0 ± 0.0
Asn
1.278AsnAla: 1.278 ± 0.544
0.256AsnCys: 0.256 ± 0.158
2.556AsnAsp: 2.556 ± 1.095
2.556AsnGlu: 2.556 ± 0.494
3.067AsnPhe: 3.067 ± 0.913
1.789AsnGly: 1.789 ± 1.069
1.022AsnHis: 1.022 ± 0.703
4.089AsnIle: 4.089 ± 1.083
2.3AsnLys: 2.3 ± 0.898
6.645AsnLeu: 6.645 ± 1.483
2.3AsnMet: 2.3 ± 0.763
3.322AsnAsn: 3.322 ± 1.279
3.578AsnPro: 3.578 ± 1.408
2.556AsnGln: 2.556 ± 0.495
1.533AsnArg: 1.533 ± 0.494
3.833AsnSer: 3.833 ± 0.834
4.856AsnThr: 4.856 ± 0.955
3.322AsnVal: 3.322 ± 0.917
0.256AsnTrp: 0.256 ± 0.321
1.533AsnTyr: 1.533 ± 0.564
0.0AsnXaa: 0.0 ± 0.0
Pro
3.578ProAla: 3.578 ± 1.716
0.511ProCys: 0.511 ± 0.274
4.344ProAsp: 4.344 ± 1.343
2.3ProGlu: 2.3 ± 0.942
1.022ProPhe: 1.022 ± 0.41
2.811ProGly: 2.811 ± 0.531
1.278ProHis: 1.278 ± 0.507
2.3ProIle: 2.3 ± 1.102
2.556ProLys: 2.556 ± 0.649
3.322ProLeu: 3.322 ± 0.792
1.789ProMet: 1.789 ± 0.654
3.067ProAsn: 3.067 ± 0.828
3.833ProPro: 3.833 ± 1.877
1.022ProGln: 1.022 ± 0.98
3.067ProArg: 3.067 ± 1.059
4.6ProSer: 4.6 ± 1.141
4.089ProThr: 4.089 ± 2.187
3.322ProVal: 3.322 ± 0.965
0.256ProTrp: 0.256 ± 0.321
2.044ProTyr: 2.044 ± 0.818
0.0ProXaa: 0.0 ± 0.0
Gln
1.022GlnAla: 1.022 ± 0.724
0.511GlnCys: 0.511 ± 0.642
1.022GlnAsp: 1.022 ± 0.609
2.044GlnGlu: 2.044 ± 0.645
1.278GlnPhe: 1.278 ± 0.616
2.044GlnGly: 2.044 ± 0.699
1.022GlnHis: 1.022 ± 0.557
2.044GlnIle: 2.044 ± 1.122
4.344GlnLys: 4.344 ± 0.972
3.322GlnLeu: 3.322 ± 1.469
1.278GlnMet: 1.278 ± 0.471
2.044GlnAsn: 2.044 ± 0.458
0.256GlnPro: 0.256 ± 0.158
2.044GlnGln: 2.044 ± 1.094
1.022GlnArg: 1.022 ± 0.631
2.3GlnSer: 2.3 ± 0.799
1.789GlnThr: 1.789 ± 0.987
1.533GlnVal: 1.533 ± 0.584
0.0GlnTrp: 0.0 ± 0.0
0.767GlnTyr: 0.767 ± 0.522
0.0GlnXaa: 0.0 ± 0.0
Arg
3.578ArgAla: 3.578 ± 1.091
0.256ArgCys: 0.256 ± 0.158
3.578ArgAsp: 3.578 ± 0.895
2.811ArgGlu: 2.811 ± 1.182
3.067ArgPhe: 3.067 ± 1.039
3.578ArgGly: 3.578 ± 0.789
1.022ArgHis: 1.022 ± 0.478
3.833ArgIle: 3.833 ± 1.239
1.533ArgLys: 1.533 ± 0.569
4.856ArgLeu: 4.856 ± 1.212
2.044ArgMet: 2.044 ± 1.073
2.811ArgAsn: 2.811 ± 1.147
2.044ArgPro: 2.044 ± 0.518
1.278ArgGln: 1.278 ± 0.398
2.811ArgArg: 2.811 ± 0.925
5.111ArgSer: 5.111 ± 1.827
2.556ArgThr: 2.556 ± 0.769
4.344ArgVal: 4.344 ± 2.065
0.0ArgTrp: 0.0 ± 0.0
2.044ArgTyr: 2.044 ± 0.57
0.0ArgXaa: 0.0 ± 0.0
Ser
5.622SerAla: 5.622 ± 1.461
1.533SerCys: 1.533 ± 0.954
6.389SerAsp: 6.389 ± 1.937
3.578SerGlu: 3.578 ± 0.849
2.811SerPhe: 2.811 ± 0.76
4.856SerGly: 4.856 ± 1.231
1.278SerHis: 1.278 ± 0.565
6.389SerIle: 6.389 ± 2.28
2.3SerLys: 2.3 ± 1.247
9.967SerLeu: 9.967 ± 1.673
2.3SerMet: 2.3 ± 1.023
3.067SerAsn: 3.067 ± 0.604
2.811SerPro: 2.811 ± 1.119
1.533SerGln: 1.533 ± 0.593
4.856SerArg: 4.856 ± 0.331
5.622SerSer: 5.622 ± 3.314
6.9SerThr: 6.9 ± 1.259
6.389SerVal: 6.389 ± 1.086
0.767SerTrp: 0.767 ± 0.311
3.833SerTyr: 3.833 ± 1.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.856ThrAla: 4.856 ± 1.87
2.3ThrCys: 2.3 ± 0.45
3.322ThrAsp: 3.322 ± 1.009
3.578ThrGlu: 3.578 ± 1.022
1.278ThrPhe: 1.278 ± 0.765
4.344ThrGly: 4.344 ± 1.292
1.533ThrHis: 1.533 ± 0.622
6.9ThrIle: 6.9 ± 1.367
3.322ThrLys: 3.322 ± 1.636
4.856ThrLeu: 4.856 ± 1.031
1.789ThrMet: 1.789 ± 0.75
2.811ThrAsn: 2.811 ± 0.399
4.6ThrPro: 4.6 ± 1.704
1.789ThrGln: 1.789 ± 0.511
3.833ThrArg: 3.833 ± 1.16
6.9ThrSer: 6.9 ± 1.079
4.6ThrThr: 4.6 ± 0.808
4.856ThrVal: 4.856 ± 0.966
0.511ThrTrp: 0.511 ± 0.315
2.556ThrTyr: 2.556 ± 0.693
0.0ThrXaa: 0.0 ± 0.0
Val
3.322ValAla: 3.322 ± 0.455
0.256ValCys: 0.256 ± 0.321
3.322ValAsp: 3.322 ± 0.874
2.811ValGlu: 2.811 ± 0.958
3.833ValPhe: 3.833 ± 0.984
5.622ValGly: 5.622 ± 1.007
1.022ValHis: 1.022 ± 0.631
3.833ValIle: 3.833 ± 1.576
2.556ValLys: 2.556 ± 0.805
5.878ValLeu: 5.878 ± 1.477
1.022ValMet: 1.022 ± 0.436
3.833ValAsn: 3.833 ± 1.041
0.767ValPro: 0.767 ± 0.576
1.789ValGln: 1.789 ± 0.897
2.3ValArg: 2.3 ± 0.562
7.156ValSer: 7.156 ± 1.267
3.578ValThr: 3.578 ± 0.634
3.578ValVal: 3.578 ± 0.533
1.278ValTrp: 1.278 ± 0.584
1.789ValTyr: 1.789 ± 0.541
0.0ValXaa: 0.0 ± 0.0
Trp
0.511TrpAla: 0.511 ± 0.274
0.256TrpCys: 0.256 ± 0.158
1.533TrpAsp: 1.533 ± 0.636
0.256TrpGlu: 0.256 ± 0.158
0.767TrpPhe: 0.767 ± 0.688
0.767TrpGly: 0.767 ± 0.397
0.0TrpHis: 0.0 ± 0.0
0.511TrpIle: 0.511 ± 0.274
1.022TrpLys: 1.022 ± 0.358
0.511TrpLeu: 0.511 ± 0.642
0.256TrpMet: 0.256 ± 0.334
1.278TrpAsn: 1.278 ± 0.512
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.511TrpArg: 0.511 ± 0.274
1.022TrpSer: 1.022 ± 0.584
1.533TrpThr: 1.533 ± 0.426
0.767TrpVal: 0.767 ± 0.561
0.767TrpTrp: 0.767 ± 0.334
0.256TrpTyr: 0.256 ± 0.321
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.511TyrAla: 0.511 ± 0.497
0.256TyrCys: 0.256 ± 0.321
2.556TyrAsp: 2.556 ± 1.527
1.533TyrGlu: 1.533 ± 0.773
1.278TyrPhe: 1.278 ± 0.402
3.322TyrGly: 3.322 ± 0.918
1.278TyrHis: 1.278 ± 0.524
3.067TyrIle: 3.067 ± 0.544
2.3TyrLys: 2.3 ± 1.058
4.089TyrLeu: 4.089 ± 1.279
0.767TyrMet: 0.767 ± 0.381
0.767TyrAsn: 0.767 ± 0.359
2.3TyrPro: 2.3 ± 0.535
1.022TyrGln: 1.022 ± 0.655
2.044TyrArg: 2.044 ± 0.534
3.067TyrSer: 3.067 ± 0.646
2.3TyrThr: 2.3 ± 1.221
2.556TyrVal: 2.556 ± 1.0
0.511TyrTrp: 0.511 ± 0.433
1.022TyrTyr: 1.022 ± 0.408
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3914 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
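One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of those estimates. The sketch below follows that reading; the function names, the fixed seed, the use of the sample standard deviation, and the toy sequences are all assumptions, not the documented Proteome-pI procedure.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs in each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the real sequences.
toy = ["MKLLSA", "MLLSLL", "MAAKT"]
err = bootstrap_error(toy, "LL")
print(pair_per_mille(toy, "LL"), err)
```

Because whole proteins are the resampling unit, a proteome with only a few proteins (7 here) can show large errors for dipeptides concentrated in one or two of them.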

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski