Amino acid dipeptide frequency for Beet black scorch virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
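The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille` and the toy sequences are made up, pairs are counted within each protein (not across protein boundaries), and any letter outside the 20 standard one-letter codes is mapped to `X` (Xaa). The database itself may count differently.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Hypothetical helper: pairs are counted inside each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown letter to X (Xaa).
        cleaned = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # zip(s, s[1:]) yields every overlapping pair of neighbours.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    symbols = ALPHABET + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(symbols, repeat=2)
    }


# Made-up toy sequences, not the actual viral proteins.
toy = ["MAAVLA", "MXGAV"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400
overrepresented = sorted(k for k, v in freqs.items() if v > expected)
print(len(freqs), expected, overrepresented)
```

The returned dictionary always has 441 entries (21 × 21), so dipeptides that never occur show up with a frequency of 0.0, as in the table.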

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
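As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaVal value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_val = 10.518  # AlaVal frequency from the table, in per mille
uniform = 1000.0 / 400  # uniform expectation for 400 dipeptides, in per mille

fraction = per_mille_to_fraction(ala_val)
percent = per_mille_to_percent(ala_val)
ratio = ala_val / uniform  # how many times more frequent than expected

print(round(fraction, 6), round(percent, 4), round(ratio, 2))
```

Here AlaVal comes out at roughly 1.05%, a little over four times the 0.25% expected under the uniform assumption.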

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.264 ± 4.656
AlaCys: 4.508 ± 2.774
AlaAsp: 3.005 ± 1.164
AlaGlu: 2.254 ± 1.322
AlaPhe: 3.005 ± 0.739
AlaGly: 5.259 ± 5.384
AlaHis: 3.005 ± 1.849
AlaIle: 4.508 ± 2.609
AlaLys: 4.508 ± 1.834
AlaLeu: 5.259 ± 2.237
AlaMet: 2.254 ± 1.196
AlaAsn: 5.259 ± 1.427
AlaPro: 4.508 ± 2.529
AlaGln: 0.0 ± 0.0
AlaArg: 1.503 ± 0.882
AlaSer: 6.762 ± 1.922
AlaThr: 5.259 ± 4.233
AlaVal: 10.518 ± 3.896
AlaTrp: 1.503 ± 0.882
AlaTyr: 2.254 ± 0.948
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.757 ± 1.82
CysCys: 0.0 ± 0.0
CysAsp: 1.503 ± 0.925
CysGlu: 0.0 ± 0.0
CysPhe: 2.254 ± 0.948
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.503 ± 1.595
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.751 ± 0.441
CysAsn: 0.751 ± 0.925
CysPro: 0.751 ± 0.441
CysGln: 0.751 ± 0.441
CysArg: 2.254 ± 0.948
CysSer: 0.751 ± 0.441
CysThr: 0.751 ± 0.441
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.751 ± 0.441
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.005 ± 1.259
AspCys: 0.751 ± 0.441
AspAsp: 2.254 ± 0.948
AspGlu: 6.762 ± 2.267
AspPhe: 3.005 ± 1.849
AspGly: 3.757 ± 1.614
AspHis: 0.751 ± 0.441
AspIle: 2.254 ± 0.948
AspLys: 0.751 ± 0.441
AspLeu: 3.757 ± 1.467
AspMet: 3.005 ± 1.336
AspAsn: 0.751 ± 0.441
AspPro: 3.005 ± 0.739
AspGln: 2.254 ± 1.322
AspArg: 2.254 ± 1.257
AspSer: 3.005 ± 1.849
AspThr: 0.751 ± 0.441
AspVal: 4.508 ± 2.683
AspTrp: 0.751 ± 0.441
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.757 ± 1.713
GluCys: 0.0 ± 0.0
GluAsp: 3.005 ± 1.336
GluGlu: 3.005 ± 1.154
GluPhe: 3.005 ± 1.429
GluGly: 3.005 ± 1.154
GluHis: 2.254 ± 1.322
GluIle: 0.751 ± 0.441
GluLys: 1.503 ± 1.142
GluLeu: 4.508 ± 1.288
GluMet: 3.757 ± 1.82
GluAsn: 3.005 ± 3.168
GluPro: 1.503 ± 0.843
GluGln: 3.005 ± 2.383
GluArg: 5.259 ± 1.441
GluSer: 2.254 ± 0.948
GluThr: 1.503 ± 0.843
GluVal: 4.508 ± 1.896
GluTrp: 0.0 ± 0.0
GluTyr: 0.751 ± 0.441
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.751 ± 0.441
PheAsp: 3.757 ± 1.614
PheGlu: 0.751 ± 0.441
PhePhe: 2.254 ± 0.948
PheGly: 2.254 ± 0.976
PheHis: 3.757 ± 2.353
PheIle: 4.508 ± 2.774
PheLys: 4.508 ± 1.896
PheLeu: 4.508 ± 2.087
PheMet: 2.254 ± 1.244
PheAsn: 0.751 ± 0.441
PhePro: 3.005 ± 0.739
PheGln: 2.254 ± 0.918
PheArg: 3.005 ± 0.739
PheSer: 4.508 ± 1.059
PheThr: 3.005 ± 1.164
PheVal: 0.751 ± 0.441
PheTrp: 0.751 ± 0.441
PheTyr: 2.254 ± 0.948
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.762 ± 4.209
GlyCys: 1.503 ± 0.882
GlyAsp: 1.503 ± 0.882
GlyGlu: 2.254 ± 1.257
GlyPhe: 8.264 ± 1.385
GlyGly: 5.259 ± 2.477
GlyHis: 0.751 ± 1.606
GlyIle: 3.757 ± 1.381
GlyLys: 2.254 ± 0.918
GlyLeu: 8.264 ± 2.006
GlyMet: 3.005 ± 1.763
GlyAsn: 3.005 ± 1.259
GlyPro: 3.757 ± 1.323
GlyGln: 0.751 ± 1.301
GlyArg: 3.005 ± 1.763
GlySer: 1.503 ± 0.882
GlyThr: 3.005 ± 2.041
GlyVal: 7.513 ± 2.135
GlyTrp: 0.751 ± 0.925
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.254 ± 0.948
HisCys: 0.0 ± 0.0
HisAsp: 2.254 ± 1.322
HisGlu: 3.005 ± 2.485
HisPhe: 1.503 ± 2.601
HisGly: 2.254 ± 1.667
HisHis: 0.0 ± 0.0
HisIle: 2.254 ± 0.948
HisLys: 2.254 ± 0.948
HisLeu: 1.503 ± 0.882
HisMet: 0.0 ± 0.0
HisAsn: 3.005 ± 1.154
HisPro: 1.503 ± 0.925
HisGln: 0.751 ± 0.441
HisArg: 0.0 ± 0.0
HisSer: 3.757 ± 1.252
HisThr: 0.0 ± 0.0
HisVal: 2.254 ± 0.948
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.005 ± 0.739
IleCys: 0.0 ± 0.0
IleAsp: 0.751 ± 0.925
IleGlu: 2.254 ± 1.715
IlePhe: 4.508 ± 2.774
IleGly: 3.757 ± 2.304
IleHis: 2.254 ± 0.948
IleIle: 3.005 ± 2.697
IleLys: 4.508 ± 1.836
IleLeu: 4.508 ± 2.774
IleMet: 0.751 ± 0.441
IleAsn: 3.757 ± 0.8
IlePro: 7.513 ± 3.313
IleGln: 1.503 ± 0.843
IleArg: 4.508 ± 1.059
IleSer: 1.503 ± 1.499
IleThr: 2.254 ± 2.252
IleVal: 1.503 ± 1.595
IleTrp: 0.751 ± 0.441
IleTyr: 1.503 ± 1.595
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.005 ± 1.154
LysCys: 0.0 ± 0.0
LysAsp: 6.011 ± 1.97
LysGlu: 0.751 ± 0.441
LysPhe: 1.503 ± 0.843
LysGly: 1.503 ± 0.843
LysHis: 0.0 ± 0.0
LysIle: 2.254 ± 0.948
LysLys: 1.503 ± 1.48
LysLeu: 5.259 ± 2.411
LysMet: 0.751 ± 0.921
LysAsn: 0.751 ± 0.441
LysPro: 2.254 ± 1.125
LysGln: 0.0 ± 0.0
LysArg: 3.757 ± 0.8
LysSer: 4.508 ± 0.837
LysThr: 2.254 ± 1.125
LysVal: 4.508 ± 1.979
LysTrp: 0.0 ± 0.0
LysTyr: 1.503 ± 0.843
LysXaa: 0.751 ± 0.441
Leu
LeuAla: 11.27 ± 2.399
LeuCys: 2.254 ± 0.948
LeuAsp: 3.757 ± 1.82
LeuGlu: 3.757 ± 2.204
LeuPhe: 3.757 ± 0.8
LeuGly: 8.264 ± 1.578
LeuHis: 2.254 ± 1.322
LeuIle: 4.508 ± 1.574
LeuLys: 0.751 ± 0.441
LeuLeu: 8.264 ± 1.102
LeuMet: 0.751 ± 0.925
LeuAsn: 3.005 ± 1.183
LeuPro: 1.503 ± 1.499
LeuGln: 1.503 ± 0.882
LeuArg: 2.254 ± 0.948
LeuSer: 6.011 ± 1.943
LeuThr: 3.005 ± 1.437
LeuVal: 9.016 ± 2.991
LeuTrp: 1.503 ± 0.925
LeuTyr: 1.503 ± 0.843
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.503 ± 1.851
MetCys: 0.751 ± 0.441
MetAsp: 2.254 ± 0.948
MetGlu: 3.005 ± 1.336
MetPhe: 1.503 ± 0.925
MetGly: 1.503 ± 2.574
MetHis: 0.0 ± 0.0
MetIle: 0.751 ± 0.925
MetLys: 0.751 ± 0.441
MetLeu: 0.751 ± 0.441
MetMet: 0.0 ± 0.0
MetAsn: 1.503 ± 1.499
MetPro: 0.0 ± 0.0
MetGln: 2.254 ± 1.322
MetArg: 0.0 ± 0.0
MetSer: 4.508 ± 1.59
MetThr: 1.503 ± 1.48
MetVal: 4.508 ± 1.896
MetTrp: 0.751 ± 0.441
MetTyr: 1.503 ± 0.843
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.751 ± 0.441
AsnCys: 2.254 ± 0.948
AsnAsp: 1.503 ± 0.843
AsnGlu: 1.503 ± 0.925
AsnPhe: 0.0 ± 0.0
AsnGly: 2.254 ± 0.976
AsnHis: 2.254 ± 1.527
AsnIle: 0.751 ± 0.441
AsnLys: 0.751 ± 0.925
AsnLeu: 5.259 ± 2.481
AsnMet: 1.503 ± 1.403
AsnAsn: 1.503 ± 1.142
AsnPro: 1.503 ± 0.882
AsnGln: 0.751 ± 1.315
AsnArg: 4.508 ± 2.112
AsnSer: 0.751 ± 0.925
AsnThr: 4.508 ± 1.447
AsnVal: 4.508 ± 2.476
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.503 ± 0.843
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.508 ± 2.232
ProCys: 0.0 ± 0.0
ProAsp: 3.757 ± 1.467
ProGlu: 3.005 ± 1.325
ProPhe: 0.0 ± 0.0
ProGly: 1.503 ± 0.843
ProHis: 0.0 ± 0.0
ProIle: 3.757 ± 1.381
ProLys: 1.503 ± 0.843
ProLeu: 4.508 ± 1.896
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 2.254 ± 1.333
ProGln: 1.503 ± 0.882
ProArg: 4.508 ± 1.632
ProSer: 7.513 ± 2.816
ProThr: 7.513 ± 2.228
ProVal: 3.005 ± 1.259
ProTrp: 0.751 ± 1.606
ProTyr: 1.503 ± 0.925
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.254 ± 1.322
GlnCys: 0.0 ± 0.0
GlnAsp: 1.503 ± 0.882
GlnGlu: 3.005 ± 1.272
GlnPhe: 2.254 ± 1.322
GlnGly: 1.503 ± 0.882
GlnHis: 1.503 ± 1.191
GlnIle: 1.503 ± 0.882
GlnLys: 0.751 ± 1.315
GlnLeu: 1.503 ± 0.843
GlnMet: 0.751 ± 0.925
GlnAsn: 0.0 ± 0.0
GlnPro: 1.503 ± 0.882
GlnGln: 3.005 ± 1.858
GlnArg: 3.757 ± 2.493
GlnSer: 1.503 ± 1.191
GlnThr: 0.751 ± 0.441
GlnVal: 0.751 ± 0.925
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.751 ± 1.315
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.513 ± 3.212
ArgCys: 0.751 ± 0.441
ArgAsp: 1.503 ± 0.925
ArgGlu: 3.757 ± 2.46
ArgPhe: 6.011 ± 1.802
ArgGly: 3.757 ± 2.204
ArgHis: 0.751 ± 0.441
ArgIle: 2.254 ± 0.918
ArgLys: 3.757 ± 1.713
ArgLeu: 7.513 ± 2.268
ArgMet: 3.005 ± 1.437
ArgAsn: 3.757 ± 1.381
ArgPro: 3.757 ± 0.8
ArgGln: 0.751 ± 0.441
ArgArg: 6.762 ± 4.884
ArgSer: 4.508 ± 4.709
ArgThr: 3.005 ± 1.437
ArgVal: 9.016 ± 2.083
ArgTrp: 0.751 ± 0.925
ArgTyr: 4.508 ± 2.003
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.508 ± 3.73
SerCys: 0.0 ± 0.0
SerAsp: 5.259 ± 1.51
SerGlu: 1.503 ± 2.601
SerPhe: 3.005 ± 1.259
SerGly: 7.513 ± 2.701
SerHis: 1.503 ± 0.925
SerIle: 6.762 ± 2.607
SerLys: 5.259 ± 0.833
SerLeu: 5.259 ± 1.783
SerMet: 3.757 ± 2.46
SerAsn: 1.503 ± 1.851
SerPro: 2.254 ± 0.948
SerGln: 0.751 ± 0.441
SerArg: 8.264 ± 1.586
SerSer: 3.757 ± 4.565
SerThr: 3.757 ± 2.304
SerVal: 3.757 ± 1.699
SerTrp: 0.751 ± 1.606
SerTyr: 5.259 ± 2.892
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.508 ± 4.682
ThrCys: 0.0 ± 0.0
ThrAsp: 0.0 ± 0.0
ThrGlu: 0.751 ± 0.925
ThrPhe: 1.503 ± 1.499
ThrGly: 1.503 ± 0.843
ThrHis: 3.757 ± 1.82
ThrIle: 3.005 ± 1.509
ThrLys: 3.005 ± 1.154
ThrLeu: 1.503 ± 0.843
ThrMet: 1.503 ± 0.843
ThrAsn: 2.254 ± 2.252
ThrPro: 6.011 ± 2.13
ThrGln: 2.254 ± 1.926
ThrArg: 6.011 ± 1.355
ThrSer: 4.508 ± 1.708
ThrThr: 7.513 ± 3.526
ThrVal: 4.508 ± 1.647
ThrTrp: 1.503 ± 0.843
ThrTyr: 0.751 ± 0.441
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.513 ± 2.289
ValCys: 2.254 ± 2.252
ValAsp: 3.005 ± 0.739
ValGlu: 7.513 ± 2.722
ValPhe: 2.254 ± 0.948
ValGly: 6.762 ± 1.913
ValHis: 3.005 ± 1.154
ValIle: 3.005 ± 0.739
ValLys: 3.005 ± 1.429
ValLeu: 3.757 ± 1.467
ValMet: 0.751 ± 0.441
ValAsn: 2.254 ± 2.098
ValPro: 4.508 ± 1.447
ValGln: 3.005 ± 1.991
ValArg: 8.264 ± 2.058
ValSer: 8.264 ± 2.798
ValThr: 3.757 ± 1.593
ValVal: 3.757 ± 3.088
ValTrp: 1.503 ± 1.595
ValTyr: 3.757 ± 1.467
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.005 ± 1.509
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.751 ± 0.441
TrpLeu: 3.005 ± 1.686
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.751 ± 0.441
TrpArg: 2.254 ± 1.518
TrpSer: 0.0 ± 0.0
TrpThr: 0.751 ± 1.606
TrpVal: 0.751 ± 0.441
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.751 ± 0.925
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.005 ± 0.739
TyrCys: 0.751 ± 0.441
TyrAsp: 0.751 ± 0.925
TyrGlu: 1.503 ± 0.882
TyrPhe: 0.0 ± 0.0
TyrGly: 4.508 ± 1.896
TyrHis: 0.751 ± 1.315
TyrIle: 3.005 ± 1.991
TyrLys: 0.751 ± 0.441
TyrLeu: 0.0 ± 0.0
TyrMet: 0.0 ± 0.0
TyrAsn: 1.503 ± 0.843
TyrPro: 0.0 ± 0.0
TyrGln: 0.751 ± 0.441
TyrArg: 5.259 ± 1.411
TyrSer: 4.508 ± 1.288
TyrThr: 1.503 ± 1.142
TyrVal: 2.254 ± 1.527
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.751 ± 0.441
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.751 ± 0.441
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1332 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
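A protein-level bootstrap of this kind can be sketched as follows. This is an illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed for each resample, and that the reported error is the standard deviation of those estimates. The function names and the toy sequences are made up.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        # Count overlapping neighbour pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy proteome, not the actual viral proteins.
toy_proteome = ["MAAVLAGG", "MKAVR", "MSSAVAA", "MLLPT"]
point = pair_per_mille(toy_proteome, "AV")
error = bootstrap_error(toy_proteome, "AV")
print(f"AV: {point:.3f} ± {error:.3f}")
```

With only a handful of proteins, as here, each resample can differ considerably from the original set, which may be why some errors in the table are comparable to the values themselves.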

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski