Amino acid dipepetide frequency for Strawberry chlorotic fleck-associated virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.379AlaAla: 4.379 ± 1.256
0.547AlaCys: 0.547 ± 0.309
3.284AlaAsp: 3.284 ± 0.614
4.379AlaGlu: 4.379 ± 1.157
2.919AlaPhe: 2.919 ± 0.623
2.372AlaGly: 2.372 ± 0.545
0.912AlaHis: 0.912 ± 0.319
3.284AlaIle: 3.284 ± 1.186
5.291AlaLys: 5.291 ± 0.789
7.115AlaLeu: 7.115 ± 0.895
2.007AlaMet: 2.007 ± 0.434
2.007AlaAsn: 2.007 ± 0.621
2.737AlaPro: 2.737 ± 0.964
1.824AlaGln: 1.824 ± 0.429
2.554AlaArg: 2.554 ± 1.07
4.561AlaSer: 4.561 ± 0.933
4.744AlaThr: 4.744 ± 0.73
6.386AlaVal: 6.386 ± 1.124
0.0AlaTrp: 0.0 ± 0.0
3.102AlaTyr: 3.102 ± 0.723
0.0AlaXaa: 0.0 ± 0.0
Cys
2.007CysAla: 2.007 ± 0.665
0.547CysCys: 0.547 ± 0.274
0.912CysAsp: 0.912 ± 0.282
1.642CysGlu: 1.642 ± 0.429
1.277CysPhe: 1.277 ± 0.42
1.277CysGly: 1.277 ± 0.414
0.182CysHis: 0.182 ± 0.105
1.642CysIle: 1.642 ± 0.366
1.277CysLys: 1.277 ± 0.342
3.102CysLeu: 3.102 ± 0.824
0.365CysMet: 0.365 ± 0.347
0.547CysAsn: 0.547 ± 0.299
0.912CysPro: 0.912 ± 0.429
0.182CysGln: 0.182 ± 0.105
0.365CysArg: 0.365 ± 0.21
3.102CysSer: 3.102 ± 0.927
1.642CysThr: 1.642 ± 0.603
1.46CysVal: 1.46 ± 0.439
0.0CysTrp: 0.0 ± 0.0
1.277CysTyr: 1.277 ± 0.413
0.0CysXaa: 0.0 ± 0.0
Asp
3.102AspAla: 3.102 ± 0.728
1.642AspCys: 1.642 ± 0.494
2.554AspAsp: 2.554 ± 0.717
3.649AspGlu: 3.649 ± 0.694
5.291AspPhe: 5.291 ± 0.672
2.919AspGly: 2.919 ± 0.584
1.824AspHis: 1.824 ± 0.356
3.649AspIle: 3.649 ± 0.781
2.372AspLys: 2.372 ± 0.577
5.656AspLeu: 5.656 ± 1.209
1.277AspMet: 1.277 ± 0.555
2.007AspAsn: 2.007 ± 0.63
1.095AspPro: 1.095 ± 0.419
1.277AspGln: 1.277 ± 0.452
2.189AspArg: 2.189 ± 0.505
4.196AspSer: 4.196 ± 1.011
2.554AspThr: 2.554 ± 0.62
4.196AspVal: 4.196 ± 0.775
0.365AspTrp: 0.365 ± 0.21
2.737AspTyr: 2.737 ± 1.396
0.0AspXaa: 0.0 ± 0.0
Glu
4.926GluAla: 4.926 ± 0.94
1.642GluCys: 1.642 ± 0.735
4.196GluAsp: 4.196 ± 1.13
4.379GluGlu: 4.379 ± 0.898
2.919GluPhe: 2.919 ± 0.778
3.649GluGly: 3.649 ± 0.627
1.095GluHis: 1.095 ± 0.624
4.744GluIle: 4.744 ± 1.115
5.109GluLys: 5.109 ± 0.818
4.926GluLeu: 4.926 ± 1.674
0.912GluMet: 0.912 ± 0.325
1.46GluAsn: 1.46 ± 0.523
0.547GluPro: 0.547 ± 0.392
0.547GluGln: 0.547 ± 0.221
2.919GluArg: 2.919 ± 1.513
5.291GluSer: 5.291 ± 1.035
2.919GluThr: 2.919 ± 0.497
4.561GluVal: 4.561 ± 0.921
1.095GluTrp: 1.095 ± 0.362
2.919GluTyr: 2.919 ± 0.666
0.0GluXaa: 0.0 ± 0.0
Phe
2.372PheAla: 2.372 ± 0.692
1.824PheCys: 1.824 ± 0.45
2.919PheAsp: 2.919 ± 0.782
4.379PheGlu: 4.379 ± 0.929
2.007PhePhe: 2.007 ± 0.753
3.649PheGly: 3.649 ± 0.898
0.73PheHis: 0.73 ± 0.246
3.284PheIle: 3.284 ± 0.813
3.102PheLys: 3.102 ± 0.802
5.109PheLeu: 5.109 ± 0.726
1.642PheMet: 1.642 ± 0.423
1.824PheAsn: 1.824 ± 0.502
2.554PhePro: 2.554 ± 0.557
0.73PheGln: 0.73 ± 0.385
2.189PheArg: 2.189 ± 0.924
6.021PheSer: 6.021 ± 1.375
3.284PheThr: 3.284 ± 0.933
3.467PheVal: 3.467 ± 1.227
0.182PheTrp: 0.182 ± 0.105
1.095PheTyr: 1.095 ± 0.773
0.0PheXaa: 0.0 ± 0.0
Gly
3.831GlyAla: 3.831 ± 0.584
1.824GlyCys: 1.824 ± 0.484
3.831GlyAsp: 3.831 ± 0.638
4.014GlyGlu: 4.014 ± 0.73
3.467GlyPhe: 3.467 ± 0.54
3.102GlyGly: 3.102 ± 1.042
0.547GlyHis: 0.547 ± 0.352
2.007GlyIle: 2.007 ± 0.862
3.831GlyLys: 3.831 ± 0.989
3.831GlyLeu: 3.831 ± 1.285
0.182GlyMet: 0.182 ± 0.407
2.007GlyAsn: 2.007 ± 0.505
1.277GlyPro: 1.277 ± 0.561
0.365GlyGln: 0.365 ± 0.21
1.46GlyArg: 1.46 ± 0.455
5.473GlySer: 5.473 ± 1.031
2.737GlyThr: 2.737 ± 1.181
5.838GlyVal: 5.838 ± 1.733
0.365GlyTrp: 0.365 ± 0.329
2.919GlyTyr: 2.919 ± 1.057
0.0GlyXaa: 0.0 ± 0.0
His
1.277HisAla: 1.277 ± 0.366
0.547HisCys: 0.547 ± 0.386
1.095HisAsp: 1.095 ± 0.362
1.46HisGlu: 1.46 ± 0.746
1.277HisPhe: 1.277 ± 0.477
0.912HisGly: 0.912 ± 0.479
0.73HisHis: 0.73 ± 0.269
1.642HisIle: 1.642 ± 0.814
1.642HisLys: 1.642 ± 0.455
2.372HisLeu: 2.372 ± 0.642
0.365HisMet: 0.365 ± 0.336
1.277HisAsn: 1.277 ± 0.472
1.095HisPro: 1.095 ± 0.444
0.0HisGln: 0.0 ± 0.0
0.73HisArg: 0.73 ± 0.246
1.095HisSer: 1.095 ± 0.477
0.912HisThr: 0.912 ± 0.241
2.007HisVal: 2.007 ± 0.475
0.182HisTrp: 0.182 ± 0.241
1.095HisTyr: 1.095 ± 0.68
0.0HisXaa: 0.0 ± 0.0
Ile
1.824IleAla: 1.824 ± 1.152
1.642IleCys: 1.642 ± 0.693
2.007IleAsp: 2.007 ± 0.76
2.372IleGlu: 2.372 ± 0.609
1.46IlePhe: 1.46 ± 0.39
1.824IleGly: 1.824 ± 0.515
1.46IleHis: 1.46 ± 0.402
2.554IleIle: 2.554 ± 0.656
4.196IleLys: 4.196 ± 0.868
4.926IleLeu: 4.926 ± 1.071
1.277IleMet: 1.277 ± 0.321
2.737IleAsn: 2.737 ± 0.972
3.284IlePro: 3.284 ± 0.731
1.095IleGln: 1.095 ± 0.391
2.737IleArg: 2.737 ± 0.905
5.838IleSer: 5.838 ± 1.492
3.467IleThr: 3.467 ± 0.557
4.014IleVal: 4.014 ± 0.739
0.365IleTrp: 0.365 ± 0.376
2.372IleTyr: 2.372 ± 0.664
0.0IleXaa: 0.0 ± 0.0
Lys
4.926LysAla: 4.926 ± 0.771
1.642LysCys: 1.642 ± 0.41
4.196LysAsp: 4.196 ± 1.318
3.649LysGlu: 3.649 ± 0.553
4.561LysPhe: 4.561 ± 1.462
2.919LysGly: 2.919 ± 0.863
1.642LysHis: 1.642 ± 0.566
3.831LysIle: 3.831 ± 1.08
2.919LysLys: 2.919 ± 0.754
6.751LysLeu: 6.751 ± 1.156
0.73LysMet: 0.73 ± 0.358
1.642LysAsn: 1.642 ± 0.828
1.46LysPro: 1.46 ± 0.455
1.642LysGln: 1.642 ± 0.35
3.284LysArg: 3.284 ± 0.965
6.203LysSer: 6.203 ± 1.306
5.291LysThr: 5.291 ± 1.687
4.561LysVal: 4.561 ± 0.814
0.547LysTrp: 0.547 ± 0.292
4.014LysTyr: 4.014 ± 0.674
0.0LysXaa: 0.0 ± 0.0
Leu
5.109LeuAla: 5.109 ± 0.858
2.007LeuCys: 2.007 ± 0.875
5.291LeuAsp: 5.291 ± 0.779
4.744LeuGlu: 4.744 ± 0.872
3.649LeuPhe: 3.649 ± 0.908
6.203LeuGly: 6.203 ± 0.809
2.372LeuHis: 2.372 ± 0.618
4.926LeuIle: 4.926 ± 0.416
8.575LeuLys: 8.575 ± 1.461
7.298LeuLeu: 7.298 ± 1.146
1.824LeuMet: 1.824 ± 0.76
5.838LeuAsn: 5.838 ± 0.716
3.831LeuPro: 3.831 ± 1.104
2.372LeuGln: 2.372 ± 0.575
5.291LeuArg: 5.291 ± 1.116
10.582LeuSer: 10.582 ± 1.664
4.196LeuThr: 4.196 ± 0.568
7.663LeuVal: 7.663 ± 0.948
0.547LeuTrp: 0.547 ± 0.212
4.196LeuTyr: 4.196 ± 0.686
0.0LeuXaa: 0.0 ± 0.0
Met
1.46MetAla: 1.46 ± 0.456
0.365MetCys: 0.365 ± 0.21
0.73MetAsp: 0.73 ± 0.228
1.824MetGlu: 1.824 ± 0.969
0.547MetPhe: 0.547 ± 0.331
1.46MetGly: 1.46 ± 0.455
0.547MetHis: 0.547 ± 0.315
1.277MetIle: 1.277 ± 0.418
0.73MetLys: 0.73 ± 0.293
2.189MetLeu: 2.189 ± 0.587
0.365MetMet: 0.365 ± 0.21
1.277MetAsn: 1.277 ± 0.447
0.365MetPro: 0.365 ± 0.23
0.73MetGln: 0.73 ± 0.464
1.46MetArg: 1.46 ± 0.426
2.372MetSer: 2.372 ± 1.093
0.547MetThr: 0.547 ± 0.234
2.189MetVal: 2.189 ± 0.661
0.0MetTrp: 0.0 ± 0.0
0.73MetTyr: 0.73 ± 0.385
0.0MetXaa: 0.0 ± 0.0
Asn
2.919AsnAla: 2.919 ± 0.472
1.095AsnCys: 1.095 ± 0.432
2.007AsnAsp: 2.007 ± 0.615
2.007AsnGlu: 2.007 ± 0.773
2.554AsnPhe: 2.554 ± 1.048
1.46AsnGly: 1.46 ± 0.469
1.642AsnHis: 1.642 ± 0.517
2.007AsnIle: 2.007 ± 0.722
1.642AsnLys: 1.642 ± 0.638
6.021AsnLeu: 6.021 ± 1.264
1.095AsnMet: 1.095 ± 0.511
2.554AsnAsn: 2.554 ± 0.8
1.642AsnPro: 1.642 ± 0.671
1.095AsnGln: 1.095 ± 0.488
2.372AsnArg: 2.372 ± 0.482
4.561AsnSer: 4.561 ± 0.921
2.737AsnThr: 2.737 ± 0.415
2.919AsnVal: 2.919 ± 1.084
0.0AsnTrp: 0.0 ± 0.0
1.095AsnTyr: 1.095 ± 0.237
0.0AsnXaa: 0.0 ± 0.0
Pro
2.372ProAla: 2.372 ± 0.408
0.365ProCys: 0.365 ± 0.193
0.912ProAsp: 0.912 ± 0.665
3.649ProGlu: 3.649 ± 0.691
2.554ProPhe: 2.554 ± 0.621
2.554ProGly: 2.554 ± 0.439
0.547ProHis: 0.547 ± 0.302
0.912ProIle: 0.912 ± 0.409
2.554ProLys: 2.554 ± 0.91
2.554ProLeu: 2.554 ± 0.633
0.912ProMet: 0.912 ± 0.371
1.095ProAsn: 1.095 ± 0.805
0.912ProPro: 0.912 ± 0.444
0.365ProGln: 0.365 ± 0.21
1.642ProArg: 1.642 ± 0.483
3.467ProSer: 3.467 ± 0.821
1.642ProThr: 1.642 ± 0.643
3.649ProVal: 3.649 ± 1.231
0.365ProTrp: 0.365 ± 0.21
1.46ProTyr: 1.46 ± 0.541
0.0ProXaa: 0.0 ± 0.0
Gln
1.642GlnAla: 1.642 ± 0.786
0.365GlnCys: 0.365 ± 0.21
1.277GlnAsp: 1.277 ± 0.647
1.46GlnGlu: 1.46 ± 0.428
1.277GlnPhe: 1.277 ± 0.42
1.095GlnGly: 1.095 ± 0.515
0.73GlnHis: 0.73 ± 0.384
0.73GlnIle: 0.73 ± 0.429
1.46GlnLys: 1.46 ± 0.356
1.46GlnLeu: 1.46 ± 0.455
0.182GlnMet: 0.182 ± 0.105
1.642GlnAsn: 1.642 ± 0.405
0.547GlnPro: 0.547 ± 0.344
0.365GlnGln: 0.365 ± 0.21
0.912GlnArg: 0.912 ± 0.264
1.277GlnSer: 1.277 ± 0.423
0.912GlnThr: 0.912 ± 0.393
0.912GlnVal: 0.912 ± 0.349
0.182GlnTrp: 0.182 ± 0.105
0.73GlnTyr: 0.73 ± 0.269
0.0GlnXaa: 0.0 ± 0.0
Arg
3.102ArgAla: 3.102 ± 0.922
1.277ArgCys: 1.277 ± 0.418
2.189ArgAsp: 2.189 ± 0.772
1.46ArgGlu: 1.46 ± 0.66
2.554ArgPhe: 2.554 ± 0.382
2.737ArgGly: 2.737 ± 0.777
1.46ArgHis: 1.46 ± 0.455
1.824ArgIle: 1.824 ± 0.45
3.467ArgLys: 3.467 ± 0.807
5.109ArgLeu: 5.109 ± 1.133
1.642ArgMet: 1.642 ± 0.563
2.189ArgAsn: 2.189 ± 0.474
1.824ArgPro: 1.824 ± 1.1
0.365ArgGln: 0.365 ± 0.193
4.379ArgArg: 4.379 ± 1.19
4.744ArgSer: 4.744 ± 0.701
3.467ArgThr: 3.467 ± 0.946
4.926ArgVal: 4.926 ± 1.239
0.547ArgTrp: 0.547 ± 0.221
1.095ArgTyr: 1.095 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
6.386SerAla: 6.386 ± 0.752
2.007SerCys: 2.007 ± 0.72
6.751SerAsp: 6.751 ± 0.892
6.568SerGlu: 6.568 ± 1.257
4.561SerPhe: 4.561 ± 0.922
4.744SerGly: 4.744 ± 1.172
2.372SerHis: 2.372 ± 0.786
3.467SerIle: 3.467 ± 1.034
4.379SerLys: 4.379 ± 0.695
9.487SerLeu: 9.487 ± 0.901
2.919SerMet: 2.919 ± 0.85
5.473SerAsn: 5.473 ± 1.619
2.737SerPro: 2.737 ± 0.586
2.919SerGln: 2.919 ± 0.919
5.838SerArg: 5.838 ± 0.867
10.4SerSer: 10.4 ± 1.758
5.291SerThr: 5.291 ± 0.875
6.386SerVal: 6.386 ± 0.862
0.365SerTrp: 0.365 ± 0.29
4.196SerTyr: 4.196 ± 1.283
0.0SerXaa: 0.0 ± 0.0
Thr
4.744ThrAla: 4.744 ± 1.494
1.277ThrCys: 1.277 ± 0.488
3.102ThrAsp: 3.102 ± 0.684
2.007ThrGlu: 2.007 ± 0.83
2.919ThrPhe: 2.919 ± 0.533
2.919ThrGly: 2.919 ± 1.051
0.73ThrHis: 0.73 ± 0.42
3.284ThrIle: 3.284 ± 0.789
3.102ThrLys: 3.102 ± 0.946
5.838ThrLeu: 5.838 ± 1.312
0.547ThrMet: 0.547 ± 0.212
2.737ThrAsn: 2.737 ± 1.067
3.102ThrPro: 3.102 ± 0.753
0.365ThrGln: 0.365 ± 0.21
2.554ThrArg: 2.554 ± 1.225
5.473ThrSer: 5.473 ± 1.075
3.649ThrThr: 3.649 ± 1.099
4.926ThrVal: 4.926 ± 0.949
0.73ThrTrp: 0.73 ± 0.665
2.372ThrTyr: 2.372 ± 0.612
0.0ThrXaa: 0.0 ± 0.0
Val
4.196ValAla: 4.196 ± 1.226
2.007ValCys: 2.007 ± 0.691
3.467ValAsp: 3.467 ± 0.745
4.561ValGlu: 4.561 ± 0.82
4.926ValPhe: 4.926 ± 0.931
5.109ValGly: 5.109 ± 1.289
1.46ValHis: 1.46 ± 0.465
4.014ValIle: 4.014 ± 0.909
7.298ValLys: 7.298 ± 1.072
6.751ValLeu: 6.751 ± 1.025
1.277ValMet: 1.277 ± 0.473
2.919ValAsn: 2.919 ± 0.639
3.284ValPro: 3.284 ± 0.642
2.007ValGln: 2.007 ± 0.584
5.656ValArg: 5.656 ± 0.764
7.298ValSer: 7.298 ± 1.092
3.831ValThr: 3.831 ± 1.006
8.21ValVal: 8.21 ± 1.88
0.182ValTrp: 0.182 ± 0.244
4.196ValTyr: 4.196 ± 1.202
0.0ValXaa: 0.0 ± 0.0
Trp
0.912TrpAla: 0.912 ± 0.286
0.0TrpCys: 0.0 ± 0.0
0.365TrpAsp: 0.365 ± 0.21
0.0TrpGlu: 0.0 ± 0.0
0.182TrpPhe: 0.182 ± 0.105
0.547TrpGly: 0.547 ± 0.284
0.0TrpHis: 0.0 ± 0.0
0.365TrpIle: 0.365 ± 0.215
0.365TrpLys: 0.365 ± 0.293
0.547TrpLeu: 0.547 ± 0.346
0.365TrpMet: 0.365 ± 0.487
0.547TrpAsn: 0.547 ± 0.293
0.182TrpPro: 0.182 ± 0.105
0.0TrpGln: 0.0 ± 0.0
0.365TrpArg: 0.365 ± 0.198
0.547TrpSer: 0.547 ± 0.262
0.365TrpThr: 0.365 ± 0.29
0.365TrpVal: 0.365 ± 0.327
0.182TrpTrp: 0.182 ± 0.244
0.182TrpTyr: 0.182 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.919TyrAla: 2.919 ± 0.75
1.095TyrCys: 1.095 ± 0.462
3.467TyrAsp: 3.467 ± 1.546
2.554TyrGlu: 2.554 ± 0.514
1.46TyrPhe: 1.46 ± 0.586
1.46TyrGly: 1.46 ± 0.622
0.73TyrHis: 0.73 ± 0.359
2.007TyrIle: 2.007 ± 0.713
3.102TyrLys: 3.102 ± 1.269
5.473TyrLeu: 5.473 ± 1.029
1.095TyrMet: 1.095 ± 0.524
1.642TyrAsn: 1.642 ± 0.622
1.095TyrPro: 1.095 ± 0.44
0.912TyrGln: 0.912 ± 0.403
1.46TyrArg: 1.46 ± 0.818
4.744TyrSer: 4.744 ± 0.725
2.189TyrThr: 2.189 ± 0.467
4.196TyrVal: 4.196 ± 0.822
0.182TyrTrp: 0.182 ± 0.105
2.189TyrTyr: 2.189 ± 0.373
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (5482 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski