Amino acid dipepetide frequency for Maize yellow striate virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.036AlaAla: 3.036 ± 1.456
0.759AlaCys: 0.759 ± 0.373
2.53AlaAsp: 2.53 ± 1.217
4.048AlaGlu: 4.048 ± 1.909
2.277AlaPhe: 2.277 ± 1.552
2.783AlaGly: 2.783 ± 1.506
0.506AlaHis: 0.506 ± 0.293
2.277AlaIle: 2.277 ± 1.201
3.036AlaLys: 3.036 ± 0.761
4.301AlaLeu: 4.301 ± 0.946
0.759AlaMet: 0.759 ± 0.385
1.265AlaAsn: 1.265 ± 0.781
1.012AlaPro: 1.012 ± 0.439
1.771AlaGln: 1.771 ± 0.703
4.048AlaArg: 4.048 ± 0.562
7.589AlaSer: 7.589 ± 0.775
2.783AlaThr: 2.783 ± 0.719
3.289AlaVal: 3.289 ± 0.779
0.253AlaTrp: 0.253 ± 0.328
1.518AlaTyr: 1.518 ± 1.272
0.0AlaXaa: 0.0 ± 0.0
Cys
0.253CysAla: 0.253 ± 0.156
0.506CysCys: 0.506 ± 0.616
1.265CysAsp: 1.265 ± 0.613
0.506CysGlu: 0.506 ± 0.285
0.253CysPhe: 0.253 ± 0.344
0.759CysGly: 0.759 ± 0.371
1.518CysHis: 1.518 ± 0.77
1.265CysIle: 1.265 ± 0.564
1.265CysLys: 1.265 ± 0.664
1.265CysLeu: 1.265 ± 0.551
0.506CysMet: 0.506 ± 0.267
1.012CysAsn: 1.012 ± 0.95
2.024CysPro: 2.024 ± 0.867
0.506CysGln: 0.506 ± 0.312
0.759CysArg: 0.759 ± 0.562
0.506CysSer: 0.506 ± 0.285
0.253CysThr: 0.253 ± 0.343
1.012CysVal: 1.012 ± 0.342
0.253CysTrp: 0.253 ± 0.156
1.012CysTyr: 1.012 ± 0.51
0.0CysXaa: 0.0 ± 0.0
Asp
2.53AspAla: 2.53 ± 0.76
1.518AspCys: 1.518 ± 0.62
2.53AspAsp: 2.53 ± 1.302
3.795AspGlu: 3.795 ± 1.351
2.783AspPhe: 2.783 ± 0.862
2.53AspGly: 2.53 ± 0.711
1.518AspHis: 1.518 ± 0.379
4.301AspIle: 4.301 ± 1.248
5.059AspLys: 5.059 ± 1.23
4.554AspLeu: 4.554 ± 0.671
1.771AspMet: 1.771 ± 0.597
3.036AspAsn: 3.036 ± 1.179
3.289AspPro: 3.289 ± 1.07
0.759AspGln: 0.759 ± 0.31
2.53AspArg: 2.53 ± 0.666
3.542AspSer: 3.542 ± 0.774
2.783AspThr: 2.783 ± 0.784
2.783AspVal: 2.783 ± 0.555
0.506AspTrp: 0.506 ± 0.312
1.771AspTyr: 1.771 ± 0.629
0.0AspXaa: 0.0 ± 0.0
Glu
1.518GluAla: 1.518 ± 0.535
1.265GluCys: 1.265 ± 0.571
2.277GluAsp: 2.277 ± 0.888
5.818GluGlu: 5.818 ± 3.35
1.771GluPhe: 1.771 ± 0.581
3.036GluGly: 3.036 ± 1.173
1.012GluHis: 1.012 ± 0.307
5.312GluIle: 5.312 ± 1.32
5.565GluLys: 5.565 ± 1.548
5.818GluLeu: 5.818 ± 1.211
3.036GluMet: 3.036 ± 1.233
3.036GluAsn: 3.036 ± 0.651
1.771GluPro: 1.771 ± 0.637
2.024GluGln: 2.024 ± 0.699
3.542GluArg: 3.542 ± 1.338
4.301GluSer: 4.301 ± 0.81
4.806GluThr: 4.806 ± 0.892
6.071GluVal: 6.071 ± 1.016
1.012GluTrp: 1.012 ± 1.052
2.024GluTyr: 2.024 ± 0.71
0.0GluXaa: 0.0 ± 0.0
Phe
1.518PheAla: 1.518 ± 0.547
0.253PheCys: 0.253 ± 0.156
2.783PheAsp: 2.783 ± 0.813
2.53PheGlu: 2.53 ± 0.682
1.012PhePhe: 1.012 ± 0.506
1.265PheGly: 1.265 ± 0.564
0.759PheHis: 0.759 ± 0.334
2.783PheIle: 2.783 ± 0.933
2.53PheLys: 2.53 ± 0.853
5.818PheLeu: 5.818 ± 1.317
0.506PheMet: 0.506 ± 0.312
1.518PheAsn: 1.518 ± 0.469
2.277PhePro: 2.277 ± 0.436
1.518PheGln: 1.518 ± 0.28
2.277PheArg: 2.277 ± 0.678
4.554PheSer: 4.554 ± 1.153
2.53PheThr: 2.53 ± 0.798
1.012PheVal: 1.012 ± 0.77
0.506PheTrp: 0.506 ± 0.285
1.265PheTyr: 1.265 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
2.783GlyAla: 2.783 ± 0.594
0.506GlyCys: 0.506 ± 0.687
2.53GlyAsp: 2.53 ± 0.508
4.301GlyGlu: 4.301 ± 1.001
2.024GlyPhe: 2.024 ± 0.466
2.024GlyGly: 2.024 ± 0.667
0.759GlyHis: 0.759 ± 0.339
6.071GlyIle: 6.071 ± 1.019
5.312GlyLys: 5.312 ± 2.067
6.324GlyLeu: 6.324 ± 1.365
2.783GlyMet: 2.783 ± 1.672
1.771GlyAsn: 1.771 ± 0.55
0.759GlyPro: 0.759 ± 0.375
1.518GlyGln: 1.518 ± 0.65
2.024GlyArg: 2.024 ± 0.678
4.301GlySer: 4.301 ± 1.049
1.265GlyThr: 1.265 ± 0.463
5.312GlyVal: 5.312 ± 0.932
1.771GlyTrp: 1.771 ± 0.59
1.771GlyTyr: 1.771 ± 0.576
0.0GlyXaa: 0.0 ± 0.0
His
1.012HisAla: 1.012 ± 0.543
0.506HisCys: 0.506 ± 0.293
0.759HisAsp: 0.759 ± 0.306
2.024HisGlu: 2.024 ± 1.051
0.506HisPhe: 0.506 ± 0.285
1.012HisGly: 1.012 ± 0.347
0.253HisHis: 0.253 ± 0.156
2.277HisIle: 2.277 ± 1.013
1.265HisLys: 1.265 ± 0.809
2.783HisLeu: 2.783 ± 1.431
0.0HisMet: 0.0 ± 0.0
1.012HisAsn: 1.012 ± 0.394
1.771HisPro: 1.771 ± 0.584
0.0HisGln: 0.0 ± 0.0
1.012HisArg: 1.012 ± 0.443
1.518HisSer: 1.518 ± 0.517
0.506HisThr: 0.506 ± 0.534
2.024HisVal: 2.024 ± 0.971
0.253HisTrp: 0.253 ± 0.156
1.012HisTyr: 1.012 ± 0.394
0.0HisXaa: 0.0 ± 0.0
Ile
2.53IleAla: 2.53 ± 0.651
0.506IleCys: 0.506 ± 0.654
3.289IleAsp: 3.289 ± 1.075
3.036IleGlu: 3.036 ± 0.8
4.048IlePhe: 4.048 ± 0.691
4.806IleGly: 4.806 ± 0.78
1.265IleHis: 1.265 ± 0.555
3.795IleIle: 3.795 ± 1.68
4.301IleLys: 4.301 ± 1.139
6.83IleLeu: 6.83 ± 1.289
1.518IleMet: 1.518 ± 0.864
4.048IleAsn: 4.048 ± 1.007
2.53IlePro: 2.53 ± 0.423
2.53IleGln: 2.53 ± 1.353
4.048IleArg: 4.048 ± 0.667
8.348IleSer: 8.348 ± 1.756
5.565IleThr: 5.565 ± 1.074
3.795IleVal: 3.795 ± 1.083
1.012IleTrp: 1.012 ± 0.394
2.53IleTyr: 2.53 ± 0.735
0.0IleXaa: 0.0 ± 0.0
Lys
1.518LysAla: 1.518 ± 0.974
1.518LysCys: 1.518 ± 0.642
2.783LysAsp: 2.783 ± 0.858
4.048LysGlu: 4.048 ± 0.77
2.277LysPhe: 2.277 ± 0.692
5.818LysGly: 5.818 ± 1.534
1.265LysHis: 1.265 ± 0.398
5.565LysIle: 5.565 ± 1.052
5.818LysLys: 5.818 ± 1.815
4.554LysLeu: 4.554 ± 0.891
2.783LysMet: 2.783 ± 1.194
4.554LysAsn: 4.554 ± 1.85
1.518LysPro: 1.518 ± 0.452
0.759LysGln: 0.759 ± 0.306
4.048LysArg: 4.048 ± 0.643
5.312LysSer: 5.312 ± 1.075
3.795LysThr: 3.795 ± 1.808
4.048LysVal: 4.048 ± 0.908
1.771LysTrp: 1.771 ± 0.581
2.53LysTyr: 2.53 ± 0.437
0.0LysXaa: 0.0 ± 0.0
Leu
3.795LeuAla: 3.795 ± 1.225
2.024LeuCys: 2.024 ± 0.988
6.577LeuAsp: 6.577 ± 1.909
4.048LeuGlu: 4.048 ± 1.163
3.036LeuPhe: 3.036 ± 0.821
4.806LeuGly: 4.806 ± 0.841
2.53LeuHis: 2.53 ± 0.925
7.842LeuIle: 7.842 ± 2.063
8.348LeuLys: 8.348 ± 1.144
10.119LeuLeu: 10.119 ± 1.953
4.301LeuMet: 4.301 ± 1.224
1.771LeuAsn: 1.771 ± 0.605
3.795LeuPro: 3.795 ± 1.178
2.53LeuGln: 2.53 ± 0.803
5.565LeuArg: 5.565 ± 1.373
7.336LeuSer: 7.336 ± 0.936
6.577LeuThr: 6.577 ± 1.314
5.818LeuVal: 5.818 ± 1.754
1.771LeuTrp: 1.771 ± 0.739
3.542LeuTyr: 3.542 ± 1.589
0.0LeuXaa: 0.0 ± 0.0
Met
1.518MetAla: 1.518 ± 0.451
0.253MetCys: 0.253 ± 0.156
1.265MetAsp: 1.265 ± 0.614
2.277MetGlu: 2.277 ± 1.63
1.265MetPhe: 1.265 ± 0.332
2.277MetGly: 2.277 ± 0.901
0.759MetHis: 0.759 ± 0.339
2.53MetIle: 2.53 ± 0.423
3.542MetLys: 3.542 ± 1.345
2.024MetLeu: 2.024 ± 0.662
1.012MetMet: 1.012 ± 0.382
1.771MetAsn: 1.771 ± 0.815
1.518MetPro: 1.518 ± 0.75
0.506MetGln: 0.506 ± 0.456
1.771MetArg: 1.771 ± 0.691
3.795MetSer: 3.795 ± 0.825
1.518MetThr: 1.518 ± 0.451
2.024MetVal: 2.024 ± 0.744
0.759MetTrp: 0.759 ± 0.527
0.506MetTyr: 0.506 ± 0.293
0.0MetXaa: 0.0 ± 0.0
Asn
3.795AsnAla: 3.795 ± 0.808
1.012AsnCys: 1.012 ± 0.795
2.024AsnAsp: 2.024 ± 0.596
1.518AsnGlu: 1.518 ± 0.346
1.518AsnPhe: 1.518 ± 0.77
1.265AsnGly: 1.265 ± 0.585
1.012AsnHis: 1.012 ± 0.46
4.048AsnIle: 4.048 ± 1.119
1.265AsnLys: 1.265 ± 0.382
3.795AsnLeu: 3.795 ± 1.36
2.024AsnMet: 2.024 ± 0.482
1.771AsnAsn: 1.771 ± 1.403
2.783AsnPro: 2.783 ± 0.93
1.012AsnGln: 1.012 ± 0.571
2.783AsnArg: 2.783 ± 0.629
3.542AsnSer: 3.542 ± 1.613
2.783AsnThr: 2.783 ± 0.879
3.289AsnVal: 3.289 ± 1.792
0.506AsnTrp: 0.506 ± 0.433
2.277AsnTyr: 2.277 ± 0.857
0.0AsnXaa: 0.0 ± 0.0
Pro
4.048ProAla: 4.048 ± 0.712
0.0ProCys: 0.0 ± 0.0
3.795ProAsp: 3.795 ± 0.943
1.771ProGlu: 1.771 ± 0.547
0.506ProPhe: 0.506 ± 0.312
4.301ProGly: 4.301 ± 0.917
1.012ProHis: 1.012 ± 0.738
1.771ProIle: 1.771 ± 0.933
1.771ProLys: 1.771 ± 0.6
3.795ProLeu: 3.795 ± 0.701
1.771ProMet: 1.771 ± 0.402
1.771ProAsn: 1.771 ± 0.505
1.518ProPro: 1.518 ± 0.75
1.012ProGln: 1.012 ± 0.589
3.036ProArg: 3.036 ± 0.42
4.048ProSer: 4.048 ± 1.095
2.783ProThr: 2.783 ± 0.807
1.012ProVal: 1.012 ± 0.571
0.759ProTrp: 0.759 ± 0.306
2.53ProTyr: 2.53 ± 0.985
0.0ProXaa: 0.0 ± 0.0
Gln
1.518GlnAla: 1.518 ± 0.667
0.759GlnCys: 0.759 ± 0.612
2.783GlnAsp: 2.783 ± 0.725
2.53GlnGlu: 2.53 ± 0.722
1.518GlnPhe: 1.518 ± 0.826
2.783GlnGly: 2.783 ± 0.865
1.771GlnHis: 1.771 ± 0.847
1.012GlnIle: 1.012 ± 0.394
1.012GlnLys: 1.012 ± 0.655
2.277GlnLeu: 2.277 ± 1.104
1.012GlnMet: 1.012 ± 0.59
0.506GlnAsn: 0.506 ± 0.285
0.506GlnPro: 0.506 ± 0.267
1.012GlnGln: 1.012 ± 0.733
1.265GlnArg: 1.265 ± 0.891
2.277GlnSer: 2.277 ± 0.741
1.518GlnThr: 1.518 ± 0.775
0.506GlnVal: 0.506 ± 0.433
0.0GlnTrp: 0.0 ± 0.0
0.759GlnTyr: 0.759 ± 0.339
0.0GlnXaa: 0.0 ± 0.0
Arg
2.783ArgAla: 2.783 ± 0.983
0.759ArgCys: 0.759 ± 0.371
2.53ArgAsp: 2.53 ± 0.623
4.048ArgGlu: 4.048 ± 0.723
2.783ArgPhe: 2.783 ± 0.997
2.783ArgGly: 2.783 ± 1.11
0.506ArgHis: 0.506 ± 0.267
2.277ArgIle: 2.277 ± 0.725
2.783ArgLys: 2.783 ± 0.705
7.336ArgLeu: 7.336 ± 1.16
2.277ArgMet: 2.277 ± 1.135
2.277ArgAsn: 2.277 ± 0.867
2.783ArgPro: 2.783 ± 0.345
1.012ArgGln: 1.012 ± 0.506
2.783ArgArg: 2.783 ± 0.675
2.53ArgSer: 2.53 ± 0.831
4.301ArgThr: 4.301 ± 0.892
3.795ArgVal: 3.795 ± 0.923
0.253ArgTrp: 0.253 ± 0.156
3.795ArgTyr: 3.795 ± 1.085
0.0ArgXaa: 0.0 ± 0.0
Ser
4.554SerAla: 4.554 ± 2.028
2.277SerCys: 2.277 ± 1.573
5.312SerAsp: 5.312 ± 1.167
5.059SerGlu: 5.059 ± 1.941
2.783SerPhe: 2.783 ± 0.948
5.818SerGly: 5.818 ± 1.312
1.518SerHis: 1.518 ± 0.942
5.818SerIle: 5.818 ± 0.991
4.301SerLys: 4.301 ± 0.723
10.119SerLeu: 10.119 ± 2.08
2.024SerMet: 2.024 ± 0.55
4.048SerAsn: 4.048 ± 0.783
3.289SerPro: 3.289 ± 0.775
2.277SerGln: 2.277 ± 1.018
4.048SerArg: 4.048 ± 1.267
8.348SerSer: 8.348 ± 1.677
3.036SerThr: 3.036 ± 1.087
6.83SerVal: 6.83 ± 1.294
2.277SerTrp: 2.277 ± 0.606
2.024SerTyr: 2.024 ± 0.773
0.0SerXaa: 0.0 ± 0.0
Thr
4.301ThrAla: 4.301 ± 1.502
0.506ThrCys: 0.506 ± 0.285
3.036ThrAsp: 3.036 ± 1.016
4.806ThrGlu: 4.806 ± 1.558
3.542ThrPhe: 3.542 ± 0.533
3.036ThrGly: 3.036 ± 1.123
1.012ThrHis: 1.012 ± 0.548
3.795ThrIle: 3.795 ± 1.162
3.289ThrLys: 3.289 ± 1.321
4.048ThrLeu: 4.048 ± 0.952
1.771ThrMet: 1.771 ± 0.772
2.53ThrAsn: 2.53 ± 0.996
3.289ThrPro: 3.289 ± 0.663
2.53ThrGln: 2.53 ± 1.32
2.783ThrArg: 2.783 ± 1.135
3.795ThrSer: 3.795 ± 1.158
3.036ThrThr: 3.036 ± 1.489
3.289ThrVal: 3.289 ± 1.215
1.012ThrTrp: 1.012 ± 0.634
3.795ThrTyr: 3.795 ± 1.061
0.0ThrXaa: 0.0 ± 0.0
Val
3.542ValAla: 3.542 ± 0.932
1.518ValCys: 1.518 ± 0.523
3.289ValAsp: 3.289 ± 0.968
5.059ValGlu: 5.059 ± 1.494
2.53ValPhe: 2.53 ± 0.733
2.277ValGly: 2.277 ± 0.533
1.012ValHis: 1.012 ± 0.835
4.048ValIle: 4.048 ± 1.011
3.036ValLys: 3.036 ± 1.099
5.818ValLeu: 5.818 ± 1.44
1.518ValMet: 1.518 ± 0.375
3.795ValAsn: 3.795 ± 0.834
2.53ValPro: 2.53 ± 0.859
1.012ValGln: 1.012 ± 0.714
3.289ValArg: 3.289 ± 0.834
6.577ValSer: 6.577 ± 2.147
4.806ValThr: 4.806 ± 1.352
4.806ValVal: 4.806 ± 1.347
1.012ValTrp: 1.012 ± 0.872
2.024ValTyr: 2.024 ± 0.687
0.0ValXaa: 0.0 ± 0.0
Trp
1.012TrpAla: 1.012 ± 0.398
0.0TrpCys: 0.0 ± 0.0
1.012TrpAsp: 1.012 ± 0.46
1.518TrpGlu: 1.518 ± 0.794
0.759TrpPhe: 0.759 ± 0.469
1.012TrpGly: 1.012 ± 0.625
0.253TrpHis: 0.253 ± 0.552
1.771TrpIle: 1.771 ± 0.504
1.265TrpLys: 1.265 ± 0.717
1.012TrpLeu: 1.012 ± 0.483
0.506TrpMet: 0.506 ± 0.312
1.012TrpAsn: 1.012 ± 0.503
0.0TrpPro: 0.0 ± 0.0
0.253TrpGln: 0.253 ± 0.156
1.012TrpArg: 1.012 ± 0.625
0.253TrpSer: 0.253 ± 0.156
2.277TrpThr: 2.277 ± 1.552
0.759TrpVal: 0.759 ± 0.621
0.0TrpTrp: 0.0 ± 0.0
0.253TrpTyr: 0.253 ± 0.343
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.277TyrAla: 2.277 ± 0.844
0.253TyrCys: 0.253 ± 0.156
1.518TyrAsp: 1.518 ± 0.58
2.277TyrGlu: 2.277 ± 0.61
2.024TyrPhe: 2.024 ± 0.816
1.518TyrGly: 1.518 ± 0.419
1.265TyrHis: 1.265 ± 0.433
1.771TyrIle: 1.771 ± 0.603
1.771TyrLys: 1.771 ± 0.846
3.542TyrLeu: 3.542 ± 1.068
0.759TyrMet: 0.759 ± 0.459
1.518TyrAsn: 1.518 ± 0.522
4.048TyrPro: 4.048 ± 0.866
2.783TyrGln: 2.783 ± 1.063
1.771TyrArg: 1.771 ± 0.41
3.289TyrSer: 3.289 ± 0.868
2.277TyrThr: 2.277 ± 0.859
2.024TyrVal: 2.024 ± 0.35
0.253TyrTrp: 0.253 ± 0.156
2.277TyrTyr: 2.277 ± 0.775
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3954 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski