Amino acid dipeptide frequency for Bean golden yellow mosaic virus (isolate Puerto Rico) (BGYMV) (Bean golden mosaic virus (isolate Puerto Rico))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
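The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used to build the database: the function names, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions, since the page does not say how boundaries are handled.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as Xaa ('X').
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over the 400 standard dipeptides, in per mille (0.25% = 2.5 per mille).
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs (0,1), (1,2), ...; assumption: pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or underrepresented relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the BGYMV proteome.
toy = ["MSSTA", "SSK"]
freqs = dipeptide_permille(toy)
```

With a real proteome, `sequences` would hold the protein sequences in one-letter code, and `classify` would reproduce the red/blue distinction relative to 2.5‰.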

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Within each group the second residue runs in the order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 0.652 ± 0.524
AlaCys: 0.652 ± 0.483
AlaAsp: 2.608 ± 0.948
AlaGlu: 0.652 ± 0.689
AlaPhe: 1.304 ± 0.845
AlaGly: 1.956 ± 1.061
AlaHis: 1.304 ± 0.747
AlaIle: 3.911 ± 1.456
AlaLys: 4.563 ± 0.773
AlaLeu: 6.519 ± 2.649
AlaMet: 0.0 ± 0.0
AlaAsn: 2.608 ± 0.513
AlaPro: 1.956 ± 0.634
AlaGln: 3.911 ± 1.516
AlaArg: 3.259 ± 1.315
AlaSer: 7.171 ± 2.434
AlaThr: 2.608 ± 1.234
AlaVal: 1.304 ± 0.856
AlaTrp: 0.652 ± 0.483
AlaTyr: 0.652 ± 0.689
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.956 ± 2.043
CysCys: 0.652 ± 0.681
CysAsp: 0.0 ± 0.0
CysGlu: 1.304 ± 0.605
CysPhe: 0.0 ± 0.0
CysGly: 0.652 ± 0.681
CysHis: 0.0 ± 0.0
CysIle: 2.608 ± 1.072
CysLys: 1.304 ± 0.605
CysLeu: 0.652 ± 0.622
CysMet: 0.652 ± 0.547
CysAsn: 1.956 ± 0.606
CysPro: 0.0 ± 0.0
CysGln: 0.652 ± 0.524
CysArg: 0.652 ± 0.622
CysSer: 1.304 ± 0.988
CysThr: 1.956 ± 1.024
CysVal: 1.304 ± 0.643
CysTrp: 0.652 ± 0.622
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.304 ± 0.643
AspCys: 1.304 ± 0.966
AspAsp: 2.608 ± 1.606
AspGlu: 1.956 ± 0.74
AspPhe: 4.563 ± 1.016
AspGly: 2.608 ± 1.247
AspHis: 2.608 ± 1.379
AspIle: 6.519 ± 1.67
AspLys: 2.608 ± 0.711
AspLeu: 5.215 ± 1.271
AspMet: 0.0 ± 0.0
AspAsn: 2.608 ± 0.954
AspPro: 2.608 ± 1.208
AspGln: 0.652 ± 0.483
AspArg: 2.608 ± 1.474
AspSer: 7.171 ± 1.584
AspThr: 1.304 ± 0.714
AspVal: 2.608 ± 1.102
AspTrp: 1.956 ± 1.572
AspTyr: 2.608 ± 0.879
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.304 ± 0.605
GluCys: 0.652 ± 0.547
GluAsp: 1.304 ± 0.856
GluGlu: 3.911 ± 2.18
GluPhe: 0.652 ± 0.681
GluGly: 3.911 ± 1.427
GluHis: 0.652 ± 0.681
GluIle: 1.956 ± 1.64
GluLys: 0.652 ± 0.524
GluLeu: 3.259 ± 0.871
GluMet: 0.0 ± 0.0
GluAsn: 4.563 ± 2.576
GluPro: 2.608 ± 0.901
GluGln: 1.956 ± 1.024
GluArg: 3.259 ± 1.315
GluSer: 4.563 ± 1.714
GluThr: 0.0 ± 0.0
GluVal: 1.304 ± 1.053
GluTrp: 0.652 ± 0.524
GluTyr: 3.259 ± 1.3
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.304 ± 0.845
PheCys: 0.652 ± 0.483
PheAsp: 2.608 ± 1.102
PheGlu: 0.652 ± 0.524
PhePhe: 1.304 ± 0.655
PheGly: 2.608 ± 1.102
PheHis: 0.652 ± 0.524
PheIle: 3.259 ± 1.006
PheLys: 3.911 ± 1.224
PheLeu: 1.956 ± 1.091
PheMet: 0.0 ± 0.0
PheAsn: 3.911 ± 0.912
PhePro: 1.956 ± 1.086
PheGln: 1.956 ± 0.919
PheArg: 4.563 ± 0.742
PheSer: 4.563 ± 2.587
PheThr: 0.652 ± 0.681
PheVal: 2.608 ± 1.247
PheTrp: 1.956 ± 1.061
PheTyr: 2.608 ± 1.502
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.956 ± 0.606
GlyCys: 1.956 ± 0.888
GlyAsp: 1.956 ± 0.941
GlyGlu: 3.911 ± 1.458
GlyPhe: 1.304 ± 0.877
GlyGly: 3.911 ± 1.549
GlyHis: 0.652 ± 0.524
GlyIle: 1.956 ± 0.84
GlyLys: 7.171 ± 1.573
GlyLeu: 1.304 ± 0.714
GlyMet: 1.304 ± 0.709
GlyAsn: 1.956 ± 1.08
GlyPro: 3.259 ± 0.514
GlyGln: 3.259 ± 1.472
GlyArg: 2.608 ± 1.606
GlySer: 5.215 ± 0.838
GlyThr: 3.259 ± 1.284
GlyVal: 3.259 ± 1.022
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.652 ± 0.622
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.956 ± 0.775
HisCys: 1.304 ± 0.783
HisAsp: 4.563 ± 1.068
HisGlu: 0.0 ± 0.0
HisPhe: 1.304 ± 0.655
HisGly: 1.304 ± 0.877
HisHis: 0.652 ± 0.681
HisIle: 1.956 ± 1.132
HisLys: 1.304 ± 0.925
HisLeu: 3.259 ± 1.102
HisMet: 0.652 ± 0.765
HisAsn: 3.911 ± 1.808
HisPro: 1.956 ± 0.806
HisGln: 3.259 ± 0.949
HisArg: 3.259 ± 1.632
HisSer: 0.652 ± 0.547
HisThr: 4.563 ± 1.453
HisVal: 2.608 ± 1.286
HisTrp: 0.0 ± 0.0
HisTyr: 0.652 ± 0.547
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.956 ± 0.862
IleCys: 0.652 ± 0.524
IleAsp: 5.215 ± 1.628
IleGlu: 5.215 ± 1.785
IlePhe: 3.259 ± 1.502
IleGly: 2.608 ± 1.16
IleHis: 3.911 ± 1.657
IleIle: 1.956 ± 1.086
IleLys: 5.215 ± 1.978
IleLeu: 2.608 ± 1.017
IleMet: 2.608 ± 1.051
IleAsn: 3.259 ± 1.463
IlePro: 4.563 ± 1.869
IleGln: 3.259 ± 1.53
IleArg: 6.519 ± 1.262
IleSer: 3.259 ± 1.284
IleThr: 3.911 ± 1.463
IleVal: 3.911 ± 1.558
IleTrp: 1.304 ± 0.874
IleTyr: 1.956 ± 1.233
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.563 ± 1.633
LysCys: 0.652 ± 0.681
LysAsp: 7.171 ± 2.019
LysGlu: 3.911 ± 2.619
LysPhe: 2.608 ± 1.068
LysGly: 2.608 ± 0.513
LysHis: 1.304 ± 0.783
LysIle: 5.867 ± 1.546
LysLys: 0.652 ± 0.524
LysLeu: 4.563 ± 1.891
LysMet: 1.956 ± 1.03
LysAsn: 4.563 ± 0.92
LysPro: 3.259 ± 0.726
LysGln: 0.652 ± 0.547
LysArg: 6.519 ± 2.653
LysSer: 6.519 ± 1.222
LysThr: 1.304 ± 0.803
LysVal: 5.215 ± 2.526
LysTrp: 0.652 ± 0.547
LysTyr: 3.259 ± 0.642
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 1.304 ± 0.714
LeuCys: 0.652 ± 0.524
LeuAsp: 4.563 ± 1.36
LeuGlu: 1.304 ± 0.747
LeuPhe: 2.608 ± 1.064
LeuGly: 3.911 ± 0.575
LeuHis: 3.911 ± 1.236
LeuIle: 1.956 ± 1.053
LeuLys: 8.475 ± 1.472
LeuLeu: 4.563 ± 2.301
LeuMet: 0.652 ± 0.483
LeuAsn: 5.215 ± 1.497
LeuPro: 1.304 ± 0.988
LeuGln: 3.911 ± 1.707
LeuArg: 1.304 ± 0.856
LeuSer: 8.475 ± 3.054
LeuThr: 2.608 ± 1.529
LeuVal: 4.563 ± 1.247
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.911 ± 1.352
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.304 ± 0.967
MetCys: 0.652 ± 0.483
MetAsp: 3.259 ± 1.204
MetGlu: 1.304 ± 0.988
MetPhe: 0.652 ± 0.483
MetGly: 0.652 ± 0.483
MetHis: 1.304 ± 0.737
MetIle: 0.652 ± 0.622
MetLys: 0.652 ± 0.547
MetLeu: 1.304 ± 0.737
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.956 ± 0.635
MetGln: 0.652 ± 0.524
MetArg: 1.956 ± 1.132
MetSer: 1.956 ± 1.867
MetThr: 0.652 ± 0.547
MetVal: 0.0 ± 0.0
MetTrp: 1.304 ± 0.623
MetTyr: 1.956 ± 1.08
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.215 ± 1.23
AsnCys: 1.956 ± 0.806
AsnAsp: 2.608 ± 0.941
AsnGlu: 3.259 ± 1.533
AsnPhe: 0.652 ± 0.689
AsnGly: 2.608 ± 1.057
AsnHis: 4.563 ± 2.325
AsnIle: 5.215 ± 1.103
AsnLys: 5.215 ± 1.033
AsnLeu: 2.608 ± 0.68
AsnMet: 2.608 ± 1.391
AsnAsn: 2.608 ± 1.266
AsnPro: 2.608 ± 0.826
AsnGln: 2.608 ± 1.545
AsnArg: 2.608 ± 0.943
AsnSer: 4.563 ± 1.845
AsnThr: 1.956 ± 1.17
AsnVal: 1.956 ± 0.808
AsnTrp: 0.652 ± 0.524
AsnTyr: 4.563 ± 1.196
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.304 ± 0.747
ProCys: 0.652 ± 0.483
ProAsp: 1.304 ± 0.737
ProGlu: 1.956 ± 1.172
ProPhe: 1.956 ± 0.806
ProGly: 2.608 ± 1.198
ProHis: 2.608 ± 1.065
ProIle: 3.259 ± 2.116
ProLys: 4.563 ± 1.016
ProLeu: 2.608 ± 1.176
ProMet: 1.304 ± 0.967
ProAsn: 3.911 ± 0.798
ProPro: 3.259 ± 1.539
ProGln: 3.259 ± 1.463
ProArg: 1.304 ± 0.605
ProSer: 5.867 ± 1.821
ProThr: 1.956 ± 1.665
ProVal: 2.608 ± 1.09
ProTrp: 1.956 ± 0.634
ProTyr: 1.304 ± 0.643
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.608 ± 1.518
GlnCys: 0.652 ± 0.622
GlnAsp: 1.956 ± 1.172
GlnGlu: 1.956 ± 1.06
GlnPhe: 3.911 ± 1.024
GlnGly: 1.956 ± 0.606
GlnHis: 1.956 ± 1.133
GlnIle: 1.956 ± 1.064
GlnLys: 1.304 ± 1.048
GlnLeu: 4.563 ± 1.891
GlnMet: 0.0 ± 0.0
GlnAsn: 0.652 ± 0.524
GlnPro: 3.259 ± 1.356
GlnGln: 1.304 ± 0.655
GlnArg: 4.563 ± 1.372
GlnSer: 7.171 ± 2.087
GlnThr: 0.652 ± 0.524
GlnVal: 5.215 ± 1.376
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.652 ± 0.483
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.608 ± 1.16
ArgCys: 1.956 ± 0.606
ArgAsp: 2.608 ± 1.43
ArgGlu: 1.956 ± 1.133
ArgPhe: 7.171 ± 1.742
ArgGly: 3.911 ± 1.151
ArgHis: 3.259 ± 1.389
ArgIle: 4.563 ± 1.393
ArgLys: 3.259 ± 0.949
ArgLeu: 3.911 ± 1.34
ArgMet: 1.956 ± 0.74
ArgAsn: 0.0 ± 0.0
ArgPro: 2.608 ± 1.211
ArgGln: 1.304 ± 0.714
ArgArg: 5.215 ± 2.563
ArgSer: 6.519 ± 1.493
ArgThr: 3.911 ± 1.34
ArgVal: 3.911 ± 0.575
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.956 ± 1.276
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.519 ± 2.892
SerCys: 1.956 ± 0.862
SerAsp: 3.259 ± 0.849
SerGlu: 0.0 ± 0.0
SerPhe: 1.956 ± 0.806
SerGly: 4.563 ± 1.396
SerHis: 3.911 ± 1.705
SerIle: 8.475 ± 2.094
SerLys: 7.171 ± 2.629
SerLeu: 5.215 ± 1.796
SerMet: 0.0 ± 0.0
SerAsn: 8.475 ± 1.949
SerPro: 4.563 ± 2.244
SerGln: 4.563 ± 1.714
SerArg: 3.911 ± 0.575
SerSer: 10.43 ± 3.466
SerThr: 10.43 ± 2.661
SerVal: 5.215 ± 1.41
SerTrp: 1.304 ± 1.093
SerTyr: 3.911 ± 1.112
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.867 ± 0.982
ThrCys: 0.0 ± 0.0
ThrAsp: 1.956 ± 0.862
ThrGlu: 1.304 ± 0.643
ThrPhe: 3.911 ± 2.514
ThrGly: 3.259 ± 1.362
ThrHis: 3.259 ± 1.204
ThrIle: 3.259 ± 2.028
ThrLys: 1.956 ± 1.47
ThrLeu: 2.608 ± 0.513
ThrMet: 1.956 ± 0.917
ThrAsn: 3.911 ± 1.175
ThrPro: 1.304 ± 0.605
ThrGln: 0.652 ± 0.848
ThrArg: 3.259 ± 1.33
ThrSer: 3.259 ± 1.845
ThrThr: 5.215 ± 1.621
ThrVal: 3.259 ± 1.197
ThrTrp: 0.652 ± 0.848
ThrTyr: 1.304 ± 0.747
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.0 ± 0.0
ValCys: 0.652 ± 0.524
ValAsp: 3.911 ± 1.212
ValGlu: 3.259 ± 0.822
ValPhe: 1.956 ± 0.971
ValGly: 1.956 ± 1.08
ValHis: 0.652 ± 0.547
ValIle: 4.563 ± 1.161
ValLys: 3.259 ± 0.791
ValLeu: 3.911 ± 1.236
ValMet: 3.259 ± 1.272
ValAsn: 4.563 ± 1.144
ValPro: 4.563 ± 1.044
ValGln: 4.563 ± 0.557
ValArg: 2.608 ± 1.018
ValSer: 4.563 ± 1.152
ValThr: 1.956 ± 0.969
ValVal: 3.259 ± 0.849
ValTrp: 0.652 ± 0.689
ValTyr: 4.563 ± 1.957
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.304 ± 0.655
TrpCys: 0.0 ± 0.0
TrpAsp: 0.652 ± 0.681
TrpGlu: 1.304 ± 0.845
TrpPhe: 0.0 ± 0.0
TrpGly: 0.652 ± 0.524
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.956 ± 0.634
TrpLeu: 1.304 ± 0.967
TrpMet: 1.304 ± 0.737
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.652 ± 0.524
TrpArg: 0.0 ± 0.0
TrpSer: 0.652 ± 0.622
TrpThr: 1.956 ± 0.808
TrpVal: 1.956 ± 1.024
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.652 ± 0.848
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.259 ± 1.272
TyrCys: 0.652 ± 0.622
TyrAsp: 1.304 ± 0.737
TyrGlu: 1.304 ± 0.967
TyrPhe: 2.608 ± 1.068
TyrGly: 2.608 ± 1.017
TyrHis: 1.956 ± 1.471
TyrIle: 3.259 ± 0.948
TyrLys: 3.259 ± 0.642
TyrLeu: 3.259 ± 1.53
TyrMet: 1.304 ± 0.828
TyrAsn: 1.956 ± 0.635
TyrPro: 1.956 ± 1.025
TyrGln: 3.259 ± 0.948
TyrArg: 2.608 ± 1.286
TyrSer: 1.956 ± 0.635
TyrThr: 1.304 ± 0.925
TyrVal: 2.608 ± 0.782
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.304 ± 0.714
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (1535 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
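A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's implementation: the function names and toy sequences are hypothetical, and taking the standard deviation of the resampled frequencies as the error is an assumption, since the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Whole proteins are resampled with replacement in each round.
    Assumption: the reported error is the standard deviation across rounds.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the BGYMV sequences.
toy = ["MSSTA", "SSK", "AAKL"]
err_ss = bootstrap_error(toy, "SS")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs in any protein has zero frequency in every resample, so its error is exactly zero, which matches the "0.0 ± 0.0" entries in the table.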

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski