Amino acid dipeptide frequency for White clover mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
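The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are taken as overlapping pairs within each protein (never across two proteins), frequencies are normalised by the total number of pairs, and the toy sequences are not the real proteome. The exact counting and normalisation used by the database are not described here and may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each protein only, and any
    non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 cells, including dipeptides never observed.
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input for illustration only, not the White clover mottle virus proteome.
toy = ["MSSLLA", "MLLSSK"]
freqs = dipeptide_permille(toy)
print(len(freqs))                          # 441
print(freqs["SS"], classify(freqs["SS"]))  # 200.0 over
print(classify(3.642), classify(0.607))    # over under (AlaAla, AlaCys in the table)
```

With only 400 equally likely dipeptides as the reference, a value above 2.5‰ counts as overrepresented and a value below it as underrepresented; a reference based on the observed single amino acid frequencies would give different labels.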

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
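As a quick check of the units, the conversions between per mille, fraction and percent can be written out as below. The helper names are hypothetical and exist only for this illustration; 3.642 is the AlaAla value from the table and 0.25% is the uniform expectation quoted above.

```python
def permille_to_fraction(value):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value * 1e-3


def permille_to_percent(value):
    """Per mille to percent: divide by 10."""
    return value / 10.0


def percent_to_permille(value):
    """Percent to per mille: multiply by 10."""
    return value * 10.0


ala_ala = 3.642                       # AlaAla, in per mille
print(permille_to_fraction(ala_ala))  # roughly 0.003642
print(permille_to_percent(ala_ala))   # roughly 0.3642 (percent)
print(percent_to_permille(0.25))      # 2.5, the uniform expectation in per mille
```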

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.642 ± 0.886
AlaCys: 0.607 ± 0.397
AlaAsp: 1.517 ± 0.742
AlaGlu: 5.159 ± 1.466
AlaPhe: 3.035 ± 0.653
AlaGly: 2.731 ± 0.855
AlaHis: 0.607 ± 0.261
AlaIle: 2.731 ± 0.701
AlaLys: 3.338 ± 1.037
AlaLeu: 6.07 ± 1.606
AlaMet: 0.303 ± 0.243
AlaAsn: 2.428 ± 0.814
AlaPro: 3.945 ± 0.91
AlaGln: 2.428 ± 1.016
AlaArg: 5.463 ± 1.07
AlaSer: 5.463 ± 1.064
AlaThr: 0.303 ± 0.539
AlaVal: 3.945 ± 0.793
AlaTrp: 0.91 ± 0.395
AlaTyr: 4.249 ± 0.86
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.303 ± 0.198
CysCys: 0.0 ± 0.0
CysAsp: 1.517 ± 0.504
CysGlu: 0.303 ± 0.27
CysPhe: 0.91 ± 0.399
CysGly: 1.821 ± 0.88
CysHis: 0.607 ± 0.937
CysIle: 0.91 ± 0.399
CysLys: 2.124 ± 0.651
CysLeu: 0.91 ± 0.395
CysMet: 0.0 ± 0.0
CysAsn: 1.214 ± 0.521
CysPro: 1.214 ± 0.456
CysGln: 0.607 ± 0.522
CysArg: 0.303 ± 0.27
CysSer: 2.428 ± 0.905
CysThr: 1.821 ± 0.782
CysVal: 0.91 ± 0.289
CysTrp: 0.303 ± 0.198
CysTyr: 0.607 ± 0.261
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.249 ± 1.489
AspCys: 3.035 ± 0.355
AspAsp: 4.552 ± 1.109
AspGlu: 5.463 ± 0.736
AspPhe: 2.428 ± 0.747
AspGly: 2.428 ± 0.53
AspHis: 0.0 ± 0.0
AspIle: 1.821 ± 0.572
AspLys: 0.91 ± 0.653
AspLeu: 3.945 ± 1.094
AspMet: 0.607 ± 0.261
AspAsn: 1.517 ± 1.102
AspPro: 2.124 ± 0.473
AspGln: 2.124 ± 0.473
AspArg: 1.821 ± 0.585
AspSer: 2.428 ± 1.218
AspThr: 1.517 ± 0.451
AspVal: 3.338 ± 1.338
AspTrp: 2.731 ± 0.816
AspTyr: 0.303 ± 0.27
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.642 ± 0.408
GluCys: 2.124 ± 0.71
GluAsp: 6.07 ± 1.352
GluGlu: 4.856 ± 1.654
GluPhe: 3.642 ± 0.895
GluGly: 5.159 ± 2.453
GluHis: 1.517 ± 0.504
GluIle: 3.338 ± 0.717
GluLys: 4.249 ± 0.981
GluLeu: 7.891 ± 1.021
GluMet: 0.607 ± 0.397
GluAsn: 3.035 ± 0.784
GluPro: 2.731 ± 0.978
GluGln: 3.642 ± 0.954
GluArg: 3.338 ± 0.703
GluSer: 2.428 ± 0.842
GluThr: 3.642 ± 1.364
GluVal: 3.338 ± 1.037
GluTrp: 1.821 ± 0.454
GluTyr: 0.91 ± 0.664
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.214 ± 0.494
PheCys: 1.821 ± 0.574
PheAsp: 1.821 ± 0.572
PheGlu: 1.821 ± 0.73
PhePhe: 1.214 ± 0.453
PheGly: 3.945 ± 1.087
PheHis: 0.91 ± 0.289
PheIle: 2.428 ± 1.557
PheLys: 0.91 ± 0.395
PheLeu: 4.856 ± 1.355
PheMet: 0.607 ± 0.261
PheAsn: 2.428 ± 0.811
PhePro: 1.517 ± 0.757
PheGln: 1.821 ± 0.64
PheArg: 3.338 ± 0.833
PheSer: 6.373 ± 1.03
PheThr: 0.303 ± 0.469
PheVal: 1.517 ± 0.504
PheTrp: 0.91 ± 0.615
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.731 ± 0.49
GlyCys: 1.821 ± 0.667
GlyAsp: 3.945 ± 1.377
GlyGlu: 3.945 ± 1.142
GlyPhe: 2.731 ± 0.727
GlyGly: 3.945 ± 1.123
GlyHis: 0.303 ± 0.27
GlyIle: 3.338 ± 1.286
GlyLys: 5.463 ± 1.336
GlyLeu: 5.159 ± 1.39
GlyMet: 0.607 ± 0.27
GlyAsn: 3.338 ± 1.061
GlyPro: 3.035 ± 1.068
GlyGln: 2.124 ± 0.717
GlyArg: 3.945 ± 2.27
GlySer: 6.98 ± 2.985
GlyThr: 4.249 ± 1.351
GlyVal: 3.642 ± 1.029
GlyTrp: 0.607 ± 0.397
GlyTyr: 3.035 ± 0.682
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.303 ± 0.469
HisCys: 1.517 ± 0.366
HisAsp: 2.124 ± 0.892
HisGlu: 1.214 ± 0.639
HisPhe: 0.303 ± 0.198
HisGly: 0.303 ± 0.469
HisHis: 0.0 ± 0.0
HisIle: 1.517 ± 0.639
HisLys: 2.124 ± 0.892
HisLeu: 0.303 ± 0.198
HisMet: 0.0 ± 0.0
HisAsn: 0.91 ± 0.49
HisPro: 2.731 ± 0.929
HisGln: 0.0 ± 0.0
HisArg: 1.821 ± 0.88
HisSer: 2.428 ± 0.521
HisThr: 1.517 ± 0.479
HisVal: 1.517 ± 0.51
HisTrp: 0.91 ± 0.395
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.856 ± 0.825
IleCys: 0.0 ± 0.0
IleAsp: 2.124 ± 0.626
IleGlu: 4.249 ± 0.913
IlePhe: 1.821 ± 0.461
IleGly: 0.91 ± 0.541
IleHis: 0.303 ± 0.198
IleIle: 2.124 ± 0.715
IleLys: 4.552 ± 1.049
IleLeu: 3.642 ± 0.646
IleMet: 0.303 ± 0.198
IleAsn: 3.642 ± 1.053
IlePro: 5.766 ± 1.58
IleGln: 2.731 ± 0.906
IleArg: 2.124 ± 0.476
IleSer: 3.338 ± 1.207
IleThr: 3.945 ± 1.872
IleVal: 2.124 ± 0.626
IleTrp: 0.0 ± 0.0
IleTyr: 1.821 ± 0.574
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.159 ± 1.083
LysCys: 0.91 ± 0.448
LysAsp: 2.124 ± 0.75
LysGlu: 5.159 ± 1.194
LysPhe: 3.035 ± 1.126
LysGly: 5.766 ± 1.025
LysHis: 0.91 ± 0.395
LysIle: 6.677 ± 1.378
LysLys: 2.124 ± 0.922
LysLeu: 6.677 ± 1.284
LysMet: 3.338 ± 0.74
LysAsn: 3.035 ± 1.278
LysPro: 3.338 ± 0.675
LysGln: 2.124 ± 0.892
LysArg: 1.214 ± 0.521
LysSer: 7.284 ± 0.817
LysThr: 3.338 ± 0.761
LysVal: 1.214 ± 0.568
LysTrp: 0.607 ± 0.336
LysTyr: 1.821 ± 0.782
LysXaa: 0.303 ± 0.27
Leu
LeuAla: 4.249 ± 1.193
LeuCys: 0.91 ± 0.395
LeuAsp: 3.338 ± 1.162
LeuGlu: 6.373 ± 1.604
LeuPhe: 2.731 ± 0.748
LeuGly: 5.766 ± 1.499
LeuHis: 2.731 ± 0.9
LeuIle: 3.945 ± 1.142
LeuLys: 5.463 ± 1.194
LeuLeu: 8.194 ± 1.963
LeuMet: 2.428 ± 0.547
LeuAsn: 2.731 ± 0.581
LeuPro: 3.642 ± 1.242
LeuGln: 1.821 ± 0.895
LeuArg: 5.159 ± 1.102
LeuSer: 8.194 ± 0.962
LeuThr: 5.463 ± 1.417
LeuVal: 5.159 ± 1.877
LeuTrp: 3.642 ± 1.261
LeuTyr: 4.856 ± 0.462
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.517 ± 0.479
MetCys: 0.607 ± 0.27
MetAsp: 0.0 ± 0.0
MetGlu: 2.124 ± 0.647
MetPhe: 0.607 ± 0.261
MetGly: 1.821 ± 0.73
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.428 ± 0.747
MetLeu: 0.91 ± 0.395
MetMet: 1.821 ± 0.782
MetAsn: 2.124 ± 0.795
MetPro: 0.0 ± 0.0
MetGln: 0.303 ± 0.27
MetArg: 1.821 ± 0.926
MetSer: 1.517 ± 0.51
MetThr: 0.607 ± 0.261
MetVal: 1.517 ± 0.518
MetTrp: 0.303 ± 0.27
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.214 ± 1.28
AsnCys: 0.91 ± 0.395
AsnAsp: 0.91 ± 0.289
AsnGlu: 1.821 ± 0.585
AsnPhe: 2.731 ± 0.9
AsnGly: 4.552 ± 1.444
AsnHis: 1.214 ± 0.568
AsnIle: 1.214 ± 0.547
AsnLys: 4.249 ± 1.168
AsnLeu: 4.552 ± 0.792
AsnMet: 1.517 ± 0.59
AsnAsn: 3.338 ± 0.674
AsnPro: 1.821 ± 0.907
AsnGln: 1.214 ± 0.951
AsnArg: 3.642 ± 1.181
AsnSer: 3.945 ± 1.853
AsnThr: 4.249 ± 1.157
AsnVal: 2.731 ± 0.62
AsnTrp: 2.124 ± 0.71
AsnTyr: 2.428 ± 1.222
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.338 ± 1.194
ProCys: 0.607 ± 0.476
ProAsp: 2.428 ± 1.222
ProGlu: 4.249 ± 1.171
ProPhe: 0.607 ± 0.261
ProGly: 1.821 ± 0.683
ProHis: 1.517 ± 0.366
ProIle: 3.035 ± 0.611
ProLys: 3.338 ± 1.056
ProLeu: 2.731 ± 1.81
ProMet: 0.607 ± 0.937
ProAsn: 3.642 ± 1.11
ProPro: 7.891 ± 2.263
ProGln: 2.731 ± 1.62
ProArg: 2.428 ± 1.763
ProSer: 3.945 ± 0.961
ProThr: 2.731 ± 0.774
ProVal: 4.856 ± 0.652
ProTrp: 0.303 ± 0.27
ProTyr: 1.517 ± 0.504
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.821 ± 0.722
GlnCys: 0.0 ± 0.0
GlnAsp: 0.91 ± 0.736
GlnGlu: 1.214 ± 0.541
GlnPhe: 2.124 ± 0.468
GlnGly: 1.517 ± 0.373
GlnHis: 2.428 ± 0.768
GlnIle: 1.214 ± 0.453
GlnLys: 4.856 ± 1.73
GlnLeu: 3.945 ± 1.88
GlnMet: 1.214 ± 0.421
GlnAsn: 2.124 ± 1.093
GlnPro: 2.124 ± 1.226
GlnGln: 0.607 ± 0.27
GlnArg: 3.338 ± 1.195
GlnSer: 2.731 ± 0.917
GlnThr: 2.731 ± 1.055
GlnVal: 1.214 ± 0.47
GlnTrp: 0.607 ± 1.078
GlnTyr: 2.428 ± 0.69
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.945 ± 1.7
ArgCys: 0.607 ± 0.261
ArgAsp: 1.821 ± 0.572
ArgGlu: 3.642 ± 1.144
ArgPhe: 0.91 ± 0.664
ArgGly: 6.677 ± 1.567
ArgHis: 2.428 ± 0.842
ArgIle: 4.552 ± 1.668
ArgLys: 2.731 ± 0.891
ArgLeu: 3.642 ± 0.916
ArgMet: 0.91 ± 0.957
ArgAsn: 4.552 ± 2.025
ArgPro: 1.821 ± 1.358
ArgGln: 1.821 ± 0.731
ArgArg: 12.443 ± 7.489
ArgSer: 3.338 ± 1.401
ArgThr: 1.214 ± 1.019
ArgVal: 2.731 ± 0.906
ArgTrp: 0.91 ± 0.541
ArgTyr: 1.517 ± 0.339
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.159 ± 1.545
SerCys: 1.517 ± 0.485
SerAsp: 4.552 ± 1.516
SerGlu: 5.463 ± 1.535
SerPhe: 3.642 ± 1.062
SerGly: 6.98 ± 1.895
SerHis: 2.731 ± 0.817
SerIle: 2.124 ± 1.048
SerLys: 6.373 ± 1.649
SerLeu: 7.891 ± 3.027
SerMet: 0.607 ± 0.261
SerAsn: 2.428 ± 0.71
SerPro: 5.159 ± 1.517
SerGln: 4.249 ± 1.04
SerArg: 3.338 ± 1.592
SerSer: 12.14 ± 1.937
SerThr: 4.856 ± 1.444
SerVal: 4.249 ± 1.171
SerTrp: 1.821 ± 0.533
SerTyr: 2.731 ± 0.725
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.856 ± 0.871
ThrCys: 0.91 ± 0.905
ThrAsp: 1.821 ± 0.683
ThrGlu: 3.338 ± 0.761
ThrPhe: 2.428 ± 1.748
ThrGly: 2.731 ± 0.8
ThrHis: 0.303 ± 0.198
ThrIle: 2.428 ± 0.611
ThrLys: 3.338 ± 0.488
ThrLeu: 2.124 ± 1.353
ThrMet: 2.731 ± 0.609
ThrAsn: 2.124 ± 0.464
ThrPro: 2.428 ± 0.956
ThrGln: 3.338 ± 1.066
ThrArg: 1.517 ± 0.989
ThrSer: 5.159 ± 1.303
ThrThr: 2.428 ± 1.133
ThrVal: 4.552 ± 0.78
ThrTrp: 0.303 ± 0.27
ThrTyr: 1.821 ± 0.574
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.338 ± 0.703
ValCys: 0.303 ± 0.469
ValAsp: 3.945 ± 0.71
ValGlu: 3.945 ± 1.188
ValPhe: 3.642 ± 0.872
ValGly: 3.642 ± 1.082
ValHis: 1.214 ± 0.364
ValIle: 3.642 ± 1.082
ValLys: 3.338 ± 0.872
ValLeu: 6.07 ± 1.791
ValMet: 0.91 ± 0.395
ValAsn: 1.821 ± 0.585
ValPro: 2.428 ± 0.585
ValGln: 3.642 ± 0.82
ValArg: 1.821 ± 0.725
ValSer: 3.338 ± 0.657
ValThr: 2.428 ± 0.552
ValVal: 4.552 ± 2.263
ValTrp: 1.214 ± 0.33
ValTyr: 1.517 ± 0.51
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.91 ± 0.395
TrpCys: 0.0 ± 0.0
TrpAsp: 0.607 ± 0.27
TrpGlu: 1.214 ± 0.47
TrpPhe: 0.303 ± 0.27
TrpGly: 0.91 ± 0.502
TrpHis: 0.607 ± 0.522
TrpIle: 0.607 ± 0.261
TrpLys: 1.517 ± 0.504
TrpLeu: 3.945 ± 1.079
TrpMet: 0.0 ± 0.0
TrpAsn: 0.91 ± 0.289
TrpPro: 0.303 ± 0.198
TrpGln: 1.517 ± 0.485
TrpArg: 1.517 ± 0.543
TrpSer: 1.821 ± 0.852
TrpThr: 1.821 ± 0.782
TrpVal: 1.821 ± 0.782
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.91 ± 0.289
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.731 ± 0.837
TyrCys: 0.607 ± 0.476
TyrAsp: 1.214 ± 0.364
TyrGlu: 2.428 ± 0.657
TyrPhe: 0.91 ± 0.49
TyrGly: 1.517 ± 0.854
TyrHis: 1.214 ± 0.639
TyrIle: 2.731 ± 0.465
TyrLys: 2.731 ± 1.104
TyrLeu: 3.338 ± 0.935
TyrMet: 0.607 ± 0.54
TyrAsn: 3.035 ± 0.906
TyrPro: 0.303 ± 0.469
TyrGln: 0.303 ± 0.539
TyrArg: 1.821 ± 0.574
TyrSer: 2.731 ± 1.1
TyrThr: 1.517 ± 0.51
TyrVal: 1.517 ± 0.742
TyrTrp: 1.214 ± 0.521
TyrTyr: 1.821 ± 0.67
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.303 ± 0.27
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3296 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
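One common way to obtain such an error is sketched below, as an illustration only. It assumes that whole proteins are resampled with replacement 100 times, that the dipeptide frequency is recomputed for each resample, and that the standard deviation of those values is reported as the error. The function names, the fixed seed and the toy sequences are hypothetical; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input for illustration only, not the White clover mottle virus proteome.
toy = ["MSSLLA", "MLLSSK", "MKKAAV"]
print(dipeptide_permille(toy, "SS"), bootstrap_error(toy, "SS"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins a dipeptide concentrated in one of them will show a large error relative to its frequency.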

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski