Amino acid dipepetide frequency for Red clover vein mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.345AlaAla: 4.345 ± 3.439
1.81AlaCys: 1.81 ± 0.538
2.534AlaAsp: 2.534 ± 1.383
4.707AlaGlu: 4.707 ± 2.36
3.983AlaPhe: 3.983 ± 2.119
3.259AlaGly: 3.259 ± 2.073
1.086AlaHis: 1.086 ± 0.593
4.345AlaIle: 4.345 ± 1.224
6.879AlaLys: 6.879 ± 1.252
7.965AlaLeu: 7.965 ± 1.167
0.362AlaMet: 0.362 ± 0.198
3.259AlaAsn: 3.259 ± 0.914
1.81AlaPro: 1.81 ± 0.929
1.81AlaGln: 1.81 ± 0.538
3.259AlaArg: 3.259 ± 0.645
3.983AlaSer: 3.983 ± 1.918
3.621AlaThr: 3.621 ± 0.547
3.983AlaVal: 3.983 ± 1.49
0.0AlaTrp: 0.0 ± 0.0
1.086AlaTyr: 1.086 ± 0.593
0.0AlaXaa: 0.0 ± 0.0
Cys
3.621CysAla: 3.621 ± 0.603
0.362CysCys: 0.362 ± 0.198
0.724CysAsp: 0.724 ± 0.971
1.81CysGlu: 1.81 ± 1.36
2.534CysPhe: 2.534 ± 1.383
1.81CysGly: 1.81 ± 0.999
0.0CysHis: 0.0 ± 0.0
1.448CysIle: 1.448 ± 0.801
1.81CysLys: 1.81 ± 0.988
0.724CysLeu: 0.724 ± 1.368
0.724CysMet: 0.724 ± 0.525
1.086CysAsn: 1.086 ± 1.859
0.362CysPro: 0.362 ± 0.198
1.086CysGln: 1.086 ± 0.476
1.086CysArg: 1.086 ± 0.551
2.534CysSer: 2.534 ± 1.074
3.259CysThr: 3.259 ± 0.715
3.621CysVal: 3.621 ± 3.128
0.0CysTrp: 0.0 ± 0.0
0.362CysTyr: 0.362 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
3.259AspAla: 3.259 ± 0.645
1.81AspCys: 1.81 ± 0.988
3.983AspAsp: 3.983 ± 1.013
3.983AspGlu: 3.983 ± 1.044
3.621AspPhe: 3.621 ± 1.368
4.707AspGly: 4.707 ± 1.58
1.086AspHis: 1.086 ± 0.476
1.086AspIle: 1.086 ± 0.551
1.086AspLys: 1.086 ± 0.551
6.155AspLeu: 6.155 ± 1.068
1.086AspMet: 1.086 ± 0.593
1.81AspAsn: 1.81 ± 0.611
2.896AspPro: 2.896 ± 1.31
0.362AspGln: 0.362 ± 0.198
3.621AspArg: 3.621 ± 0.925
2.896AspSer: 2.896 ± 1.422
1.81AspThr: 1.81 ± 0.767
3.983AspVal: 3.983 ± 2.208
0.724AspTrp: 0.724 ± 0.582
2.172AspTyr: 2.172 ± 0.741
0.0AspXaa: 0.0 ± 0.0
Glu
4.345GluAla: 4.345 ± 0.954
1.086GluCys: 1.086 ± 0.593
3.259GluAsp: 3.259 ± 0.914
4.707GluGlu: 4.707 ± 0.95
4.707GluPhe: 4.707 ± 1.224
4.707GluGly: 4.707 ± 0.82
0.724GluHis: 0.724 ± 0.541
4.707GluIle: 4.707 ± 1.966
2.896GluLys: 2.896 ± 1.046
9.776GluLeu: 9.776 ± 1.504
0.724GluMet: 0.724 ± 0.395
2.534GluAsn: 2.534 ± 1.123
2.172GluPro: 2.172 ± 0.903
2.172GluGln: 2.172 ± 0.817
2.172GluArg: 2.172 ± 1.186
5.069GluSer: 5.069 ± 1.311
2.534GluThr: 2.534 ± 0.972
6.155GluVal: 6.155 ± 2.74
0.362GluTrp: 0.362 ± 0.198
1.81GluTyr: 1.81 ± 1.437
0.0GluXaa: 0.0 ± 0.0
Phe
4.345PheAla: 4.345 ± 2.204
1.086PheCys: 1.086 ± 0.476
3.983PheAsp: 3.983 ± 1.493
5.431PheGlu: 5.431 ± 1.895
2.172PhePhe: 2.172 ± 0.817
5.431PheGly: 5.431 ± 1.299
1.448PheHis: 1.448 ± 0.791
3.259PheIle: 3.259 ± 0.715
2.534PheLys: 2.534 ± 0.863
7.603PheLeu: 7.603 ± 2.551
1.086PheMet: 1.086 ± 1.234
3.983PheAsn: 3.983 ± 1.044
2.534PhePro: 2.534 ± 1.123
2.172PheGln: 2.172 ± 0.477
1.81PheArg: 1.81 ± 0.988
6.155PheSer: 6.155 ± 1.379
3.983PheThr: 3.983 ± 4.212
4.707PheVal: 4.707 ± 1.322
1.086PheTrp: 1.086 ± 0.551
1.086PheTyr: 1.086 ± 0.476
0.0PheXaa: 0.0 ± 0.0
Gly
2.534GlyAla: 2.534 ± 0.863
2.534GlyCys: 2.534 ± 1.85
4.707GlyAsp: 4.707 ± 1.511
4.345GlyGlu: 4.345 ± 2.483
2.534GlyPhe: 2.534 ± 0.945
5.793GlyGly: 5.793 ± 1.806
1.81GlyHis: 1.81 ± 0.767
2.896GlyIle: 2.896 ± 1.035
4.345GlyLys: 4.345 ± 0.954
7.603GlyLeu: 7.603 ± 3.593
0.362GlyMet: 0.362 ± 0.672
2.896GlyAsn: 2.896 ± 0.841
2.534GlyPro: 2.534 ± 0.945
1.086GlyGln: 1.086 ± 0.593
5.431GlyArg: 5.431 ± 1.932
5.793GlySer: 5.793 ± 1.618
3.621GlyThr: 3.621 ± 0.927
5.431GlyVal: 5.431 ± 1.324
1.448GlyTrp: 1.448 ± 1.164
1.086GlyTyr: 1.086 ± 0.593
0.0GlyXaa: 0.0 ± 0.0
His
1.81HisAla: 1.81 ± 0.574
0.362HisCys: 0.362 ± 0.678
0.362HisAsp: 0.362 ± 0.198
0.0HisGlu: 0.0 ± 0.0
2.172HisPhe: 2.172 ± 1.186
1.448HisGly: 1.448 ± 0.84
1.086HisHis: 1.086 ± 0.597
1.448HisIle: 1.448 ± 1.352
1.448HisLys: 1.448 ± 0.589
2.896HisLeu: 2.896 ± 1.286
0.724HisMet: 0.724 ± 0.395
0.362HisAsn: 0.362 ± 0.198
0.362HisPro: 0.362 ± 0.198
0.724HisGln: 0.724 ± 0.395
0.724HisArg: 0.724 ± 0.541
1.81HisSer: 1.81 ± 0.988
0.724HisThr: 0.724 ± 1.321
2.172HisVal: 2.172 ± 0.889
0.0HisTrp: 0.0 ± 0.0
1.448HisTyr: 1.448 ± 0.801
0.0HisXaa: 0.0 ± 0.0
Ile
5.069IleAla: 5.069 ± 1.552
2.896IleCys: 2.896 ± 0.841
2.896IleAsp: 2.896 ± 1.14
3.983IleGlu: 3.983 ± 1.586
2.896IlePhe: 2.896 ± 1.035
2.172IleGly: 2.172 ± 0.741
1.086IleHis: 1.086 ± 0.597
2.534IleIle: 2.534 ± 0.863
4.345IleLys: 4.345 ± 1.633
3.983IleLeu: 3.983 ± 2.518
1.448IleMet: 1.448 ± 0.791
1.086IleAsn: 1.086 ± 0.593
2.172IlePro: 2.172 ± 1.492
1.448IleGln: 1.448 ± 0.589
2.534IleArg: 2.534 ± 1.268
3.983IleSer: 3.983 ± 2.06
4.345IleThr: 4.345 ± 2.535
1.81IleVal: 1.81 ± 0.83
0.0IleTrp: 0.0 ± 0.0
2.172IleTyr: 2.172 ± 1.186
0.0IleXaa: 0.0 ± 0.0
Lys
4.707LysAla: 4.707 ± 0.74
0.362LysCys: 0.362 ± 0.198
2.896LysAsp: 2.896 ± 0.764
3.259LysGlu: 3.259 ± 1.261
3.621LysPhe: 3.621 ± 0.873
3.259LysGly: 3.259 ± 1.261
1.086LysHis: 1.086 ± 1.234
2.172LysIle: 2.172 ± 0.728
3.621LysLys: 3.621 ± 0.986
6.879LysLeu: 6.879 ± 1.279
2.534LysMet: 2.534 ± 1.693
2.896LysAsn: 2.896 ± 1.141
2.172LysPro: 2.172 ± 1.186
2.534LysGln: 2.534 ± 1.383
4.707LysArg: 4.707 ± 2.36
4.707LysSer: 4.707 ± 1.631
3.259LysThr: 3.259 ± 1.214
4.707LysVal: 4.707 ± 1.845
1.086LysTrp: 1.086 ± 0.593
0.724LysTyr: 0.724 ± 0.582
0.0LysXaa: 0.0 ± 0.0
Leu
4.345LeuAla: 4.345 ± 3.159
3.621LeuCys: 3.621 ± 2.606
4.707LeuAsp: 4.707 ± 0.998
7.241LeuGlu: 7.241 ± 1.433
5.069LeuPhe: 5.069 ± 1.535
12.31LeuGly: 12.31 ± 2.034
1.448LeuHis: 1.448 ± 1.1
4.707LeuIle: 4.707 ± 1.666
7.965LeuLys: 7.965 ± 2.298
10.5LeuLeu: 10.5 ± 2.53
1.81LeuMet: 1.81 ± 0.988
5.431LeuAsn: 5.431 ± 2.479
7.603LeuPro: 7.603 ± 1.619
1.448LeuGln: 1.448 ± 1.164
7.965LeuArg: 7.965 ± 3.231
9.051LeuSer: 9.051 ± 3.352
5.793LeuThr: 5.793 ± 1.922
5.069LeuVal: 5.069 ± 1.668
0.724LeuTrp: 0.724 ± 0.395
1.81LeuTyr: 1.81 ± 0.753
0.0LeuXaa: 0.0 ± 0.0
Met
3.259MetAla: 3.259 ± 1.317
0.724MetCys: 0.724 ± 0.395
1.448MetAsp: 1.448 ± 1.361
1.81MetGlu: 1.81 ± 1.169
0.724MetPhe: 0.724 ± 0.582
0.362MetGly: 0.362 ± 0.198
1.086MetHis: 1.086 ± 0.551
0.0MetIle: 0.0 ± 0.0
1.086MetLys: 1.086 ± 0.551
1.086MetLeu: 1.086 ± 0.593
0.0MetMet: 0.0 ± 0.0
0.724MetAsn: 0.724 ± 0.395
1.086MetPro: 1.086 ± 0.597
0.0MetGln: 0.0 ± 0.0
2.172MetArg: 2.172 ± 1.186
1.448MetSer: 1.448 ± 0.589
0.724MetThr: 0.724 ± 1.311
0.362MetVal: 0.362 ± 0.198
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.896AsnAla: 2.896 ± 1.14
1.086AsnCys: 1.086 ± 0.593
1.086AsnAsp: 1.086 ± 0.593
1.086AsnGlu: 1.086 ± 0.593
3.621AsnPhe: 3.621 ± 1.274
3.621AsnGly: 3.621 ± 0.954
1.086AsnHis: 1.086 ± 0.593
3.983AsnIle: 3.983 ± 1.154
2.172AsnLys: 2.172 ± 0.728
3.983AsnLeu: 3.983 ± 0.649
0.724AsnMet: 0.724 ± 0.395
1.086AsnAsn: 1.086 ± 0.551
2.896AsnPro: 2.896 ± 0.832
1.448AsnGln: 1.448 ± 0.904
3.621AsnArg: 3.621 ± 2.548
1.81AsnSer: 1.81 ± 1.366
1.086AsnThr: 1.086 ± 0.804
1.448AsnVal: 1.448 ± 0.655
1.086AsnTrp: 1.086 ± 0.476
1.448AsnTyr: 1.448 ± 1.164
0.0AsnXaa: 0.0 ± 0.0
Pro
2.534ProAla: 2.534 ± 1.226
2.172ProCys: 2.172 ± 0.817
2.172ProAsp: 2.172 ± 0.741
4.707ProGlu: 4.707 ± 1.177
1.81ProPhe: 1.81 ± 1.116
2.172ProGly: 2.172 ± 0.741
1.81ProHis: 1.81 ± 1.45
2.896ProIle: 2.896 ± 0.74
2.534ProLys: 2.534 ± 1.123
2.896ProLeu: 2.896 ± 1.553
0.724ProMet: 0.724 ± 0.395
1.81ProAsn: 1.81 ± 1.239
2.896ProPro: 2.896 ± 1.708
1.81ProGln: 1.81 ± 0.83
2.534ProArg: 2.534 ± 1.383
2.172ProSer: 2.172 ± 0.477
2.896ProThr: 2.896 ± 1.046
2.896ProVal: 2.896 ± 1.537
0.362ProTrp: 0.362 ± 0.198
1.086ProTyr: 1.086 ± 0.476
0.0ProXaa: 0.0 ± 0.0
Gln
1.086GlnAla: 1.086 ± 0.593
0.362GlnCys: 0.362 ± 1.411
1.086GlnAsp: 1.086 ± 0.593
2.534GlnGlu: 2.534 ± 0.972
1.448GlnPhe: 1.448 ± 0.489
1.448GlnGly: 1.448 ± 0.791
0.724GlnHis: 0.724 ± 0.395
0.724GlnIle: 0.724 ± 0.395
1.448GlnLys: 1.448 ± 1.911
2.534GlnLeu: 2.534 ± 0.494
0.362GlnMet: 0.362 ± 0.807
1.086GlnAsn: 1.086 ± 1.234
1.81GlnPro: 1.81 ± 1.116
0.724GlnGln: 0.724 ± 0.395
1.81GlnArg: 1.81 ± 0.574
1.448GlnSer: 1.448 ± 0.791
0.724GlnThr: 0.724 ± 0.884
0.724GlnVal: 0.724 ± 0.582
1.086GlnTrp: 1.086 ± 0.82
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.534ArgAla: 2.534 ± 0.927
0.362ArgCys: 0.362 ± 0.983
2.534ArgAsp: 2.534 ± 1.915
3.621ArgGlu: 3.621 ± 1.498
7.603ArgPhe: 7.603 ± 2.254
4.345ArgGly: 4.345 ± 3.197
1.086ArgHis: 1.086 ± 0.476
1.448ArgIle: 1.448 ± 0.791
2.172ArgLys: 2.172 ± 1.186
6.517ArgLeu: 6.517 ± 3.698
0.362ArgMet: 0.362 ± 0.198
2.534ArgAsn: 2.534 ± 1.383
2.534ArgPro: 2.534 ± 2.015
0.362ArgGln: 0.362 ± 0.672
2.534ArgArg: 2.534 ± 1.825
4.345ArgSer: 4.345 ± 1.966
2.534ArgThr: 2.534 ± 0.863
4.345ArgVal: 4.345 ± 3.607
0.724ArgTrp: 0.724 ± 0.395
2.896ArgTyr: 2.896 ± 1.14
0.0ArgXaa: 0.0 ± 0.0
Ser
3.259SerAla: 3.259 ± 1.449
3.621SerCys: 3.621 ± 1.416
5.069SerAsp: 5.069 ± 1.379
5.069SerGlu: 5.069 ± 1.596
3.983SerPhe: 3.983 ± 1.35
4.345SerGly: 4.345 ± 1.485
1.448SerHis: 1.448 ± 0.489
6.517SerIle: 6.517 ± 2.398
6.517SerLys: 6.517 ± 1.782
7.241SerLeu: 7.241 ± 1.206
1.81SerMet: 1.81 ± 0.752
1.81SerAsn: 1.81 ± 1.876
2.896SerPro: 2.896 ± 1.661
2.172SerGln: 2.172 ± 1.342
2.896SerArg: 2.896 ± 1.192
7.603SerSer: 7.603 ± 4.229
3.983SerThr: 3.983 ± 1.236
3.621SerVal: 3.621 ± 1.933
0.362SerTrp: 0.362 ± 0.198
2.534SerTyr: 2.534 ± 1.234
0.0SerXaa: 0.0 ± 0.0
Thr
2.534ThrAla: 2.534 ± 1.034
0.362ThrCys: 0.362 ± 0.66
1.448ThrAsp: 1.448 ± 1.1
1.81ThrGlu: 1.81 ± 0.988
6.879ThrPhe: 6.879 ± 1.729
3.621ThrGly: 3.621 ± 1.494
1.448ThrHis: 1.448 ± 0.589
2.172ThrIle: 2.172 ± 2.69
2.896ThrLys: 2.896 ± 1.212
5.793ThrLeu: 5.793 ± 1.182
1.448ThrMet: 1.448 ± 0.505
1.448ThrAsn: 1.448 ± 0.589
1.448ThrPro: 1.448 ± 0.678
0.724ThrGln: 0.724 ± 1.311
2.896ThrArg: 2.896 ± 2.489
3.621ThrSer: 3.621 ± 0.603
1.448ThrThr: 1.448 ± 1.081
2.896ThrVal: 2.896 ± 1.035
0.724ThrTrp: 0.724 ± 0.931
2.896ThrTyr: 2.896 ± 0.909
0.0ThrXaa: 0.0 ± 0.0
Val
5.431ValAla: 5.431 ± 1.127
3.621ValCys: 3.621 ± 1.559
4.707ValAsp: 4.707 ± 1.408
3.621ValGlu: 3.621 ± 0.771
4.345ValPhe: 4.345 ± 1.633
2.896ValGly: 2.896 ± 1.785
1.448ValHis: 1.448 ± 1.186
4.707ValIle: 4.707 ± 2.774
2.534ValLys: 2.534 ± 0.709
9.776ValLeu: 9.776 ± 3.477
0.724ValMet: 0.724 ± 1.169
3.259ValAsn: 3.259 ± 1.805
2.896ValPro: 2.896 ± 0.832
1.448ValGln: 1.448 ± 0.791
1.81ValArg: 1.81 ± 1.716
6.155ValSer: 6.155 ± 1.26
1.086ValThr: 1.086 ± 1.59
4.707ValVal: 4.707 ± 2.285
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.362TrpAla: 0.362 ± 0.672
0.362TrpCys: 0.362 ± 0.198
1.086TrpAsp: 1.086 ± 0.551
1.81TrpGlu: 1.81 ± 0.796
1.086TrpPhe: 1.086 ± 0.476
0.362TrpGly: 0.362 ± 0.198
0.362TrpHis: 0.362 ± 0.198
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.448TrpLeu: 1.448 ± 0.791
0.0TrpMet: 0.0 ± 0.0
0.724TrpAsn: 0.724 ± 0.582
0.362TrpPro: 0.362 ± 0.198
0.0TrpGln: 0.0 ± 0.0
0.724TrpArg: 0.724 ± 0.582
0.0TrpSer: 0.0 ± 0.0
0.362TrpThr: 0.362 ± 0.198
1.086TrpVal: 1.086 ± 0.597
0.0TrpTrp: 0.0 ± 0.0
0.362TrpTyr: 0.362 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.81TyrAla: 1.81 ± 1.36
0.362TyrCys: 0.362 ± 0.672
1.448TyrAsp: 1.448 ± 0.655
1.086TyrGlu: 1.086 ± 0.593
1.086TyrPhe: 1.086 ± 0.593
0.724TyrGly: 0.724 ± 0.884
0.724TyrHis: 0.724 ± 0.395
1.81TyrIle: 1.81 ± 0.83
2.896TyrLys: 2.896 ± 0.841
3.259TyrLeu: 3.259 ± 1.196
0.724TyrMet: 0.724 ± 0.582
1.448TyrAsn: 1.448 ± 0.791
1.448TyrPro: 1.448 ± 0.791
0.0TyrGln: 0.0 ± 0.0
1.086TyrArg: 1.086 ± 0.593
2.172TyrSer: 2.172 ± 1.076
0.724TyrThr: 0.724 ± 0.395
1.448TyrVal: 1.448 ± 0.589
0.724TyrTrp: 0.724 ± 0.395
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2763 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski