Amino acid dipeptide frequency for Broad bean mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
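The uniform expectation described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the assumption that the red/blue marking uses exactly the uniform expectation as its threshold is mine, not something the page states.

```python
def expected_uniform_permille(alphabet_size: int = 20) -> float:
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille: float, alphabet_size: int = 20) -> str:
    """Compare an observed frequency with the uniform expectation.

    Assumption: the threshold is the uniform expectation itself
    (2.5 per mille for 20 amino acids).
    """
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(expected_uniform_permille(20))  # 400 possible dipeptides
    print(expected_uniform_permille(21))  # 441 possible dipeptides (with Xaa)
    print(classify(7.965))                # AlaAsp value from the table
    print(classify(0.0))                  # AlaHis value from the table
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops slightly, to about 2.27‰.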

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
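As a rough illustration of how per-mille dipeptide frequencies can be obtained from protein sequences, here is a minimal Python sketch. It assumes one-letter amino acid codes (the table uses three-letter codes) and counts overlapping pairs within each protein separately. The exact counting procedure used by the database is not described here, so treat both choices, and the function names, as assumptions.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each sequence,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def permille_to_fraction(value_permille: float) -> float:
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


if __name__ == "__main__":
    freqs = dipeptide_permille(["AAC", "AC"])
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), permille_to_fraction(value))
```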

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.982 ± 1.512
AlaCys: 0.885 ± 0.311
AlaAsp: 7.965 ± 1.413
AlaGlu: 4.425 ± 1.039
AlaPhe: 2.655 ± 1.212
AlaGly: 2.655 ± 0.659
AlaHis: 0.0 ± 0.0
AlaIle: 3.097 ± 1.771
AlaLys: 4.425 ± 0.685
AlaLeu: 7.08 ± 3.04
AlaMet: 3.54 ± 1.032
AlaAsn: 3.982 ± 1.874
AlaPro: 2.655 ± 1.477
AlaGln: 0.885 ± 0.679
AlaArg: 3.097 ± 1.085
AlaSer: 4.867 ± 1.258
AlaThr: 3.097 ± 0.855
AlaVal: 3.982 ± 1.897
AlaTrp: 0.442 ± 0.367
AlaTyr: 3.097 ± 2.036
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.885 ± 0.647
CysCys: 0.442 ± 0.324
CysAsp: 1.77 ± 0.622
CysGlu: 1.327 ± 1.101
CysPhe: 2.655 ± 1.099
CysGly: 1.327 ± 0.518
CysHis: 0.442 ± 0.324
CysIle: 1.327 ± 0.971
CysLys: 2.212 ± 0.634
CysLeu: 3.097 ± 0.585
CysMet: 0.442 ± 0.745
CysAsn: 1.327 ± 0.76
CysPro: 1.327 ± 0.971
CysGln: 0.0 ± 0.0
CysArg: 2.212 ± 1.113
CysSer: 3.54 ± 1.301
CysThr: 0.0 ± 0.0
CysVal: 0.442 ± 0.676
CysTrp: 0.0 ± 0.0
CysTyr: 0.885 ± 0.734
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.425 ± 1.816
AspCys: 1.77 ± 0.467
AspAsp: 2.212 ± 0.881
AspGlu: 2.655 ± 1.197
AspPhe: 3.982 ± 1.386
AspGly: 4.425 ± 0.841
AspHis: 1.327 ± 0.972
AspIle: 1.327 ± 0.518
AspLys: 6.637 ± 1.8
AspLeu: 3.982 ± 1.609
AspMet: 1.327 ± 0.518
AspAsn: 0.442 ± 0.367
AspPro: 3.54 ± 1.788
AspGln: 0.442 ± 0.367
AspArg: 2.655 ± 0.554
AspSer: 6.195 ± 0.754
AspThr: 2.212 ± 1.159
AspVal: 7.965 ± 2.389
AspTrp: 3.097 ± 1.085
AspTyr: 1.77 ± 0.943
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.425 ± 1.298
GluCys: 1.77 ± 0.467
GluAsp: 3.097 ± 1.085
GluGlu: 6.195 ± 1.512
GluPhe: 4.425 ± 0.879
GluGly: 1.77 ± 0.545
GluHis: 1.77 ± 0.571
GluIle: 2.212 ± 1.113
GluLys: 3.54 ± 1.472
GluLeu: 5.752 ± 1.955
GluMet: 2.212 ± 1.181
GluAsn: 3.54 ± 1.486
GluPro: 1.77 ± 1.171
GluGln: 0.885 ± 0.647
GluArg: 4.425 ± 1.582
GluSer: 8.407 ± 1.806
GluThr: 4.425 ± 1.556
GluVal: 7.08 ± 1.901
GluTrp: 0.885 ± 0.311
GluTyr: 4.425 ± 1.267
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.655 ± 0.797
PheCys: 0.885 ± 0.311
PheAsp: 5.31 ± 1.225
PheGlu: 4.425 ± 1.556
PhePhe: 1.77 ± 0.943
PheGly: 3.097 ± 0.453
PheHis: 1.77 ± 1.468
PheIle: 2.212 ± 1.534
PheLys: 3.097 ± 1.291
PheLeu: 3.097 ± 1.177
PheMet: 0.0 ± 0.0
PheAsn: 0.885 ± 0.734
PhePro: 1.77 ± 0.622
PheGln: 3.982 ± 1.297
PheArg: 3.54 ± 1.364
PheSer: 4.867 ± 2.229
PheThr: 0.885 ± 0.693
PheVal: 3.097 ± 0.762
PheTrp: 0.0 ± 0.0
PheTyr: 1.327 ± 0.758
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.867 ± 1.003
GlyCys: 1.327 ± 0.518
GlyAsp: 3.982 ± 1.049
GlyGlu: 3.097 ± 1.207
GlyPhe: 3.54 ± 0.832
GlyGly: 3.54 ± 2.065
GlyHis: 1.77 ± 0.545
GlyIle: 2.655 ± 1.207
GlyLys: 2.212 ± 0.588
GlyLeu: 3.54 ± 1.112
GlyMet: 0.442 ± 0.745
GlyAsn: 1.77 ± 1.225
GlyPro: 1.327 ± 0.599
GlyGln: 2.655 ± 1.299
GlyArg: 1.77 ± 1.608
GlySer: 4.867 ± 1.498
GlyThr: 2.212 ± 0.746
GlyVal: 3.54 ± 1.577
GlyTrp: 1.327 ± 0.518
GlyTyr: 2.212 ± 1.042
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.327 ± 0.494
HisCys: 0.885 ± 0.311
HisAsp: 1.327 ± 0.599
HisGlu: 2.655 ± 1.036
HisPhe: 1.327 ± 0.518
HisGly: 2.212 ± 1.13
HisHis: 0.442 ± 0.324
HisIle: 0.442 ± 0.745
HisLys: 1.77 ± 0.622
HisLeu: 2.212 ± 0.588
HisMet: 0.442 ± 0.367
HisAsn: 0.885 ± 0.311
HisPro: 0.442 ± 0.367
HisGln: 0.442 ± 0.367
HisArg: 1.77 ± 0.622
HisSer: 1.77 ± 0.95
HisThr: 0.442 ± 0.745
HisVal: 1.77 ± 0.683
HisTrp: 0.442 ± 0.324
HisTyr: 1.327 ± 0.971
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.77 ± 0.95
IleCys: 2.212 ± 1.299
IleAsp: 3.097 ± 1.714
IleGlu: 2.212 ± 0.881
IlePhe: 0.442 ± 0.324
IleGly: 2.655 ± 0.989
IleHis: 1.327 ± 0.518
IleIle: 1.77 ± 1.294
IleLys: 3.097 ± 0.973
IleLeu: 3.982 ± 0.749
IleMet: 2.655 ± 1.428
IleAsn: 3.097 ± 1.181
IlePro: 3.54 ± 1.089
IleGln: 0.885 ± 0.734
IleArg: 1.77 ± 1.53
IleSer: 4.425 ± 1.511
IleThr: 1.77 ± 0.793
IleVal: 4.425 ± 1.039
IleTrp: 0.442 ± 0.324
IleTyr: 2.212 ± 1.488
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.195 ± 1.174
LysCys: 1.77 ± 1.294
LysAsp: 3.097 ± 1.075
LysGlu: 6.195 ± 2.89
LysPhe: 5.752 ± 1.954
LysGly: 3.54 ± 1.577
LysHis: 0.442 ± 0.324
LysIle: 3.54 ± 1.301
LysLys: 3.097 ± 0.906
LysLeu: 2.655 ± 1.212
LysMet: 1.327 ± 0.494
LysAsn: 1.327 ± 0.518
LysPro: 4.425 ± 2.174
LysGln: 2.212 ± 2.066
LysArg: 4.425 ± 1.246
LysSer: 6.637 ± 2.306
LysThr: 4.867 ± 0.768
LysVal: 3.982 ± 1.0
LysTrp: 0.442 ± 0.367
LysTyr: 2.655 ± 1.036
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.637 ± 2.263
LeuCys: 2.212 ± 0.791
LeuAsp: 4.425 ± 0.953
LeuGlu: 7.08 ± 1.183
LeuPhe: 1.77 ± 0.622
LeuGly: 3.982 ± 1.0
LeuHis: 2.212 ± 0.791
LeuIle: 2.212 ± 0.881
LeuLys: 7.965 ± 0.811
LeuLeu: 7.08 ± 1.116
LeuMet: 0.885 ± 0.679
LeuAsn: 6.637 ± 2.182
LeuPro: 3.097 ± 1.145
LeuGln: 2.212 ± 0.881
LeuArg: 6.637 ± 1.835
LeuSer: 7.965 ± 2.275
LeuThr: 4.867 ± 1.289
LeuVal: 3.982 ± 2.472
LeuTrp: 1.327 ± 0.518
LeuTyr: 2.212 ± 1.042
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.77 ± 1.468
MetCys: 0.442 ± 0.324
MetAsp: 0.442 ± 0.367
MetGlu: 1.77 ± 0.571
MetPhe: 2.655 ± 1.871
MetGly: 1.77 ± 0.467
MetHis: 1.327 ± 0.518
MetIle: 1.77 ± 1.067
MetLys: 1.327 ± 0.494
MetLeu: 1.327 ± 0.76
MetMet: 0.442 ± 0.367
MetAsn: 0.0 ± 0.0
MetPro: 0.442 ± 0.676
MetGln: 0.0 ± 0.0
MetArg: 0.885 ± 0.311
MetSer: 3.54 ± 0.559
MetThr: 2.212 ± 0.672
MetVal: 2.212 ± 1.405
MetTrp: 0.0 ± 0.0
MetTyr: 0.442 ± 0.324
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.655 ± 1.197
AsnCys: 0.885 ± 0.311
AsnAsp: 1.77 ± 0.622
AsnGlu: 2.655 ± 1.428
AsnPhe: 1.77 ± 0.622
AsnGly: 3.097 ± 1.181
AsnHis: 0.885 ± 0.679
AsnIle: 3.982 ± 1.561
AsnLys: 3.097 ± 0.647
AsnLeu: 4.425 ± 1.075
AsnMet: 1.327 ± 0.518
AsnAsn: 3.097 ± 1.348
AsnPro: 0.885 ± 1.09
AsnGln: 0.442 ± 0.324
AsnArg: 3.54 ± 2.577
AsnSer: 1.327 ± 1.007
AsnThr: 1.327 ± 0.971
AsnVal: 2.655 ± 1.124
AsnTrp: 0.442 ± 0.367
AsnTyr: 0.442 ± 0.745
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.77 ± 1.054
ProCys: 0.442 ± 0.745
ProAsp: 2.212 ± 0.588
ProGlu: 5.752 ± 1.564
ProPhe: 2.212 ± 1.591
ProGly: 2.212 ± 1.475
ProHis: 0.442 ± 0.324
ProIle: 2.655 ± 1.197
ProLys: 2.655 ± 1.639
ProLeu: 3.54 ± 0.998
ProMet: 0.885 ± 0.734
ProAsn: 0.442 ± 0.324
ProPro: 0.885 ± 0.647
ProGln: 0.885 ± 0.679
ProArg: 1.327 ± 1.388
ProSer: 1.327 ± 0.821
ProThr: 3.097 ± 1.135
ProVal: 3.097 ± 1.603
ProTrp: 0.442 ± 0.324
ProTyr: 0.442 ± 0.367
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.327 ± 0.552
GlnCys: 0.885 ± 0.311
GlnAsp: 1.77 ± 0.571
GlnGlu: 0.885 ± 0.693
GlnPhe: 0.442 ± 0.676
GlnGly: 3.097 ± 1.315
GlnHis: 0.442 ± 0.367
GlnIle: 1.77 ± 1.294
GlnLys: 2.212 ± 0.358
GlnLeu: 1.327 ± 1.388
GlnMet: 0.0 ± 0.0
GlnAsn: 0.442 ± 0.367
GlnPro: 0.442 ± 0.676
GlnGln: 0.0 ± 0.0
GlnArg: 3.54 ± 1.422
GlnSer: 3.097 ± 0.973
GlnThr: 0.442 ± 0.324
GlnVal: 2.212 ± 0.786
GlnTrp: 0.442 ± 0.367
GlnTyr: 0.442 ± 0.324
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.982 ± 1.67
ArgCys: 1.77 ± 0.95
ArgAsp: 2.212 ± 0.791
ArgGlu: 2.212 ± 1.299
ArgPhe: 1.77 ± 0.571
ArgGly: 2.655 ± 1.428
ArgHis: 2.212 ± 1.113
ArgIle: 3.097 ± 1.899
ArgLys: 3.54 ± 1.141
ArgLeu: 7.965 ± 0.656
ArgMet: 3.54 ± 0.38
ArgAsn: 2.212 ± 0.791
ArgPro: 0.885 ± 0.734
ArgGln: 1.327 ± 0.518
ArgArg: 4.425 ± 3.393
ArgSer: 3.982 ± 0.749
ArgThr: 3.097 ± 2.165
ArgVal: 3.982 ± 0.794
ArgTrp: 1.327 ± 0.599
ArgTyr: 3.097 ± 1.438
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.867 ± 1.236
SerCys: 2.212 ± 0.358
SerAsp: 5.31 ± 2.105
SerGlu: 6.637 ± 1.15
SerPhe: 7.08 ± 1.969
SerGly: 4.425 ± 2.653
SerHis: 1.327 ± 0.821
SerIle: 8.407 ± 1.46
SerLys: 7.522 ± 2.124
SerLeu: 7.965 ± 1.685
SerMet: 1.327 ± 1.388
SerAsn: 3.097 ± 1.735
SerPro: 2.212 ± 2.066
SerGln: 2.655 ± 0.934
SerArg: 4.425 ± 1.075
SerSer: 8.407 ± 3.873
SerThr: 3.982 ± 1.065
SerVal: 5.31 ± 0.732
SerTrp: 0.442 ± 0.324
SerTyr: 1.327 ± 0.599
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.982 ± 1.386
ThrCys: 1.327 ± 0.518
ThrAsp: 1.327 ± 1.101
ThrGlu: 3.982 ± 1.349
ThrPhe: 2.212 ± 1.113
ThrGly: 1.77 ± 1.195
ThrHis: 1.327 ± 0.599
ThrIle: 1.77 ± 1.225
ThrLys: 1.327 ± 0.76
ThrLeu: 3.982 ± 1.042
ThrMet: 0.885 ± 0.481
ThrAsn: 2.655 ± 1.099
ThrPro: 0.885 ± 0.311
ThrGln: 1.327 ± 0.972
ThrArg: 3.54 ± 1.908
ThrSer: 3.982 ± 0.842
ThrThr: 2.212 ± 0.746
ThrVal: 6.637 ± 1.578
ThrTrp: 0.442 ± 0.324
ThrTyr: 3.097 ± 1.69
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.08 ± 2.419
ValCys: 0.442 ± 0.676
ValAsp: 8.407 ± 0.51
ValGlu: 6.195 ± 1.335
ValPhe: 0.885 ± 0.734
ValGly: 3.097 ± 1.234
ValHis: 3.097 ± 1.207
ValIle: 2.212 ± 0.672
ValLys: 4.867 ± 2.066
ValLeu: 5.752 ± 1.526
ValMet: 1.77 ± 0.683
ValAsn: 3.097 ± 1.315
ValPro: 5.31 ± 2.105
ValGln: 2.212 ± 0.791
ValArg: 3.982 ± 1.244
ValSer: 7.08 ± 2.969
ValThr: 4.867 ± 1.232
ValVal: 3.54 ± 1.582
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.327 ± 0.518
TrpCys: 0.885 ± 0.311
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.442 ± 0.367
TrpPhe: 0.885 ± 0.311
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.442 ± 0.367
TrpLys: 2.212 ± 0.634
TrpLeu: 1.77 ± 0.622
TrpMet: 0.442 ± 0.324
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.442 ± 0.324
TrpArg: 0.442 ± 0.367
TrpSer: 0.0 ± 0.0
TrpThr: 1.327 ± 0.518
TrpVal: 0.885 ± 0.647
TrpTrp: 0.885 ± 0.311
TrpTyr: 0.885 ± 0.734
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.885 ± 0.679
TyrCys: 2.212 ± 0.588
TyrAsp: 2.655 ± 1.124
TyrGlu: 2.212 ± 0.881
TyrPhe: 0.442 ± 0.367
TyrGly: 1.327 ± 0.518
TyrHis: 1.77 ± 0.622
TyrIle: 0.885 ± 0.734
TyrLys: 1.77 ± 0.943
TyrLeu: 4.867 ± 0.569
TyrMet: 0.442 ± 0.367
TyrAsn: 1.77 ± 1.294
TyrPro: 0.885 ± 0.311
TyrGln: 1.327 ± 0.599
TyrArg: 0.885 ± 0.734
TyrSer: 2.655 ± 0.696
TyrThr: 1.77 ± 1.359
TyrVal: 3.097 ± 0.762
TyrTrp: 0.442 ± 0.745
TyrTyr: 0.885 ± 0.734
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2261 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
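A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an assumption-laden illustration, not the database's actual procedure: whole proteins are resampled with replacement, "x100" is read as 100 resamples, the error is taken as the standard deviation of the resampled frequencies, and sequences use one-letter codes. All function names are hypothetical.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    if total == 0:
        return 0.0
    hits = sum(
        1 for s in sequences for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    return 1000.0 * hits / total


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(sequences) whole proteins
    with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    proteins = ["MKVLAAGA", "MAAKLV", "MGSAAV", "MKKAA"]  # toy sequences
    print(pair_permille(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

Under this scheme a dipeptide that never occurs gets an error of exactly 0, which is consistent with the "0.0 ± 0.0" entries in the table. With only 4 proteins, the resamples are coarse, so the errors should be read as rough indications.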

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski