Amino acid dipepetide frequency for Cowpea mild mottle virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.523AlaAla: 4.523 ± 1.454
0.754AlaCys: 0.754 ± 0.415
2.262AlaAsp: 2.262 ± 1.094
5.277AlaGlu: 5.277 ± 2.306
3.015AlaPhe: 3.015 ± 1.496
3.015AlaGly: 3.015 ± 1.631
0.754AlaHis: 0.754 ± 0.501
3.392AlaIle: 3.392 ± 1.38
6.031AlaLys: 6.031 ± 1.559
6.408AlaLeu: 6.408 ± 2.131
1.131AlaMet: 1.131 ± 0.46
2.639AlaAsn: 2.639 ± 1.035
2.262AlaPro: 2.262 ± 1.502
0.377AlaGln: 0.377 ± 0.208
1.885AlaArg: 1.885 ± 0.851
3.769AlaSer: 3.769 ± 0.981
3.392AlaThr: 3.392 ± 0.798
3.769AlaVal: 3.769 ± 0.949
0.0AlaTrp: 0.0 ± 0.0
1.131AlaTyr: 1.131 ± 0.547
0.0AlaXaa: 0.0 ± 0.0
Cys
0.377CysAla: 0.377 ± 0.208
0.377CysCys: 0.377 ± 0.208
0.377CysAsp: 0.377 ± 0.996
0.754CysGlu: 0.754 ± 0.597
2.262CysPhe: 2.262 ± 0.528
1.885CysGly: 1.885 ± 1.031
1.508CysHis: 1.508 ± 2.365
1.885CysIle: 1.885 ± 1.284
1.131CysLys: 1.131 ± 0.46
1.885CysLeu: 1.885 ± 1.428
0.377CysMet: 0.377 ± 0.208
0.377CysAsn: 0.377 ± 0.208
0.754CysPro: 0.754 ± 1.144
0.377CysGln: 0.377 ± 0.208
1.885CysArg: 1.885 ± 0.627
1.131CysSer: 1.131 ± 1.231
1.885CysThr: 1.885 ± 1.038
2.262CysVal: 2.262 ± 1.296
0.0CysTrp: 0.0 ± 0.0
1.131CysTyr: 1.131 ± 1.505
0.0CysXaa: 0.0 ± 0.0
Asp
2.262AspAla: 2.262 ± 0.783
1.508AspCys: 1.508 ± 0.509
2.639AspAsp: 2.639 ± 1.433
2.262AspGlu: 2.262 ± 0.803
3.392AspPhe: 3.392 ± 1.329
3.015AspGly: 3.015 ± 1.124
1.508AspHis: 1.508 ± 0.768
2.262AspIle: 2.262 ± 0.528
2.262AspLys: 2.262 ± 0.783
9.8AspLeu: 9.8 ± 1.691
0.377AspMet: 0.377 ± 0.208
1.508AspAsn: 1.508 ± 0.509
2.262AspPro: 2.262 ± 1.791
1.508AspGln: 1.508 ± 0.509
1.508AspArg: 1.508 ± 1.002
4.523AspSer: 4.523 ± 1.316
0.377AspThr: 0.377 ± 0.208
1.508AspVal: 1.508 ± 0.509
1.131AspTrp: 1.131 ± 1.1
1.885AspTyr: 1.885 ± 0.666
0.0AspXaa: 0.0 ± 0.0
Glu
4.9GluAla: 4.9 ± 2.057
1.131GluCys: 1.131 ± 0.623
3.769GluAsp: 3.769 ± 1.332
6.408GluGlu: 6.408 ± 2.241
3.392GluPhe: 3.392 ± 1.225
4.9GluGly: 4.9 ± 1.161
1.885GluHis: 1.885 ± 0.627
3.392GluIle: 3.392 ± 1.381
5.277GluLys: 5.277 ± 1.117
8.669GluLeu: 8.669 ± 1.956
1.131GluMet: 1.131 ± 1.378
1.885GluAsn: 1.885 ± 0.627
0.377GluPro: 0.377 ± 0.208
2.262GluGln: 2.262 ± 0.759
3.015GluArg: 3.015 ± 1.661
6.785GluSer: 6.785 ± 1.085
2.262GluThr: 2.262 ± 0.803
6.031GluVal: 6.031 ± 1.66
0.754GluTrp: 0.754 ± 0.415
1.885GluTyr: 1.885 ± 0.576
0.0GluXaa: 0.0 ± 0.0
Phe
3.392PheAla: 3.392 ± 0.965
0.377PheCys: 0.377 ± 0.208
3.769PheAsp: 3.769 ± 1.401
6.785PheGlu: 6.785 ± 2.031
1.885PhePhe: 1.885 ± 1.038
4.9PheGly: 4.9 ± 2.785
0.754PheHis: 0.754 ± 0.415
3.392PheIle: 3.392 ± 1.948
4.146PheLys: 4.146 ± 1.513
6.785PheLeu: 6.785 ± 1.925
0.754PheMet: 0.754 ± 1.226
2.262PheAsn: 2.262 ± 1.288
2.262PhePro: 2.262 ± 0.892
2.639PheGln: 2.639 ± 0.966
1.885PheArg: 1.885 ± 1.038
3.015PheSer: 3.015 ± 1.209
3.769PheThr: 3.769 ± 1.654
2.639PheVal: 2.639 ± 0.78
0.377PheTrp: 0.377 ± 0.208
1.508PheTyr: 1.508 ± 1.118
0.0PheXaa: 0.0 ± 0.0
Gly
1.131GlyAla: 1.131 ± 0.547
0.754GlyCys: 0.754 ± 0.854
3.015GlyAsp: 3.015 ± 1.047
4.146GlyGlu: 4.146 ± 1.226
4.146GlyPhe: 4.146 ± 2.518
2.262GlyGly: 2.262 ± 1.288
0.754GlyHis: 0.754 ± 1.378
2.639GlyIle: 2.639 ± 1.172
6.408GlyLys: 6.408 ± 1.142
5.654GlyLeu: 5.654 ± 2.11
0.377GlyMet: 0.377 ± 0.208
1.508GlyAsn: 1.508 ± 0.831
1.131GlyPro: 1.131 ± 1.034
3.015GlyGln: 3.015 ± 1.661
4.146GlyArg: 4.146 ± 0.902
2.639GlySer: 2.639 ± 0.689
4.9GlyThr: 4.9 ± 1.202
3.769GlyVal: 3.769 ± 0.733
0.377GlyTrp: 0.377 ± 0.208
2.639GlyTyr: 2.639 ± 1.053
0.0GlyXaa: 0.0 ± 0.0
His
1.131HisAla: 1.131 ± 0.834
0.377HisCys: 0.377 ± 0.964
0.754HisAsp: 0.754 ± 0.415
0.754HisGlu: 0.754 ± 0.415
1.131HisPhe: 1.131 ± 0.828
0.0HisGly: 0.0 ± 0.0
0.754HisHis: 0.754 ± 0.415
1.508HisIle: 1.508 ± 0.972
2.262HisLys: 2.262 ± 0.528
4.9HisLeu: 4.9 ± 1.559
0.0HisMet: 0.0 ± 0.0
1.885HisAsn: 1.885 ± 0.865
0.377HisPro: 0.377 ± 0.208
0.754HisGln: 0.754 ± 0.415
1.885HisArg: 1.885 ± 2.546
3.392HisSer: 3.392 ± 1.329
1.885HisThr: 1.885 ± 0.865
1.131HisVal: 1.131 ± 1.107
0.0HisTrp: 0.0 ± 0.0
0.754HisTyr: 0.754 ± 0.415
0.0HisXaa: 0.0 ± 0.0
Ile
3.769IleAla: 3.769 ± 1.572
1.885IleCys: 1.885 ± 1.428
2.262IleAsp: 2.262 ± 0.921
4.523IleGlu: 4.523 ± 1.719
1.508IlePhe: 1.508 ± 0.573
3.015IleGly: 3.015 ± 1.984
1.131IleHis: 1.131 ± 0.623
1.508IleIle: 1.508 ± 0.644
4.146IleLys: 4.146 ± 2.284
5.654IleLeu: 5.654 ± 2.895
0.754IleMet: 0.754 ± 0.415
2.639IleAsn: 2.639 ± 0.668
1.508IlePro: 1.508 ± 0.768
1.131IleGln: 1.131 ± 0.46
2.262IleArg: 2.262 ± 1.038
4.523IleSer: 4.523 ± 1.523
3.769IleThr: 3.769 ± 1.641
5.277IleVal: 5.277 ± 1.379
0.0IleTrp: 0.0 ± 0.0
3.015IleTyr: 3.015 ± 1.696
0.0IleXaa: 0.0 ± 0.0
Lys
5.277LysAla: 5.277 ± 1.23
1.131LysCys: 1.131 ± 0.623
3.769LysAsp: 3.769 ± 0.848
4.146LysGlu: 4.146 ± 1.717
2.262LysPhe: 2.262 ± 0.803
4.146LysGly: 4.146 ± 1.034
2.262LysHis: 2.262 ± 0.528
5.277LysIle: 5.277 ± 1.491
6.031LysLys: 6.031 ± 2.29
9.423LysLeu: 9.423 ± 1.544
2.639LysMet: 2.639 ± 1.951
2.262LysAsn: 2.262 ± 0.892
2.639LysPro: 2.639 ± 0.959
1.885LysGln: 1.885 ± 0.774
3.769LysArg: 3.769 ± 0.843
6.031LysSer: 6.031 ± 1.995
3.392LysThr: 3.392 ± 1.379
4.9LysVal: 4.9 ± 1.524
0.754LysTrp: 0.754 ± 1.226
2.639LysTyr: 2.639 ± 0.966
0.0LysXaa: 0.0 ± 0.0
Leu
5.654LeuAla: 5.654 ± 2.207
2.262LeuCys: 2.262 ± 2.493
6.031LeuAsp: 6.031 ± 0.864
8.669LeuGlu: 8.669 ± 2.759
4.523LeuPhe: 4.523 ± 1.915
7.539LeuGly: 7.539 ± 1.009
3.015LeuHis: 3.015 ± 1.196
6.785LeuIle: 6.785 ± 2.404
9.423LeuLys: 9.423 ± 1.853
12.062LeuLeu: 12.062 ± 2.323
2.262LeuMet: 2.262 ± 0.629
7.539LeuAsn: 7.539 ± 1.225
6.785LeuPro: 6.785 ± 2.664
3.015LeuGln: 3.015 ± 0.855
4.9LeuArg: 4.9 ± 1.857
9.423LeuSer: 9.423 ± 4.156
7.539LeuThr: 7.539 ± 2.211
6.408LeuVal: 6.408 ± 1.079
0.377LeuTrp: 0.377 ± 0.208
3.015LeuTyr: 3.015 ± 0.899
0.0LeuXaa: 0.0 ± 0.0
Met
2.262MetAla: 2.262 ± 1.502
1.131MetCys: 1.131 ± 0.623
1.131MetAsp: 1.131 ± 0.46
1.131MetGlu: 1.131 ± 0.46
0.377MetPhe: 0.377 ± 0.731
1.131MetGly: 1.131 ± 0.46
0.754MetHis: 0.754 ± 0.415
1.131MetIle: 1.131 ± 0.828
0.754MetLys: 0.754 ± 0.415
1.131MetLeu: 1.131 ± 1.034
0.754MetMet: 0.754 ± 0.415
0.754MetAsn: 0.754 ± 1.378
0.377MetPro: 0.377 ± 0.964
1.131MetGln: 1.131 ± 0.623
1.131MetArg: 1.131 ± 0.623
0.377MetSer: 0.377 ± 0.208
0.0MetThr: 0.0 ± 0.0
0.377MetVal: 0.377 ± 0.208
0.0MetTrp: 0.0 ± 0.0
0.377MetTyr: 0.377 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
1.508AsnAla: 1.508 ± 0.573
2.262AsnCys: 2.262 ± 1.167
1.508AsnAsp: 1.508 ± 0.831
3.769AsnGlu: 3.769 ± 0.964
3.769AsnPhe: 3.769 ± 0.848
1.131AsnGly: 1.131 ± 0.623
1.508AsnHis: 1.508 ± 0.509
1.508AsnIle: 1.508 ± 0.831
3.015AsnLys: 3.015 ± 1.378
5.654AsnLeu: 5.654 ± 2.228
1.131AsnMet: 1.131 ± 0.46
1.885AsnAsn: 1.885 ± 0.627
2.262AsnPro: 2.262 ± 0.921
1.508AsnGln: 1.508 ± 0.831
2.262AsnArg: 2.262 ± 0.951
3.392AsnSer: 3.392 ± 0.961
3.769AsnThr: 3.769 ± 2.084
1.885AsnVal: 1.885 ± 1.038
0.754AsnTrp: 0.754 ± 0.415
1.131AsnTyr: 1.131 ± 0.607
0.0AsnXaa: 0.0 ± 0.0
Pro
2.262ProAla: 2.262 ± 1.042
1.131ProCys: 1.131 ± 0.547
3.392ProAsp: 3.392 ± 1.043
3.015ProGlu: 3.015 ± 1.018
1.508ProPhe: 1.508 ± 0.768
1.885ProGly: 1.885 ± 2.265
0.377ProHis: 0.377 ± 0.208
1.131ProIle: 1.131 ± 0.834
2.262ProLys: 2.262 ± 1.669
3.392ProLeu: 3.392 ± 1.577
0.377ProMet: 0.377 ± 0.208
3.392ProAsn: 3.392 ± 1.225
2.262ProPro: 2.262 ± 1.99
0.754ProGln: 0.754 ± 0.415
1.131ProArg: 1.131 ± 0.623
1.885ProSer: 1.885 ± 1.134
3.392ProThr: 3.392 ± 1.685
1.885ProVal: 1.885 ± 1.031
0.754ProTrp: 0.754 ± 0.415
1.508ProTyr: 1.508 ± 0.831
0.0ProXaa: 0.0 ± 0.0
Gln
0.754GlnAla: 0.754 ± 0.415
0.377GlnCys: 0.377 ± 0.707
1.885GlnAsp: 1.885 ± 1.038
1.131GlnGlu: 1.131 ± 0.623
1.131GlnPhe: 1.131 ± 0.623
1.131GlnGly: 1.131 ± 0.547
1.508GlnHis: 1.508 ± 1.709
3.392GlnIle: 3.392 ± 1.338
1.885GlnLys: 1.885 ± 0.627
3.015GlnLeu: 3.015 ± 1.145
0.377GlnMet: 0.377 ± 0.208
1.131GlnAsn: 1.131 ± 0.547
1.508GlnPro: 1.508 ± 0.509
0.754GlnGln: 0.754 ± 0.415
1.131GlnArg: 1.131 ± 0.607
3.392GlnSer: 3.392 ± 0.825
0.377GlnThr: 0.377 ± 0.208
2.639GlnVal: 2.639 ± 0.959
0.377GlnTrp: 0.377 ± 0.996
0.377GlnTyr: 0.377 ± 0.208
0.0GlnXaa: 0.0 ± 0.0
Arg
3.769ArgAla: 3.769 ± 0.653
1.131ArgCys: 1.131 ± 2.988
1.885ArgAsp: 1.885 ± 0.806
4.146ArgGlu: 4.146 ± 1.444
5.654ArgPhe: 5.654 ± 1.597
1.885ArgGly: 1.885 ± 1.038
1.508ArgHis: 1.508 ± 1.194
1.131ArgIle: 1.131 ± 1.572
3.015ArgLys: 3.015 ± 1.696
4.9ArgLeu: 4.9 ± 1.245
0.754ArgMet: 0.754 ± 0.415
1.131ArgAsn: 1.131 ± 1.1
1.508ArgPro: 1.508 ± 0.972
0.377ArgGln: 0.377 ± 0.208
3.015ArgArg: 3.015 ± 2.919
4.146ArgSer: 4.146 ± 1.179
2.639ArgThr: 2.639 ± 1.454
3.392ArgVal: 3.392 ± 0.833
1.131ArgTrp: 1.131 ± 0.623
3.392ArgTyr: 3.392 ± 0.912
0.0ArgXaa: 0.0 ± 0.0
Ser
4.523SerAla: 4.523 ± 1.341
2.639SerCys: 2.639 ± 1.498
5.277SerAsp: 5.277 ± 1.29
4.9SerGlu: 4.9 ± 1.383
6.408SerPhe: 6.408 ± 1.529
4.523SerGly: 4.523 ± 1.277
0.754SerHis: 0.754 ± 0.854
4.9SerIle: 4.9 ± 0.973
7.162SerLys: 7.162 ± 2.566
7.539SerLeu: 7.539 ± 2.282
1.131SerMet: 1.131 ± 0.623
3.015SerAsn: 3.015 ± 1.93
3.769SerPro: 3.769 ± 1.27
2.639SerGln: 2.639 ± 1.54
4.146SerArg: 4.146 ± 1.654
11.685SerSer: 11.685 ± 3.546
2.639SerThr: 2.639 ± 1.502
3.769SerVal: 3.769 ± 1.401
0.754SerTrp: 0.754 ± 0.415
2.639SerTyr: 2.639 ± 1.629
0.0SerXaa: 0.0 ± 0.0
Thr
1.885ThrAla: 1.885 ± 1.323
0.754ThrCys: 0.754 ± 0.597
0.754ThrAsp: 0.754 ± 0.415
3.392ThrGlu: 3.392 ± 0.825
6.408ThrPhe: 6.408 ± 1.5
4.523ThrGly: 4.523 ± 0.932
2.639ThrHis: 2.639 ± 0.93
2.639ThrIle: 2.639 ± 1.454
2.639ThrLys: 2.639 ± 1.657
6.408ThrLeu: 6.408 ± 1.798
0.377ThrMet: 0.377 ± 0.208
2.639ThrAsn: 2.639 ± 0.558
0.754ThrPro: 0.754 ± 0.597
1.508ThrGln: 1.508 ± 0.509
3.392ThrArg: 3.392 ± 2.1
3.769ThrSer: 3.769 ± 2.47
1.885ThrThr: 1.885 ± 1.659
3.392ThrVal: 3.392 ± 1.225
0.377ThrTrp: 0.377 ± 0.613
1.885ThrTyr: 1.885 ± 0.865
0.0ThrXaa: 0.0 ± 0.0
Val
3.769ValAla: 3.769 ± 1.602
1.508ValCys: 1.508 ± 0.972
1.885ValAsp: 1.885 ± 0.627
3.769ValGlu: 3.769 ± 1.174
3.015ValPhe: 3.015 ± 0.657
3.392ValGly: 3.392 ± 1.323
1.131ValHis: 1.131 ± 0.623
3.392ValIle: 3.392 ± 1.175
3.015ValLys: 3.015 ± 1.661
7.539ValLeu: 7.539 ± 1.895
0.754ValMet: 0.754 ± 0.415
4.523ValAsn: 4.523 ± 1.436
3.015ValPro: 3.015 ± 1.863
1.885ValGln: 1.885 ± 0.627
3.769ValArg: 3.769 ± 1.106
6.785ValSer: 6.785 ± 2.44
3.015ValThr: 3.015 ± 1.359
4.9ValVal: 4.9 ± 2.028
0.377ValTrp: 0.377 ± 0.964
1.131ValTyr: 1.131 ± 0.623
0.0ValXaa: 0.0 ± 0.0
Trp
0.754TrpAla: 0.754 ± 0.854
0.377TrpCys: 0.377 ± 0.208
0.754TrpAsp: 0.754 ± 0.501
0.0TrpGlu: 0.0 ± 0.0
0.754TrpPhe: 0.754 ± 0.415
0.0TrpGly: 0.0 ± 0.0
0.377TrpHis: 0.377 ± 0.208
0.0TrpIle: 0.0 ± 0.0
0.377TrpLys: 0.377 ± 0.613
1.131TrpLeu: 1.131 ± 0.623
0.0TrpMet: 0.0 ± 0.0
0.377TrpAsn: 0.377 ± 0.613
0.377TrpPro: 0.377 ± 0.208
0.0TrpGln: 0.0 ± 0.0
0.754TrpArg: 0.754 ± 0.415
1.131TrpSer: 1.131 ± 0.46
0.0TrpThr: 0.0 ± 0.0
1.508TrpVal: 1.508 ± 0.89
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.885TyrAla: 1.885 ± 1.284
0.754TyrCys: 0.754 ± 0.501
0.377TyrAsp: 0.377 ± 0.208
1.508TyrGlu: 1.508 ± 0.831
0.754TyrPhe: 0.754 ± 0.892
1.131TyrGly: 1.131 ± 0.46
0.754TyrHis: 0.754 ± 0.854
3.015TyrIle: 3.015 ± 1.209
3.392TyrLys: 3.392 ± 1.465
5.277TyrLeu: 5.277 ± 2.371
0.377TyrMet: 0.377 ± 0.208
2.262TyrAsn: 2.262 ± 0.759
1.508TyrPro: 1.508 ± 0.91
0.754TyrGln: 0.754 ± 0.597
2.639TyrArg: 2.639 ± 1.172
3.015TyrSer: 3.015 ± 0.761
1.131TyrThr: 1.131 ± 0.785
1.131TyrVal: 1.131 ± 0.46
0.377TyrTrp: 0.377 ± 0.208
1.131TyrTyr: 1.131 ± 0.623
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2654 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski