Amino acid dipepetide frequency for Mouse mammary tumor virus (strain BR6) (MMTV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.409AlaAla: 6.409 ± 0.615
2.136AlaCys: 2.136 ± 0.345
1.899AlaAsp: 1.899 ± 0.521
5.697AlaGlu: 5.697 ± 1.062
1.899AlaPhe: 1.899 ± 0.484
4.035AlaGly: 4.035 ± 0.765
1.662AlaHis: 1.662 ± 0.153
3.56AlaIle: 3.56 ± 0.75
2.374AlaLys: 2.374 ± 0.733
6.171AlaLeu: 6.171 ± 2.393
1.424AlaMet: 1.424 ± 0.505
2.374AlaAsn: 2.374 ± 0.647
3.086AlaPro: 3.086 ± 0.15
2.374AlaGln: 2.374 ± 0.747
3.798AlaArg: 3.798 ± 0.544
1.899AlaSer: 1.899 ± 0.409
3.798AlaThr: 3.798 ± 0.665
4.747AlaVal: 4.747 ± 0.571
1.662AlaTrp: 1.662 ± 0.406
3.323AlaTyr: 3.323 ± 0.399
0.0AlaXaa: 0.0 ± 0.0
Cys
0.237CysAla: 0.237 ± 0.238
0.237CysCys: 0.237 ± 0.24
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.187CysPhe: 1.187 ± 0.127
1.187CysGly: 1.187 ± 0.38
0.237CysHis: 0.237 ± 0.238
0.712CysIle: 0.712 ± 0.253
2.848CysLys: 2.848 ± 0.638
1.899CysLeu: 1.899 ± 0.558
0.237CysMet: 0.237 ± 0.24
0.0CysAsn: 0.0 ± 0.0
1.662CysPro: 1.662 ± 0.417
1.187CysGln: 1.187 ± 0.127
0.237CysArg: 0.237 ± 0.238
0.237CysSer: 0.237 ± 0.238
0.237CysThr: 0.237 ± 0.151
0.712CysVal: 0.712 ± 0.445
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.662AspAla: 1.662 ± 0.338
1.187AspCys: 1.187 ± 0.315
5.459AspAsp: 5.459 ± 1.338
3.798AspGlu: 3.798 ± 0.651
1.662AspPhe: 1.662 ± 0.796
2.374AspGly: 2.374 ± 0.661
0.712AspHis: 0.712 ± 0.252
2.848AspIle: 2.848 ± 0.227
4.51AspLys: 4.51 ± 0.461
8.308AspLeu: 8.308 ± 0.67
2.374AspMet: 2.374 ± 0.418
1.187AspAsn: 1.187 ± 0.526
1.424AspPro: 1.424 ± 0.543
1.662AspGln: 1.662 ± 0.895
0.949AspArg: 0.949 ± 0.516
5.697AspSer: 5.697 ± 0.549
2.611AspThr: 2.611 ± 0.333
2.374AspVal: 2.374 ± 0.354
4.035AspTrp: 4.035 ± 0.865
2.374AspTyr: 2.374 ± 0.426
0.0AspXaa: 0.0 ± 0.0
Glu
4.272GluAla: 4.272 ± 0.211
0.949GluCys: 0.949 ± 0.295
3.086GluAsp: 3.086 ± 0.791
9.02GluGlu: 9.02 ± 2.089
1.187GluPhe: 1.187 ± 0.315
4.985GluGly: 4.985 ± 1.194
1.899GluHis: 1.899 ± 0.409
4.51GluIle: 4.51 ± 0.701
8.07GluLys: 8.07 ± 1.407
4.035GluLeu: 4.035 ± 0.676
2.611GluMet: 2.611 ± 0.786
3.323GluAsn: 3.323 ± 0.53
2.136GluPro: 2.136 ± 0.322
2.374GluGln: 2.374 ± 0.226
3.323GluArg: 3.323 ± 0.509
4.747GluSer: 4.747 ± 0.503
1.899GluThr: 1.899 ± 0.484
2.848GluVal: 2.848 ± 0.502
0.237GluTrp: 0.237 ± 0.151
0.949GluTyr: 0.949 ± 0.242
0.0GluXaa: 0.0 ± 0.0
Phe
2.136PheAla: 2.136 ± 0.437
0.237PheCys: 0.237 ± 0.151
1.424PheAsp: 1.424 ± 0.175
0.712PheGlu: 0.712 ± 0.31
0.237PhePhe: 0.237 ± 0.151
0.949PheGly: 0.949 ± 0.191
1.187PheHis: 1.187 ± 0.504
1.662PheIle: 1.662 ± 0.521
0.475PheLys: 0.475 ± 0.232
4.035PheLeu: 4.035 ± 0.322
0.712PheMet: 0.712 ± 0.252
0.712PheAsn: 0.712 ± 0.324
2.848PhePro: 2.848 ± 0.143
1.899PheGln: 1.899 ± 0.593
0.949PheArg: 0.949 ± 0.779
2.374PheSer: 2.374 ± 0.733
3.323PheThr: 3.323 ± 0.509
3.086PheVal: 3.086 ± 0.286
0.475PheTrp: 0.475 ± 0.476
0.949PheTyr: 0.949 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
3.086GlyAla: 3.086 ± 0.317
0.237GlyCys: 0.237 ± 0.238
3.323GlyAsp: 3.323 ± 0.25
3.798GlyGlu: 3.798 ± 0.491
1.899GlyPhe: 1.899 ± 0.23
4.035GlyGly: 4.035 ± 0.602
1.899GlyHis: 1.899 ± 0.705
2.136GlyIle: 2.136 ± 0.558
5.459GlyLys: 5.459 ± 0.626
10.207GlyLeu: 10.207 ± 1.398
1.899GlyMet: 1.899 ± 0.464
2.848GlyAsn: 2.848 ± 0.59
3.086GlyPro: 3.086 ± 0.384
3.798GlyGln: 3.798 ± 0.903
4.035GlyArg: 4.035 ± 0.976
5.459GlySer: 5.459 ± 0.726
5.934GlyThr: 5.934 ± 1.434
4.747GlyVal: 4.747 ± 0.644
0.712GlyTrp: 0.712 ± 0.445
1.187GlyTyr: 1.187 ± 0.245
0.0GlyXaa: 0.0 ± 0.0
His
0.949HisAla: 0.949 ± 0.43
0.237HisCys: 0.237 ± 0.238
1.662HisAsp: 1.662 ± 0.347
0.712HisGlu: 0.712 ± 0.195
0.712HisPhe: 0.712 ± 0.31
1.187HisGly: 1.187 ± 0.127
0.475HisHis: 0.475 ± 0.303
3.323HisIle: 3.323 ± 0.368
0.949HisLys: 0.949 ± 0.279
2.374HisLeu: 2.374 ± 1.053
0.237HisMet: 0.237 ± 0.151
0.0HisAsn: 0.0 ± 0.0
1.187HisPro: 1.187 ± 0.285
2.374HisGln: 2.374 ± 0.259
0.712HisArg: 0.712 ± 0.445
0.237HisSer: 0.237 ± 0.24
0.949HisThr: 0.949 ± 0.351
2.848HisVal: 2.848 ± 0.725
1.424HisTrp: 1.424 ± 0.376
1.187HisTyr: 1.187 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
1.662IleAla: 1.662 ± 0.438
0.475IleCys: 0.475 ± 0.476
2.136IleAsp: 2.136 ± 0.423
2.136IleGlu: 2.136 ± 0.595
1.899IlePhe: 1.899 ± 0.822
1.899IleGly: 1.899 ± 0.54
2.374IleHis: 2.374 ± 0.86
3.798IleIle: 3.798 ± 0.892
6.409IleLys: 6.409 ± 0.832
5.459IleLeu: 5.459 ± 0.247
0.475IleMet: 0.475 ± 0.199
0.475IleAsn: 0.475 ± 0.232
2.611IlePro: 2.611 ± 0.4
2.611IleGln: 2.611 ± 0.507
4.51IleArg: 4.51 ± 0.48
3.798IleSer: 3.798 ± 0.89
1.187IleThr: 1.187 ± 0.565
2.374IleVal: 2.374 ± 1.043
1.424IleTrp: 1.424 ± 0.175
1.899IleTyr: 1.899 ± 0.464
0.0IleXaa: 0.0 ± 0.0
Lys
3.798LysAla: 3.798 ± 0.67
0.0LysCys: 0.0 ± 0.0
8.07LysAsp: 8.07 ± 1.292
6.646LysGlu: 6.646 ± 0.877
1.187LysPhe: 1.187 ± 0.315
9.02LysGly: 9.02 ± 1.562
0.712LysHis: 0.712 ± 0.31
2.374LysIle: 2.374 ± 0.458
6.409LysLys: 6.409 ± 1.213
5.934LysLeu: 5.934 ± 0.924
0.0LysMet: 0.0 ± 0.0
3.086LysAsn: 3.086 ± 0.384
2.848LysPro: 2.848 ± 0.143
4.035LysGln: 4.035 ± 0.649
7.596LysArg: 7.596 ± 0.912
4.51LysSer: 4.51 ± 0.629
5.934LysThr: 5.934 ± 0.651
4.985LysVal: 4.985 ± 0.883
0.475LysTrp: 0.475 ± 0.242
2.374LysTyr: 2.374 ± 0.741
0.0LysXaa: 0.0 ± 0.0
Leu
5.934LeuAla: 5.934 ± 0.412
2.611LeuCys: 2.611 ± 0.504
3.56LeuAsp: 3.56 ± 0.393
5.697LeuGlu: 5.697 ± 0.501
2.848LeuPhe: 2.848 ± 0.592
5.934LeuGly: 5.934 ± 1.531
2.136LeuHis: 2.136 ± 0.208
6.409LeuIle: 6.409 ± 0.631
6.409LeuLys: 6.409 ± 0.656
10.444LeuLeu: 10.444 ± 1.235
1.424LeuMet: 1.424 ± 0.226
4.51LeuAsn: 4.51 ± 0.678
9.494LeuPro: 9.494 ± 0.92
7.596LeuGln: 7.596 ± 1.672
4.035LeuArg: 4.035 ± 0.731
9.02LeuSer: 9.02 ± 1.245
6.409LeuThr: 6.409 ± 0.489
4.51LeuVal: 4.51 ± 0.964
2.136LeuTrp: 2.136 ± 0.208
0.475LeuTyr: 0.475 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
1.899MetAla: 1.899 ± 0.614
0.0MetCys: 0.0 ± 0.0
0.712MetAsp: 0.712 ± 0.454
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.848MetGly: 2.848 ± 0.638
0.237MetHis: 0.237 ± 0.151
0.949MetIle: 0.949 ± 0.463
1.662MetLys: 1.662 ± 0.347
1.899MetLeu: 1.899 ± 0.704
0.712MetMet: 0.712 ± 0.252
0.475MetAsn: 0.475 ± 0.232
1.187MetPro: 1.187 ± 0.289
0.237MetGln: 0.237 ± 0.238
0.949MetArg: 0.949 ± 0.217
1.424MetSer: 1.424 ± 1.071
0.712MetThr: 0.712 ± 0.253
2.374MetVal: 2.374 ± 0.414
0.475MetTrp: 0.475 ± 0.232
0.237MetTyr: 0.237 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
4.272AsnAla: 4.272 ± 0.373
0.475AsnCys: 0.475 ± 0.232
1.899AsnAsp: 1.899 ± 0.507
2.374AsnGlu: 2.374 ± 0.226
0.475AsnPhe: 0.475 ± 0.242
1.424AsnGly: 1.424 ± 0.293
0.237AsnHis: 0.237 ± 0.151
0.949AsnIle: 0.949 ± 0.43
1.187AsnLys: 1.187 ± 0.504
3.56AsnLeu: 3.56 ± 0.759
0.475AsnMet: 0.475 ± 0.232
0.475AsnAsn: 0.475 ± 0.476
2.848AsnPro: 2.848 ± 0.39
0.712AsnGln: 0.712 ± 0.31
0.475AsnArg: 0.475 ± 0.357
3.798AsnSer: 3.798 ± 0.581
0.949AsnThr: 0.949 ± 0.43
0.712AsnVal: 0.712 ± 0.714
0.949AsnTrp: 0.949 ± 0.191
1.187AsnTyr: 1.187 ± 0.46
0.0AsnXaa: 0.0 ± 0.0
Pro
2.136ProAla: 2.136 ± 0.379
0.475ProCys: 0.475 ± 0.357
1.899ProAsp: 1.899 ± 0.561
3.086ProGlu: 3.086 ± 0.791
2.374ProPhe: 2.374 ± 0.709
4.51ProGly: 4.51 ± 0.461
1.899ProHis: 1.899 ± 0.757
4.747ProIle: 4.747 ± 0.804
3.086ProLys: 3.086 ± 0.948
5.934ProLeu: 5.934 ± 0.786
0.949ProMet: 0.949 ± 0.362
1.187ProAsn: 1.187 ± 0.285
6.171ProPro: 6.171 ± 1.132
2.611ProGln: 2.611 ± 0.507
3.798ProArg: 3.798 ± 0.17
4.51ProSer: 4.51 ± 0.486
3.798ProThr: 3.798 ± 1.25
5.697ProVal: 5.697 ± 0.726
1.662ProTrp: 1.662 ± 0.153
4.035ProTyr: 4.035 ± 0.25
0.0ProXaa: 0.0 ± 0.0
Gln
3.56GlnAla: 3.56 ± 0.658
0.237GlnCys: 0.237 ± 0.238
4.272GlnAsp: 4.272 ± 0.618
3.323GlnGlu: 3.323 ± 0.599
1.899GlnPhe: 1.899 ± 0.484
5.222GlnGly: 5.222 ± 0.763
1.187GlnHis: 1.187 ± 0.35
1.899GlnIle: 1.899 ± 0.285
4.747GlnLys: 4.747 ± 1.048
3.798GlnLeu: 3.798 ± 0.888
0.237GlnMet: 0.237 ± 0.151
0.712GlnAsn: 0.712 ± 0.454
2.136GlnPro: 2.136 ± 0.775
1.187GlnGln: 1.187 ± 0.552
2.848GlnArg: 2.848 ± 0.143
3.086GlnSer: 3.086 ± 0.574
3.323GlnThr: 3.323 ± 0.78
1.899GlnVal: 1.899 ± 0.771
1.187GlnTrp: 1.187 ± 0.315
0.237GlnTyr: 0.237 ± 0.238
0.0GlnXaa: 0.0 ± 0.0
Arg
4.51ArgAla: 4.51 ± 0.954
0.949ArgCys: 0.949 ± 0.217
1.662ArgAsp: 1.662 ± 0.381
3.798ArgGlu: 3.798 ± 0.461
1.662ArgPhe: 1.662 ± 0.554
5.459ArgGly: 5.459 ± 0.879
0.949ArgHis: 0.949 ± 0.351
0.949ArgIle: 0.949 ± 0.191
7.121ArgLys: 7.121 ± 1.448
4.51ArgLeu: 4.51 ± 0.69
1.899ArgMet: 1.899 ± 0.771
0.712ArgAsn: 0.712 ± 0.557
3.086ArgPro: 3.086 ± 0.467
0.475ArgGln: 0.475 ± 0.242
2.374ArgArg: 2.374 ± 0.761
2.611ArgSer: 2.611 ± 0.818
3.086ArgThr: 3.086 ± 0.469
1.662ArgVal: 1.662 ± 0.275
1.662ArgTrp: 1.662 ± 0.563
1.187ArgTyr: 1.187 ± 0.289
0.0ArgXaa: 0.0 ± 0.0
Ser
6.171SerAla: 6.171 ± 0.515
1.187SerCys: 1.187 ± 0.379
5.697SerAsp: 5.697 ± 0.885
4.51SerGlu: 4.51 ± 0.728
1.899SerPhe: 1.899 ± 1.275
3.56SerGly: 3.56 ± 0.746
1.187SerHis: 1.187 ± 0.501
1.424SerIle: 1.424 ± 0.281
6.171SerLys: 6.171 ± 1.336
4.747SerLeu: 4.747 ± 0.364
0.237SerMet: 0.237 ± 0.151
1.187SerAsn: 1.187 ± 0.501
8.308SerPro: 8.308 ± 0.58
4.272SerGln: 4.272 ± 0.919
2.136SerArg: 2.136 ± 0.165
5.934SerSer: 5.934 ± 0.975
4.51SerThr: 4.51 ± 0.582
3.086SerVal: 3.086 ± 0.292
0.475SerTrp: 0.475 ± 0.303
2.374SerTyr: 2.374 ± 0.622
0.0SerXaa: 0.0 ± 0.0
Thr
4.985ThrAla: 4.985 ± 1.261
0.475ThrCys: 0.475 ± 0.199
3.798ThrAsp: 3.798 ± 0.17
4.985ThrGlu: 4.985 ± 0.714
4.272ThrPhe: 4.272 ± 0.528
6.171ThrGly: 6.171 ± 0.658
0.712ThrHis: 0.712 ± 0.454
1.424ThrIle: 1.424 ± 0.382
1.424ThrLys: 1.424 ± 0.556
6.409ThrLeu: 6.409 ± 0.466
1.662ThrMet: 1.662 ± 0.319
2.611ThrAsn: 2.611 ± 0.706
5.459ThrPro: 5.459 ± 1.105
1.187ThrGln: 1.187 ± 0.515
2.136ThrArg: 2.136 ± 0.493
3.323ThrSer: 3.323 ± 0.657
2.136ThrThr: 2.136 ± 0.208
1.899ThrVal: 1.899 ± 0.362
0.237ThrTrp: 0.237 ± 0.238
2.611ThrTyr: 2.611 ± 0.353
0.0ThrXaa: 0.0 ± 0.0
Val
3.086ValAla: 3.086 ± 0.701
0.712ValCys: 0.712 ± 0.252
3.56ValAsp: 3.56 ± 0.75
1.187ValGlu: 1.187 ± 0.504
1.662ValPhe: 1.662 ± 0.275
1.899ValGly: 1.899 ± 0.434
1.899ValHis: 1.899 ± 0.448
3.086ValIle: 3.086 ± 0.864
7.121ValLys: 7.121 ± 1.065
7.833ValLeu: 7.833 ± 0.721
0.712ValMet: 0.712 ± 0.253
1.424ValAsn: 1.424 ± 0.543
2.374ValPro: 2.374 ± 0.819
3.56ValGln: 3.56 ± 0.627
2.374ValArg: 2.374 ± 0.539
4.985ValSer: 4.985 ± 0.714
4.272ValThr: 4.272 ± 0.613
4.51ValVal: 4.51 ± 0.996
0.949ValTrp: 0.949 ± 0.242
0.712ValTyr: 0.712 ± 0.252
0.0ValXaa: 0.0 ± 0.0
Trp
0.237TrpAla: 0.237 ± 0.238
0.237TrpCys: 0.237 ± 0.24
0.949TrpAsp: 0.949 ± 0.217
2.611TrpGlu: 2.611 ± 0.663
1.187TrpPhe: 1.187 ± 0.379
1.424TrpGly: 1.424 ± 0.293
0.712TrpHis: 0.712 ± 0.252
0.475TrpIle: 0.475 ± 0.303
2.611TrpLys: 2.611 ± 0.74
2.374TrpLeu: 2.374 ± 0.311
0.0TrpMet: 0.0 ± 0.0
1.187TrpAsn: 1.187 ± 0.127
1.424TrpPro: 1.424 ± 0.507
1.424TrpGln: 1.424 ± 0.293
1.662TrpArg: 1.662 ± 0.471
0.237TrpSer: 0.237 ± 0.151
0.237TrpThr: 0.237 ± 0.238
1.187TrpVal: 1.187 ± 0.252
0.237TrpTrp: 0.237 ± 0.24
0.237TrpTyr: 0.237 ± 0.238
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.323TyrAla: 3.323 ± 0.439
0.0TyrCys: 0.0 ± 0.0
2.136TyrAsp: 2.136 ± 0.45
2.374TyrGlu: 2.374 ± 0.641
0.237TyrPhe: 0.237 ± 0.238
1.662TyrGly: 1.662 ± 0.494
1.424TyrHis: 1.424 ± 0.505
2.374TyrIle: 2.374 ± 0.686
0.949TyrLys: 0.949 ± 0.476
1.899TyrLeu: 1.899 ± 0.38
0.475TyrMet: 0.475 ± 0.303
0.949TyrAsn: 0.949 ± 0.509
1.187TyrPro: 1.187 ± 0.379
1.424TyrGln: 1.424 ± 0.431
1.424TyrArg: 1.424 ± 0.693
1.187TyrSer: 1.187 ± 0.245
2.611TyrThr: 2.611 ± 0.507
1.662TyrVal: 1.662 ± 0.303
0.237TyrTrp: 0.237 ± 0.238
0.475TyrTyr: 0.475 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (4214 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski