Amino acid dipeptide frequency for DeBrazza's monkey arterivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If all 400 dipeptides were equally likely in proteins, each would occur with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
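The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names `dipeptide_per_mille` and `representation` are hypothetical, sequences are assumed to use one-letter codes, and pairs are counted within each protein only (the page does not say how protein boundaries are handled). The comparison against 2.5‰ likewise assumes that "expected" means the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"        # 20 standard residues, one-letter codes
UNIFORM_EXPECTATION = 1000.0 / 400          # 0.25% expressed in per mille = 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and any non-standard residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"            # 21 symbols -> 441 possible pairs
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def representation(per_mille):
    """Classify a frequency against the uniform expectation of 2.5 per mille."""
    if per_mille > UNIFORM_EXPECTATION:
        return "over"
    if per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAAA", "AC"])   # toy input, not real data
    print(freqs["AA"], freqs["AC"])
    print(representation(14.925))                 # LeuLeu value from the table
    print(representation(0.132))                  # MetMet value from the table
```

With the values in the table, LeuLeu (14.925‰) would be classified as overrepresented and MetMet (0.132‰) as underrepresented under this assumption.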

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.491AlaAla: 11.491 ± 2.023
3.434AlaCys: 3.434 ± 0.648
3.962AlaAsp: 3.962 ± 0.7
2.245AlaGlu: 2.245 ± 0.316
3.566AlaPhe: 3.566 ± 0.897
4.359AlaGly: 4.359 ± 0.463
1.585AlaHis: 1.585 ± 0.662
5.151AlaIle: 5.151 ± 0.665
2.51AlaLys: 2.51 ± 0.563
11.359AlaLeu: 11.359 ± 0.888
0.66AlaMet: 0.66 ± 0.185
4.095AlaAsn: 4.095 ± 0.566
6.736AlaPro: 6.736 ± 1.215
2.245AlaGln: 2.245 ± 0.633
4.359AlaArg: 4.359 ± 0.575
8.585AlaSer: 8.585 ± 1.048
6.736AlaThr: 6.736 ± 1.366
7.925AlaVal: 7.925 ± 1.191
0.66AlaTrp: 0.66 ± 0.425
3.434AlaTyr: 3.434 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
2.245CysAla: 2.245 ± 0.415
1.453CysCys: 1.453 ± 0.281
1.321CysAsp: 1.321 ± 0.211
0.925CysGlu: 0.925 ± 0.204
1.453CysPhe: 1.453 ± 0.531
3.17CysGly: 3.17 ± 0.518
0.792CysHis: 0.792 ± 0.276
1.849CysIle: 1.849 ± 0.425
1.453CysLys: 1.453 ± 0.357
3.302CysLeu: 3.302 ± 1.08
1.189CysMet: 1.189 ± 0.294
1.057CysAsn: 1.057 ± 0.395
1.585CysPro: 1.585 ± 0.32
0.66CysGln: 0.66 ± 0.205
1.057CysArg: 1.057 ± 0.475
2.113CysSer: 2.113 ± 0.551
1.981CysThr: 1.981 ± 0.327
2.642CysVal: 2.642 ± 0.546
0.925CysTrp: 0.925 ± 0.272
0.66CysTyr: 0.66 ± 0.246
0.0CysXaa: 0.0 ± 0.0
Asp
4.359AspAla: 4.359 ± 0.759
0.792AspCys: 0.792 ± 0.34
0.925AspAsp: 0.925 ± 0.26
1.585AspGlu: 1.585 ± 0.492
2.113AspPhe: 2.113 ± 0.381
2.774AspGly: 2.774 ± 0.552
1.189AspHis: 1.189 ± 0.207
1.717AspIle: 1.717 ± 0.268
2.113AspLys: 2.113 ± 0.411
4.755AspLeu: 4.755 ± 0.818
0.792AspMet: 0.792 ± 0.314
0.792AspAsn: 0.792 ± 0.269
3.83AspPro: 3.83 ± 1.084
0.792AspGln: 0.792 ± 0.488
2.113AspArg: 2.113 ± 0.693
5.151AspSer: 5.151 ± 0.934
2.113AspThr: 2.113 ± 0.491
2.113AspVal: 2.113 ± 0.402
0.792AspTrp: 0.792 ± 0.258
1.453AspTyr: 1.453 ± 0.34
0.0AspXaa: 0.0 ± 0.0
Glu
2.51GluAla: 2.51 ± 0.584
1.453GluCys: 1.453 ± 0.458
0.792GluAsp: 0.792 ± 0.176
2.245GluGlu: 2.245 ± 0.568
1.321GluPhe: 1.321 ± 0.41
1.585GluGly: 1.585 ± 0.257
1.321GluHis: 1.321 ± 0.608
1.321GluIle: 1.321 ± 0.292
1.189GluLys: 1.189 ± 0.387
2.906GluLeu: 2.906 ± 0.439
0.66GluMet: 0.66 ± 0.205
2.113GluAsn: 2.113 ± 0.416
0.528GluPro: 0.528 ± 0.227
0.925GluGln: 0.925 ± 0.362
2.377GluArg: 2.377 ± 0.383
2.51GluSer: 2.51 ± 0.473
3.17GluThr: 3.17 ± 0.845
1.453GluVal: 1.453 ± 0.458
0.132GluTrp: 0.132 ± 0.199
1.585GluTyr: 1.585 ± 0.296
0.0GluXaa: 0.0 ± 0.0
Phe
3.17PheAla: 3.17 ± 0.489
2.113PheCys: 2.113 ± 0.327
2.51PheAsp: 2.51 ± 0.517
0.925PheGlu: 0.925 ± 0.194
2.377PhePhe: 2.377 ± 0.507
3.83PheGly: 3.83 ± 0.89
1.189PheHis: 1.189 ± 0.275
2.113PheIle: 2.113 ± 0.36
1.981PheLys: 1.981 ± 0.756
5.812PheLeu: 5.812 ± 0.934
0.264PheMet: 0.264 ± 0.088
1.981PheAsn: 1.981 ± 0.764
1.717PhePro: 1.717 ± 0.363
2.113PheGln: 2.113 ± 0.429
1.585PheArg: 1.585 ± 0.48
3.17PheSer: 3.17 ± 0.544
1.981PheThr: 1.981 ± 0.335
3.698PheVal: 3.698 ± 0.573
0.264PheTrp: 0.264 ± 0.328
1.585PheTyr: 1.585 ± 0.303
0.0PheXaa: 0.0 ± 0.0
Gly
3.698GlyAla: 3.698 ± 0.663
3.302GlyCys: 3.302 ± 0.721
3.434GlyAsp: 3.434 ± 0.915
1.717GlyGlu: 1.717 ± 0.426
3.566GlyPhe: 3.566 ± 0.894
4.359GlyGly: 4.359 ± 0.69
1.321GlyHis: 1.321 ± 0.79
3.698GlyIle: 3.698 ± 0.481
4.491GlyLys: 4.491 ± 1.125
5.812GlyLeu: 5.812 ± 0.588
1.057GlyMet: 1.057 ± 0.242
1.981GlyAsn: 1.981 ± 0.437
2.642GlyPro: 2.642 ± 0.501
1.585GlyGln: 1.585 ± 0.424
3.83GlyArg: 3.83 ± 0.793
6.076GlySer: 6.076 ± 1.031
5.812GlyThr: 5.812 ± 1.17
6.208GlyVal: 6.208 ± 1.299
0.264GlyTrp: 0.264 ± 0.387
2.774GlyTyr: 2.774 ± 0.526
0.0GlyXaa: 0.0 ± 0.0
His
2.774HisAla: 2.774 ± 0.536
0.66HisCys: 0.66 ± 0.412
1.189HisAsp: 1.189 ± 0.415
1.189HisGlu: 1.189 ± 0.324
0.528HisPhe: 0.528 ± 0.293
2.113HisGly: 2.113 ± 0.484
1.057HisHis: 1.057 ± 0.266
1.717HisIle: 1.717 ± 0.576
1.453HisLys: 1.453 ± 0.326
2.774HisLeu: 2.774 ± 1.716
0.528HisMet: 0.528 ± 0.236
0.66HisAsn: 0.66 ± 0.294
2.51HisPro: 2.51 ± 0.932
0.66HisGln: 0.66 ± 0.554
1.057HisArg: 1.057 ± 0.188
1.585HisSer: 1.585 ± 1.108
1.849HisThr: 1.849 ± 0.756
1.585HisVal: 1.585 ± 0.34
0.66HisTrp: 0.66 ± 0.246
1.189HisTyr: 1.189 ± 0.351
0.0HisXaa: 0.0 ± 0.0
Ile
5.415IleAla: 5.415 ± 0.597
1.453IleCys: 1.453 ± 0.606
1.717IleAsp: 1.717 ± 0.374
0.264IleGlu: 0.264 ± 0.185
1.321IlePhe: 1.321 ± 0.527
2.642IleGly: 2.642 ± 0.589
1.189IleHis: 1.189 ± 0.418
1.717IleIle: 1.717 ± 0.941
1.321IleLys: 1.321 ± 0.419
4.623IleLeu: 4.623 ± 1.438
0.925IleMet: 0.925 ± 0.23
1.321IleAsn: 1.321 ± 0.436
2.906IlePro: 2.906 ± 0.314
1.849IleGln: 1.849 ± 0.481
2.906IleArg: 2.906 ± 0.673
5.019IleSer: 5.019 ± 0.712
4.095IleThr: 4.095 ± 1.107
3.434IleVal: 3.434 ± 0.693
0.264IleTrp: 0.264 ± 0.088
1.057IleTyr: 1.057 ± 0.368
0.0IleXaa: 0.0 ± 0.0
Lys
3.302LysAla: 3.302 ± 0.852
0.792LysCys: 0.792 ± 0.532
2.113LysAsp: 2.113 ± 0.438
1.057LysGlu: 1.057 ± 0.254
1.453LysPhe: 1.453 ± 0.352
1.585LysGly: 1.585 ± 0.383
0.925LysHis: 0.925 ± 0.447
0.792LysIle: 0.792 ± 0.157
1.981LysLys: 1.981 ± 0.358
2.377LysLeu: 2.377 ± 0.773
1.189LysMet: 1.189 ± 0.442
1.453LysAsn: 1.453 ± 0.432
2.51LysPro: 2.51 ± 0.434
1.585LysGln: 1.585 ± 0.343
2.113LysArg: 2.113 ± 0.403
1.849LysSer: 1.849 ± 0.632
2.642LysThr: 2.642 ± 0.568
5.68LysVal: 5.68 ± 0.982
0.66LysTrp: 0.66 ± 0.205
1.057LysTyr: 1.057 ± 0.287
0.0LysXaa: 0.0 ± 0.0
Leu
12.02LeuAla: 12.02 ± 1.025
2.774LeuCys: 2.774 ± 1.081
5.151LeuAsp: 5.151 ± 0.85
3.434LeuGlu: 3.434 ± 0.65
5.68LeuPhe: 5.68 ± 0.614
7.925LeuGly: 7.925 ± 1.023
2.113LeuHis: 2.113 ± 0.342
5.151LeuIle: 5.151 ± 0.678
2.51LeuLys: 2.51 ± 0.602
14.925LeuLeu: 14.925 ± 2.291
0.925LeuMet: 0.925 ± 0.409
3.83LeuAsn: 3.83 ± 0.339
7.132LeuPro: 7.132 ± 1.428
3.83LeuGln: 3.83 ± 0.558
5.547LeuArg: 5.547 ± 1.113
10.831LeuSer: 10.831 ± 0.865
5.415LeuThr: 5.415 ± 1.041
8.85LeuVal: 8.85 ± 0.78
0.792LeuTrp: 0.792 ± 0.38
1.849LeuTyr: 1.849 ± 0.287
0.0LeuXaa: 0.0 ± 0.0
Met
1.849MetAla: 1.849 ± 0.393
0.528MetCys: 0.528 ± 0.238
0.0MetAsp: 0.0 ± 0.0
0.396MetGlu: 0.396 ± 0.248
0.264MetPhe: 0.264 ± 0.149
1.585MetGly: 1.585 ± 0.418
0.264MetHis: 0.264 ± 0.333
1.453MetIle: 1.453 ± 0.382
0.66MetLys: 0.66 ± 0.26
1.981MetLeu: 1.981 ± 0.519
0.132MetMet: 0.132 ± 0.083
0.0MetAsn: 0.0 ± 0.0
0.66MetPro: 0.66 ± 0.248
0.0MetGln: 0.0 ± 0.0
0.528MetArg: 0.528 ± 0.165
0.792MetSer: 0.792 ± 0.248
0.792MetThr: 0.792 ± 0.258
1.189MetVal: 1.189 ± 0.703
0.528MetTrp: 0.528 ± 0.177
0.132MetTyr: 0.132 ± 0.083
0.0MetXaa: 0.0 ± 0.0
Asn
3.566AsnAla: 3.566 ± 0.564
0.66AsnCys: 0.66 ± 0.19
0.925AsnAsp: 0.925 ± 0.323
1.585AsnGlu: 1.585 ± 0.589
1.189AsnPhe: 1.189 ± 0.324
2.642AsnGly: 2.642 ± 0.419
0.925AsnHis: 0.925 ± 0.747
0.925AsnIle: 0.925 ± 0.604
2.642AsnLys: 2.642 ± 0.419
3.962AsnLeu: 3.962 ± 0.608
0.132AsnMet: 0.132 ± 0.228
2.113AsnAsn: 2.113 ± 0.384
1.321AsnPro: 1.321 ± 0.417
1.321AsnGln: 1.321 ± 0.565
1.585AsnArg: 1.585 ± 0.492
1.189AsnSer: 1.189 ± 0.735
2.774AsnThr: 2.774 ± 0.398
2.377AsnVal: 2.377 ± 1.018
0.132AsnTrp: 0.132 ± 0.083
0.925AsnTyr: 0.925 ± 0.487
0.0AsnXaa: 0.0 ± 0.0
Pro
7.0ProAla: 7.0 ± 1.359
1.189ProCys: 1.189 ± 0.296
3.962ProAsp: 3.962 ± 0.776
2.245ProGlu: 2.245 ± 0.496
2.245ProPhe: 2.245 ± 0.393
4.095ProGly: 4.095 ± 0.781
1.585ProHis: 1.585 ± 0.358
1.585ProIle: 1.585 ± 0.334
2.377ProLys: 2.377 ± 0.529
7.529ProLeu: 7.529 ± 1.177
0.528ProMet: 0.528 ± 0.19
0.66ProAsn: 0.66 ± 0.341
4.227ProPro: 4.227 ± 0.7
2.377ProGln: 2.377 ± 0.433
3.17ProArg: 3.17 ± 0.536
4.755ProSer: 4.755 ± 1.34
5.415ProThr: 5.415 ± 0.874
4.359ProVal: 4.359 ± 0.41
0.925ProTrp: 0.925 ± 0.64
1.717ProTyr: 1.717 ± 0.472
0.0ProXaa: 0.0 ± 0.0
Gln
3.038GlnAla: 3.038 ± 0.984
0.264GlnCys: 0.264 ± 0.222
0.792GlnAsp: 0.792 ± 0.34
1.453GlnGlu: 1.453 ± 0.302
1.057GlnPhe: 1.057 ± 0.334
1.981GlnGly: 1.981 ± 0.331
2.113GlnHis: 2.113 ± 0.429
0.792GlnIle: 0.792 ± 0.406
0.528GlnLys: 0.528 ± 0.162
5.151GlnLeu: 5.151 ± 0.678
0.264GlnMet: 0.264 ± 0.088
1.321GlnAsn: 1.321 ± 0.425
1.981GlnPro: 1.981 ± 0.435
1.453GlnGln: 1.453 ± 0.414
1.189GlnArg: 1.189 ± 0.265
2.377GlnSer: 2.377 ± 0.591
2.113GlnThr: 2.113 ± 0.443
1.321GlnVal: 1.321 ± 0.295
0.264GlnTrp: 0.264 ± 0.088
1.717GlnTyr: 1.717 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
4.623ArgAla: 4.623 ± 0.61
1.849ArgCys: 1.849 ± 0.581
1.981ArgAsp: 1.981 ± 0.587
1.981ArgGlu: 1.981 ± 0.312
2.113ArgPhe: 2.113 ± 0.398
4.359ArgGly: 4.359 ± 0.924
2.51ArgHis: 2.51 ± 0.44
1.849ArgIle: 1.849 ± 0.418
1.585ArgLys: 1.585 ± 0.318
6.34ArgLeu: 6.34 ± 0.534
0.66ArgMet: 0.66 ± 0.429
1.057ArgAsn: 1.057 ± 0.225
1.981ArgPro: 1.981 ± 0.638
1.585ArgGln: 1.585 ± 0.327
2.642ArgArg: 2.642 ± 1.395
4.095ArgSer: 4.095 ± 0.663
3.434ArgThr: 3.434 ± 0.509
3.962ArgVal: 3.962 ± 0.662
0.396ArgTrp: 0.396 ± 0.374
2.377ArgTyr: 2.377 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
7.132SerAla: 7.132 ± 1.064
2.245SerCys: 2.245 ± 0.825
3.302SerAsp: 3.302 ± 0.478
3.83SerGlu: 3.83 ± 0.765
3.962SerPhe: 3.962 ± 0.503
5.68SerGly: 5.68 ± 0.954
2.377SerHis: 2.377 ± 0.515
2.642SerIle: 2.642 ± 1.334
2.51SerLys: 2.51 ± 0.586
8.189SerLeu: 8.189 ± 1.429
0.528SerMet: 0.528 ± 0.488
2.51SerAsn: 2.51 ± 1.035
5.68SerPro: 5.68 ± 0.735
2.377SerGln: 2.377 ± 0.405
4.887SerArg: 4.887 ± 0.921
6.472SerSer: 6.472 ± 1.776
5.151SerThr: 5.151 ± 1.076
6.472SerVal: 6.472 ± 0.614
0.925SerTrp: 0.925 ± 0.25
2.51SerTyr: 2.51 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
6.208ThrAla: 6.208 ± 1.053
2.377ThrCys: 2.377 ± 0.383
1.717ThrAsp: 1.717 ± 0.386
1.189ThrGlu: 1.189 ± 0.254
3.962ThrPhe: 3.962 ± 0.699
6.736ThrGly: 6.736 ± 0.762
2.642ThrHis: 2.642 ± 0.865
4.095ThrIle: 4.095 ± 0.947
2.51ThrLys: 2.51 ± 0.53
4.359ThrLeu: 4.359 ± 0.58
1.057ThrMet: 1.057 ± 0.316
2.774ThrAsn: 2.774 ± 0.343
7.0ThrPro: 7.0 ± 0.797
2.642ThrGln: 2.642 ± 0.625
3.038ThrArg: 3.038 ± 0.717
3.83ThrSer: 3.83 ± 1.282
3.962ThrThr: 3.962 ± 0.954
5.812ThrVal: 5.812 ± 0.859
0.792ThrTrp: 0.792 ± 0.274
2.245ThrTyr: 2.245 ± 0.63
0.0ThrXaa: 0.0 ± 0.0
Val
8.453ValAla: 8.453 ± 1.053
3.302ValCys: 3.302 ± 0.6
4.623ValAsp: 4.623 ± 1.05
2.774ValGlu: 2.774 ± 0.31
2.906ValPhe: 2.906 ± 0.64
3.566ValGly: 3.566 ± 0.51
1.717ValHis: 1.717 ± 0.281
3.698ValIle: 3.698 ± 0.416
2.642ValLys: 2.642 ± 0.467
8.453ValLeu: 8.453 ± 0.862
1.321ValMet: 1.321 ± 0.318
2.113ValAsn: 2.113 ± 0.366
4.491ValPro: 4.491 ± 0.908
1.453ValGln: 1.453 ± 0.302
4.887ValArg: 4.887 ± 0.67
5.68ValSer: 5.68 ± 0.69
6.868ValThr: 6.868 ± 0.991
8.189ValVal: 8.189 ± 0.867
1.453ValTrp: 1.453 ± 0.313
2.774ValTyr: 2.774 ± 0.491
0.0ValXaa: 0.0 ± 0.0
Trp
0.528TrpAla: 0.528 ± 0.348
0.0TrpCys: 0.0 ± 0.0
0.66TrpAsp: 0.66 ± 0.221
0.132TrpGlu: 0.132 ± 0.202
0.792TrpPhe: 0.792 ± 0.22
0.925TrpGly: 0.925 ± 0.22
0.0TrpHis: 0.0 ± 0.0
0.66TrpIle: 0.66 ± 0.16
0.264TrpLys: 0.264 ± 0.088
1.849TrpLeu: 1.849 ± 0.347
0.264TrpMet: 0.264 ± 0.174
0.132TrpAsn: 0.132 ± 0.083
0.528TrpPro: 0.528 ± 0.198
0.66TrpGln: 0.66 ± 0.257
0.66TrpArg: 0.66 ± 0.205
0.925TrpSer: 0.925 ± 0.593
0.66TrpThr: 0.66 ± 0.454
1.057TrpVal: 1.057 ± 0.802
0.0TrpTrp: 0.0 ± 0.0
0.792TrpTyr: 0.792 ± 0.265
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.717TyrAla: 1.717 ± 0.711
1.321TyrCys: 1.321 ± 0.279
1.189TyrAsp: 1.189 ± 0.387
0.925TyrGlu: 0.925 ± 0.288
2.51TyrPhe: 2.51 ± 0.459
1.717TyrGly: 1.717 ± 0.781
0.925TyrHis: 0.925 ± 0.682
2.51TyrIle: 2.51 ± 0.41
0.396TyrLys: 0.396 ± 0.273
4.227TyrLeu: 4.227 ± 0.744
0.528TyrMet: 0.528 ± 0.207
0.925TyrAsn: 0.925 ± 0.363
1.981TyrPro: 1.981 ± 0.615
1.057TyrGln: 1.057 ± 0.241
1.849TyrArg: 1.849 ± 0.423
2.377TyrSer: 2.377 ± 0.306
1.981TyrThr: 1.981 ± 0.564
3.038TyrVal: 3.038 ± 0.368
0.66TyrTrp: 0.66 ± 0.184
1.585TyrTyr: 1.585 ± 1.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (7572 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
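A minimal sketch of what protein-level bootstrapping could look like is given below. The page only states "bootstrapping (x100) at the protein level", so the details here are assumptions: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the error is taken as the standard deviation across resamples. The names `pair_per_mille` and `bootstrap_error` are hypothetical, and sequences are assumed to use one-letter codes.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. 'LL') in per mille over all proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    frequency across `rounds` resamples of whole proteins.
    """
    rng = random.Random(seed)               # fixed seed for reproducibility
    estimates = []
    for _ in range(rounds):
        resample = rng.choices(proteins, k=len(proteins))  # with replacement
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "CCCC", "AACC"]  # toy input, not real data
    value = pair_per_mille(toy_proteome, "AA")
    error = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {value:.3f} ± {error:.3f}")
```

Because whole proteins are resampled, a dipeptide that never occurs in any protein gets an error of exactly zero, which matches the "0.0 ± 0.0" entries in the table.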

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski