Amino acid dipepetide frequency for Berrimah virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.843AlaAla: 0.843 ± 0.467
0.843AlaCys: 0.843 ± 0.402
1.897AlaAsp: 1.897 ± 0.439
1.476AlaGlu: 1.476 ± 0.572
1.265AlaPhe: 1.265 ± 0.484
1.686AlaGly: 1.686 ± 0.655
1.054AlaHis: 1.054 ± 0.274
2.53AlaIle: 2.53 ± 0.848
1.476AlaLys: 1.476 ± 0.542
2.74AlaLeu: 2.74 ± 0.558
1.686AlaMet: 1.686 ± 0.546
1.897AlaAsn: 1.897 ± 0.573
1.054AlaPro: 1.054 ± 0.619
0.843AlaGln: 0.843 ± 0.292
1.686AlaArg: 1.686 ± 0.657
1.265AlaSer: 1.265 ± 0.384
1.054AlaThr: 1.054 ± 0.466
1.897AlaVal: 1.897 ± 0.771
0.422AlaTrp: 0.422 ± 0.331
1.686AlaTyr: 1.686 ± 0.852
0.0AlaXaa: 0.0 ± 0.0
Cys
0.422CysAla: 0.422 ± 0.193
0.0CysCys: 0.0 ± 0.0
0.843CysAsp: 0.843 ± 0.304
0.422CysGlu: 0.422 ± 0.309
0.422CysPhe: 0.422 ± 0.18
1.476CysGly: 1.476 ± 0.429
1.054CysHis: 1.054 ± 0.488
0.632CysIle: 0.632 ± 0.297
1.265CysLys: 1.265 ± 0.621
2.74CysLeu: 2.74 ± 0.664
0.422CysMet: 0.422 ± 0.354
1.476CysAsn: 1.476 ± 0.369
0.632CysPro: 0.632 ± 0.417
1.265CysGln: 1.265 ± 0.366
0.843CysArg: 0.843 ± 0.458
1.054CysSer: 1.054 ± 0.422
1.054CysThr: 1.054 ± 0.433
0.843CysVal: 0.843 ± 0.215
0.843CysTrp: 0.843 ± 0.516
0.843CysTyr: 0.843 ± 0.516
0.0CysXaa: 0.0 ± 0.0
Asp
2.108AspAla: 2.108 ± 0.508
0.422AspCys: 0.422 ± 0.225
3.583AspAsp: 3.583 ± 1.094
5.059AspGlu: 5.059 ± 1.707
1.897AspPhe: 1.897 ± 0.551
3.373AspGly: 3.373 ± 0.799
1.476AspHis: 1.476 ± 0.634
4.848AspIle: 4.848 ± 1.652
4.427AspLys: 4.427 ± 0.818
5.27AspLeu: 5.27 ± 0.989
2.53AspMet: 2.53 ± 0.394
2.951AspAsn: 2.951 ± 0.407
1.476AspPro: 1.476 ± 0.395
2.319AspGln: 2.319 ± 0.698
2.108AspArg: 2.108 ± 0.36
3.373AspSer: 3.373 ± 0.342
1.686AspThr: 1.686 ± 0.716
3.373AspVal: 3.373 ± 0.763
1.265AspTrp: 1.265 ± 0.628
2.53AspTyr: 2.53 ± 0.871
0.0AspXaa: 0.0 ± 0.0
Glu
1.265GluAla: 1.265 ± 0.432
1.686GluCys: 1.686 ± 0.437
6.113GluAsp: 6.113 ± 1.812
7.378GluGlu: 7.378 ± 1.241
5.059GluPhe: 5.059 ± 0.522
4.848GluGly: 4.848 ± 0.761
1.686GluHis: 1.686 ± 1.016
7.589GluIle: 7.589 ± 2.128
4.848GluLys: 4.848 ± 0.694
8.432GluLeu: 8.432 ± 0.934
1.897GluMet: 1.897 ± 0.598
4.637GluAsn: 4.637 ± 1.13
1.686GluPro: 1.686 ± 0.382
0.843GluGln: 0.843 ± 0.404
3.794GluArg: 3.794 ± 0.617
7.589GluSer: 7.589 ± 0.921
2.108GluThr: 2.108 ± 0.532
4.848GluVal: 4.848 ± 0.515
0.632GluTrp: 0.632 ± 0.254
4.216GluTyr: 4.216 ± 0.907
0.0GluXaa: 0.0 ± 0.0
Phe
0.632PheAla: 0.632 ± 0.286
1.265PheCys: 1.265 ± 0.52
2.951PheAsp: 2.951 ± 0.768
2.74PheGlu: 2.74 ± 1.0
2.74PhePhe: 2.74 ± 0.54
3.162PheGly: 3.162 ± 0.938
0.0PheHis: 0.0 ± 0.0
2.74PheIle: 2.74 ± 0.431
2.951PheLys: 2.951 ± 0.273
4.216PheLeu: 4.216 ± 1.04
0.843PheMet: 0.843 ± 0.283
1.476PheAsn: 1.476 ± 0.461
1.054PhePro: 1.054 ± 0.645
1.265PheGln: 1.265 ± 0.48
1.897PheArg: 1.897 ± 0.719
2.53PheSer: 2.53 ± 0.882
2.53PheThr: 2.53 ± 0.962
2.74PheVal: 2.74 ± 0.71
0.843PheTrp: 0.843 ± 0.562
1.265PheTyr: 1.265 ± 0.299
0.0PheXaa: 0.0 ± 0.0
Gly
2.108GlyAla: 2.108 ± 0.639
1.265GlyCys: 1.265 ± 0.773
2.108GlyAsp: 2.108 ± 0.527
5.902GlyGlu: 5.902 ± 0.986
2.53GlyPhe: 2.53 ± 0.653
2.951GlyGly: 2.951 ± 0.508
1.476GlyHis: 1.476 ± 0.468
5.902GlyIle: 5.902 ± 0.846
4.427GlyLys: 4.427 ± 1.07
6.324GlyLeu: 6.324 ± 2.08
1.265GlyMet: 1.265 ± 0.494
3.162GlyAsn: 3.162 ± 0.651
0.843GlyPro: 0.843 ± 0.259
0.843GlyGln: 0.843 ± 0.352
3.373GlyArg: 3.373 ± 1.124
3.794GlySer: 3.794 ± 0.539
3.583GlyThr: 3.583 ± 1.025
1.897GlyVal: 1.897 ± 0.365
0.632GlyTrp: 0.632 ± 0.219
1.265GlyTyr: 1.265 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
0.632HisAla: 0.632 ± 0.471
0.211HisCys: 0.211 ± 0.225
0.211HisAsp: 0.211 ± 0.129
2.74HisGlu: 2.74 ± 0.527
1.054HisPhe: 1.054 ± 0.554
1.265HisGly: 1.265 ± 0.405
0.632HisHis: 0.632 ± 0.386
1.897HisIle: 1.897 ± 0.26
2.53HisLys: 2.53 ± 0.445
1.897HisLeu: 1.897 ± 0.727
0.422HisMet: 0.422 ± 0.528
1.265HisAsn: 1.265 ± 0.355
1.265HisPro: 1.265 ± 0.401
0.632HisGln: 0.632 ± 0.547
1.476HisArg: 1.476 ± 0.387
0.632HisSer: 0.632 ± 0.26
0.422HisThr: 0.422 ± 0.449
1.265HisVal: 1.265 ± 0.716
0.632HisTrp: 0.632 ± 0.233
1.054HisTyr: 1.054 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
1.476IleAla: 1.476 ± 0.547
2.53IleCys: 2.53 ± 0.771
5.691IleAsp: 5.691 ± 0.713
8.01IleGlu: 8.01 ± 1.774
2.319IlePhe: 2.319 ± 0.748
3.794IleGly: 3.794 ± 0.995
1.476IleHis: 1.476 ± 0.472
8.642IleIle: 8.642 ± 1.817
11.594IleLys: 11.594 ± 1.181
7.378IleLeu: 7.378 ± 0.993
1.476IleMet: 1.476 ± 0.513
6.745IleAsn: 6.745 ± 0.824
2.74IlePro: 2.74 ± 0.625
2.108IleGln: 2.108 ± 0.87
5.481IleArg: 5.481 ± 0.905
5.902IleSer: 5.902 ± 1.143
1.897IleThr: 1.897 ± 0.849
3.373IleVal: 3.373 ± 0.563
1.054IleTrp: 1.054 ± 0.363
3.794IleTyr: 3.794 ± 0.827
0.0IleXaa: 0.0 ± 0.0
Lys
2.53LysAla: 2.53 ± 0.928
2.108LysCys: 2.108 ± 0.412
4.427LysAsp: 4.427 ± 1.07
8.221LysGlu: 8.221 ± 1.566
2.74LysPhe: 2.74 ± 0.383
5.691LysGly: 5.691 ± 0.84
1.265LysHis: 1.265 ± 0.486
8.01LysIle: 8.01 ± 1.015
7.167LysLys: 7.167 ± 0.974
9.064LysLeu: 9.064 ± 1.205
1.897LysMet: 1.897 ± 0.491
6.113LysAsn: 6.113 ± 1.017
3.373LysPro: 3.373 ± 0.525
1.265LysGln: 1.265 ± 0.422
4.637LysArg: 4.637 ± 0.988
8.432LysSer: 8.432 ± 1.383
2.74LysThr: 2.74 ± 0.606
4.005LysVal: 4.005 ± 0.7
1.054LysTrp: 1.054 ± 0.36
3.373LysTyr: 3.373 ± 1.527
0.0LysXaa: 0.0 ± 0.0
Leu
3.373LeuAla: 3.373 ± 0.573
1.476LeuCys: 1.476 ± 0.412
7.589LeuAsp: 7.589 ± 1.167
7.799LeuGlu: 7.799 ± 0.723
3.583LeuPhe: 3.583 ± 0.834
5.27LeuGly: 5.27 ± 1.179
1.686LeuHis: 1.686 ± 0.287
8.01LeuIle: 8.01 ± 1.292
8.432LeuLys: 8.432 ± 0.73
8.01LeuLeu: 8.01 ± 1.522
2.53LeuMet: 2.53 ± 0.467
6.324LeuAsn: 6.324 ± 0.704
2.53LeuPro: 2.53 ± 0.503
3.583LeuGln: 3.583 ± 0.496
4.848LeuArg: 4.848 ± 1.715
6.535LeuSer: 6.535 ± 0.997
5.902LeuThr: 5.902 ± 0.745
4.005LeuVal: 4.005 ± 0.803
1.265LeuTrp: 1.265 ± 0.382
2.319LeuTyr: 2.319 ± 0.523
0.0LeuXaa: 0.0 ± 0.0
Met
1.265MetAla: 1.265 ± 0.302
0.843MetCys: 0.843 ± 0.215
2.319MetAsp: 2.319 ± 0.874
1.265MetGlu: 1.265 ± 0.198
1.476MetPhe: 1.476 ± 0.38
1.686MetGly: 1.686 ± 0.433
0.211MetHis: 0.211 ± 0.225
2.108MetIle: 2.108 ± 0.587
0.843MetLys: 0.843 ± 0.428
2.53MetLeu: 2.53 ± 0.704
1.476MetMet: 1.476 ± 0.505
2.108MetAsn: 2.108 ± 0.449
1.265MetPro: 1.265 ± 0.458
0.0MetGln: 0.0 ± 0.0
1.265MetArg: 1.265 ± 0.395
1.897MetSer: 1.897 ± 0.329
1.265MetThr: 1.265 ± 0.448
1.897MetVal: 1.897 ± 0.429
0.422MetTrp: 0.422 ± 0.258
0.422MetTyr: 0.422 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
2.319AsnAla: 2.319 ± 0.943
1.265AsnCys: 1.265 ± 0.437
3.373AsnAsp: 3.373 ± 0.838
4.637AsnGlu: 4.637 ± 0.37
2.951AsnPhe: 2.951 ± 0.468
2.951AsnGly: 2.951 ± 0.969
2.108AsnHis: 2.108 ± 0.699
4.216AsnIle: 4.216 ± 0.586
8.01AsnLys: 8.01 ± 1.317
7.589AsnLeu: 7.589 ± 0.951
1.265AsnMet: 1.265 ± 0.259
4.637AsnAsn: 4.637 ± 0.814
1.476AsnPro: 1.476 ± 0.329
3.162AsnGln: 3.162 ± 0.704
1.265AsnArg: 1.265 ± 0.284
3.373AsnSer: 3.373 ± 1.163
2.74AsnThr: 2.74 ± 0.437
3.162AsnVal: 3.162 ± 0.899
2.108AsnTrp: 2.108 ± 0.475
3.373AsnTyr: 3.373 ± 1.358
0.0AsnXaa: 0.0 ± 0.0
Pro
1.054ProAla: 1.054 ± 0.267
0.422ProCys: 0.422 ± 0.18
1.897ProAsp: 1.897 ± 0.532
2.108ProGlu: 2.108 ± 0.689
1.054ProPhe: 1.054 ± 0.36
1.265ProGly: 1.265 ± 0.416
1.054ProHis: 1.054 ± 0.274
2.53ProIle: 2.53 ± 1.019
2.108ProLys: 2.108 ± 0.71
2.53ProLeu: 2.53 ± 0.823
0.632ProMet: 0.632 ± 0.375
1.476ProAsn: 1.476 ± 0.42
2.108ProPro: 2.108 ± 0.769
1.476ProGln: 1.476 ± 0.801
1.265ProArg: 1.265 ± 0.381
2.108ProSer: 2.108 ± 0.734
2.108ProThr: 2.108 ± 0.594
2.108ProVal: 2.108 ± 0.498
0.632ProTrp: 0.632 ± 0.27
1.897ProTyr: 1.897 ± 0.628
0.0ProXaa: 0.0 ± 0.0
Gln
0.422GlnAla: 0.422 ± 0.367
0.843GlnCys: 0.843 ± 0.352
0.632GlnAsp: 0.632 ± 0.301
2.53GlnGlu: 2.53 ± 0.78
1.054GlnPhe: 1.054 ± 0.575
1.897GlnGly: 1.897 ± 0.253
1.265GlnHis: 1.265 ± 0.757
2.74GlnIle: 2.74 ± 0.452
2.74GlnLys: 2.74 ± 0.667
2.319GlnLeu: 2.319 ± 0.504
0.632GlnMet: 0.632 ± 0.26
2.74GlnAsn: 2.74 ± 0.742
0.632GlnPro: 0.632 ± 0.219
0.422GlnGln: 0.422 ± 0.258
1.476GlnArg: 1.476 ± 0.576
0.843GlnSer: 0.843 ± 0.386
1.686GlnThr: 1.686 ± 0.282
1.265GlnVal: 1.265 ± 0.458
0.0GlnTrp: 0.0 ± 0.0
0.422GlnTyr: 0.422 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
1.054ArgAla: 1.054 ± 0.339
1.054ArgCys: 1.054 ± 0.495
1.897ArgAsp: 1.897 ± 0.373
4.427ArgGlu: 4.427 ± 0.482
2.53ArgPhe: 2.53 ± 0.722
4.005ArgGly: 4.005 ± 0.832
0.632ArgHis: 0.632 ± 0.387
3.583ArgIle: 3.583 ± 0.929
5.481ArgLys: 5.481 ± 1.075
3.162ArgLeu: 3.162 ± 0.363
0.843ArgMet: 0.843 ± 0.627
2.951ArgAsn: 2.951 ± 0.514
2.74ArgPro: 2.74 ± 0.434
0.632ArgGln: 0.632 ± 0.253
2.319ArgArg: 2.319 ± 0.9
4.637ArgSer: 4.637 ± 0.647
2.108ArgThr: 2.108 ± 0.598
3.794ArgVal: 3.794 ± 0.599
0.843ArgTrp: 0.843 ± 0.274
1.476ArgTyr: 1.476 ± 0.49
0.0ArgXaa: 0.0 ± 0.0
Ser
2.74SerAla: 2.74 ± 0.504
1.054SerCys: 1.054 ± 0.411
3.583SerAsp: 3.583 ± 0.687
5.059SerGlu: 5.059 ± 0.811
3.162SerPhe: 3.162 ± 0.59
3.373SerGly: 3.373 ± 0.837
2.53SerHis: 2.53 ± 0.654
6.535SerIle: 6.535 ± 0.756
5.481SerLys: 5.481 ± 1.4
5.902SerLeu: 5.902 ± 1.505
2.74SerMet: 2.74 ± 0.751
5.059SerAsn: 5.059 ± 1.128
2.108SerPro: 2.108 ± 0.378
2.108SerGln: 2.108 ± 0.689
3.794SerArg: 3.794 ± 0.852
8.642SerSer: 8.642 ± 0.691
3.373SerThr: 3.373 ± 0.766
1.897SerVal: 1.897 ± 0.472
1.686SerTrp: 1.686 ± 0.49
4.216SerTyr: 4.216 ± 1.612
0.0SerXaa: 0.0 ± 0.0
Thr
0.843ThrAla: 0.843 ± 0.309
0.0ThrCys: 0.0 ± 0.0
1.476ThrAsp: 1.476 ± 0.286
3.373ThrGlu: 3.373 ± 0.394
0.843ThrPhe: 0.843 ± 0.253
1.897ThrGly: 1.897 ± 0.447
0.843ThrHis: 0.843 ± 0.398
5.481ThrIle: 5.481 ± 0.734
4.216ThrLys: 4.216 ± 0.513
2.74ThrLeu: 2.74 ± 0.843
1.476ThrMet: 1.476 ± 0.414
3.583ThrAsn: 3.583 ± 0.766
0.422ThrPro: 0.422 ± 0.309
0.422ThrGln: 0.422 ± 0.258
2.108ThrArg: 2.108 ± 0.468
3.794ThrSer: 3.794 ± 0.676
1.897ThrThr: 1.897 ± 0.518
3.373ThrVal: 3.373 ± 0.463
1.054ThrTrp: 1.054 ± 0.379
2.319ThrTyr: 2.319 ± 0.495
0.0ThrXaa: 0.0 ± 0.0
Val
1.897ValAla: 1.897 ± 0.773
0.422ValCys: 0.422 ± 0.258
2.319ValAsp: 2.319 ± 0.463
2.108ValGlu: 2.108 ± 0.43
1.265ValPhe: 1.265 ± 0.415
2.74ValGly: 2.74 ± 0.702
0.843ValHis: 0.843 ± 0.59
4.637ValIle: 4.637 ± 0.492
5.059ValLys: 5.059 ± 1.328
4.637ValLeu: 4.637 ± 0.616
1.265ValMet: 1.265 ± 0.348
3.583ValAsn: 3.583 ± 0.682
2.319ValPro: 2.319 ± 0.459
1.265ValGln: 1.265 ± 0.621
2.74ValArg: 2.74 ± 0.677
5.27ValSer: 5.27 ± 0.638
2.74ValThr: 2.74 ± 0.794
1.476ValVal: 1.476 ± 0.793
1.265ValTrp: 1.265 ± 0.483
1.686ValTyr: 1.686 ± 0.645
0.0ValXaa: 0.0 ± 0.0
Trp
0.422TrpAla: 0.422 ± 0.361
0.422TrpCys: 0.422 ± 0.258
0.422TrpAsp: 0.422 ± 0.214
2.74TrpGlu: 2.74 ± 0.996
1.054TrpPhe: 1.054 ± 0.274
1.265TrpGly: 1.265 ± 0.428
0.211TrpHis: 0.211 ± 0.129
2.319TrpIle: 2.319 ± 0.414
0.843TrpLys: 0.843 ± 0.276
1.054TrpLeu: 1.054 ± 0.457
0.0TrpMet: 0.0 ± 0.0
1.265TrpAsn: 1.265 ± 0.198
0.422TrpPro: 0.422 ± 0.258
0.422TrpGln: 0.422 ± 0.18
0.843TrpArg: 0.843 ± 0.38
1.265TrpSer: 1.265 ± 0.277
0.422TrpThr: 0.422 ± 0.364
1.476TrpVal: 1.476 ± 0.468
0.422TrpTrp: 0.422 ± 0.364
0.843TrpTyr: 0.843 ± 0.404
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.897TyrAla: 1.897 ± 0.742
0.422TyrCys: 0.422 ± 0.312
2.74TyrAsp: 2.74 ± 0.795
3.162TyrGlu: 3.162 ± 0.483
0.843TyrPhe: 0.843 ± 0.31
1.054TyrGly: 1.054 ± 0.418
0.843TyrHis: 0.843 ± 0.351
2.53TyrIle: 2.53 ± 0.758
4.216TyrLys: 4.216 ± 0.668
6.535TyrLeu: 6.535 ± 1.203
1.054TyrMet: 1.054 ± 0.552
2.53TyrAsn: 2.53 ± 0.55
1.686TyrPro: 1.686 ± 0.559
1.686TyrGln: 1.686 ± 0.715
2.951TyrArg: 2.951 ± 0.614
2.108TyrSer: 2.108 ± 0.537
0.843TyrThr: 0.843 ± 0.402
0.843TyrVal: 0.843 ± 0.414
1.054TyrTrp: 1.054 ± 0.431
1.686TyrTyr: 1.686 ± 0.315
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4745 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski