Amino acid dipepetide frequency for Oyo virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.764AlaAla: 0.764 ± 0.292
0.255AlaCys: 0.255 ± 0.206
1.529AlaAsp: 1.529 ± 1.01
2.803AlaGlu: 2.803 ± 0.857
0.764AlaPhe: 0.764 ± 0.144
1.529AlaGly: 1.529 ± 0.39
0.764AlaHis: 0.764 ± 0.539
2.803AlaIle: 2.803 ± 0.96
3.057AlaLys: 3.057 ± 2.021
2.548AlaLeu: 2.548 ± 0.261
2.038AlaMet: 2.038 ± 0.436
2.803AlaAsn: 2.803 ± 1.091
0.51AlaPro: 0.51 ± 0.548
0.51AlaGln: 0.51 ± 0.29
2.803AlaArg: 2.803 ± 0.857
2.548AlaSer: 2.548 ± 0.408
2.038AlaThr: 2.038 ± 1.161
1.783AlaVal: 1.783 ± 0.348
0.51AlaTrp: 0.51 ± 0.29
2.038AlaTyr: 2.038 ± 0.567
0.0AlaXaa: 0.0 ± 0.0
Cys
1.019CysAla: 1.019 ± 0.569
0.0CysCys: 0.0 ± 0.0
0.51CysAsp: 0.51 ± 0.103
1.783CysGlu: 1.783 ± 1.111
1.274CysPhe: 1.274 ± 0.386
1.529CysGly: 1.529 ± 1.238
0.255CysHis: 0.255 ± 0.206
3.057CysIle: 3.057 ± 1.168
2.038CysLys: 2.038 ± 1.317
1.783CysLeu: 1.783 ± 0.786
1.019CysMet: 1.019 ± 0.464
2.038CysAsn: 2.038 ± 0.338
0.764CysPro: 0.764 ± 0.144
0.51CysGln: 0.51 ± 0.103
1.019CysArg: 1.019 ± 0.825
2.548CysSer: 2.548 ± 0.772
2.038CysThr: 2.038 ± 0.677
1.529CysVal: 1.529 ± 0.905
0.0CysTrp: 0.0 ± 0.0
2.038CysTyr: 2.038 ± 0.99
0.0CysXaa: 0.0 ± 0.0
Asp
1.783AspAla: 1.783 ± 0.695
1.019AspCys: 1.019 ± 0.495
3.312AspAsp: 3.312 ± 0.725
5.35AspGlu: 5.35 ± 1.348
3.567AspPhe: 3.567 ± 1.707
2.803AspGly: 2.803 ± 0.857
1.019AspHis: 1.019 ± 0.206
4.841AspIle: 4.841 ± 0.149
3.822AspLys: 3.822 ± 0.946
5.096AspLeu: 5.096 ± 0.87
1.783AspMet: 1.783 ± 0.822
3.057AspAsn: 3.057 ± 0.576
2.038AspPro: 2.038 ± 0.54
1.529AspGln: 1.529 ± 0.551
1.529AspArg: 1.529 ± 0.871
2.803AspSer: 2.803 ± 0.332
2.038AspThr: 2.038 ± 0.332
3.822AspVal: 3.822 ± 0.766
0.51AspTrp: 0.51 ± 0.548
3.057AspTyr: 3.057 ± 0.773
0.0AspXaa: 0.0 ± 0.0
Glu
2.293GluAla: 2.293 ± 0.998
3.057GluCys: 3.057 ± 1.811
3.567GluAsp: 3.567 ± 1.041
3.822GluGlu: 3.822 ± 0.946
4.076GluPhe: 4.076 ± 0.663
2.293GluGly: 2.293 ± 1.057
2.038GluHis: 2.038 ± 0.338
6.115GluIle: 6.115 ± 0.518
6.115GluLys: 6.115 ± 1.364
5.096GluLeu: 5.096 ± 1.704
2.803GluMet: 2.803 ± 0.646
3.567GluAsn: 3.567 ± 0.332
2.293GluPro: 2.293 ± 0.732
0.764GluGln: 0.764 ± 0.435
2.038GluArg: 2.038 ± 0.839
5.86GluSer: 5.86 ± 0.324
3.567GluThr: 3.567 ± 0.619
3.567GluVal: 3.567 ± 0.332
0.255GluTrp: 0.255 ± 0.145
3.057GluTyr: 3.057 ± 0.773
0.0GluXaa: 0.0 ± 0.0
Phe
2.293PheAla: 2.293 ± 0.429
1.274PheCys: 1.274 ± 0.386
3.567PheAsp: 3.567 ± 0.619
3.312PheGlu: 3.312 ± 0.567
2.548PhePhe: 2.548 ± 0.298
2.038PheGly: 2.038 ± 1.168
1.274PheHis: 1.274 ± 0.204
3.822PheIle: 3.822 ± 1.227
5.096PheLys: 5.096 ± 0.762
5.35PheLeu: 5.35 ± 1.037
0.764PheMet: 0.764 ± 1.194
2.293PheAsn: 2.293 ± 0.261
1.529PhePro: 1.529 ± 0.466
2.548PheGln: 2.548 ± 0.408
2.548PheArg: 2.548 ± 0.298
3.567PheSer: 3.567 ± 0.671
3.312PheThr: 3.312 ± 0.238
2.038PheVal: 2.038 ± 0.332
0.764PheTrp: 0.764 ± 0.144
3.312PheTyr: 3.312 ± 0.539
0.0PheXaa: 0.0 ± 0.0
Gly
0.764GlyAla: 0.764 ± 0.292
1.783GlyCys: 1.783 ± 0.484
3.312GlyAsp: 3.312 ± 0.814
2.293GlyGlu: 2.293 ± 0.432
1.783GlyPhe: 1.783 ± 0.708
1.783GlyGly: 1.783 ± 0.484
1.019GlyHis: 1.019 ± 0.495
4.076GlyIle: 4.076 ± 0.732
3.057GlyLys: 3.057 ± 0.159
3.057GlyLeu: 3.057 ± 0.78
0.255GlyMet: 0.255 ± 0.593
3.057GlyAsn: 3.057 ± 0.617
1.274GlyPro: 1.274 ± 0.475
1.783GlyGln: 1.783 ± 0.441
1.019GlyArg: 1.019 ± 0.569
1.783GlySer: 1.783 ± 0.441
3.057GlyThr: 3.057 ± 2.33
3.057GlyVal: 3.057 ± 0.299
0.51GlyTrp: 0.51 ± 0.103
1.529GlyTyr: 1.529 ± 0.466
0.0GlyXaa: 0.0 ± 0.0
His
0.51HisAla: 0.51 ± 0.413
0.51HisCys: 0.51 ± 0.413
0.51HisAsp: 0.51 ± 0.103
2.293HisGlu: 2.293 ± 0.432
1.274HisPhe: 1.274 ± 0.204
1.019HisGly: 1.019 ± 0.206
0.51HisHis: 0.51 ± 0.103
2.293HisIle: 2.293 ± 1.523
3.057HisLys: 3.057 ± 0.622
1.019HisLeu: 1.019 ± 0.27
1.274HisMet: 1.274 ± 0.615
0.51HisAsn: 0.51 ± 0.103
0.764HisPro: 0.764 ± 0.292
0.51HisGln: 0.51 ± 0.413
2.038HisArg: 2.038 ± 0.411
2.293HisSer: 2.293 ± 0.383
1.529HisThr: 1.529 ± 0.584
0.764HisVal: 0.764 ± 0.292
0.51HisTrp: 0.51 ± 0.103
1.529HisTyr: 1.529 ± 1.165
0.0HisXaa: 0.0 ± 0.0
Ile
3.057IleAla: 3.057 ± 0.441
1.529IleCys: 1.529 ± 0.288
6.115IleAsp: 6.115 ± 0.352
7.389IleGlu: 7.389 ± 1.335
3.822IlePhe: 3.822 ± 1.213
3.312IleGly: 3.312 ± 0.785
2.548IleHis: 2.548 ± 0.772
7.643IleIle: 7.643 ± 2.034
5.605IleLys: 5.605 ± 1.494
10.191IleLeu: 10.191 ± 0.772
1.529IleMet: 1.529 ± 0.309
7.898IleAsn: 7.898 ± 1.017
2.548IlePro: 2.548 ± 0.514
2.803IleGln: 2.803 ± 0.767
4.331IleArg: 4.331 ± 1.511
5.35IleSer: 5.35 ± 1.192
4.331IleThr: 4.331 ± 0.596
5.86IleVal: 5.86 ± 0.537
1.019IleTrp: 1.019 ± 0.206
4.076IleTyr: 4.076 ± 1.415
0.0IleXaa: 0.0 ± 0.0
Lys
2.803LysAla: 2.803 ± 0.332
2.548LysCys: 2.548 ± 1.729
4.331LysAsp: 4.331 ± 0.437
5.86LysGlu: 5.86 ± 1.056
4.076LysPhe: 4.076 ± 1.962
4.331LysGly: 4.331 ± 1.351
2.803LysHis: 2.803 ± 0.677
5.35LysIle: 5.35 ± 1.015
5.86LysLys: 5.86 ± 0.661
6.624LysLeu: 6.624 ± 1.337
2.293LysMet: 2.293 ± 0.383
6.115LysAsn: 6.115 ± 1.179
2.803LysPro: 2.803 ± 0.897
3.312LysGln: 3.312 ± 0.806
3.567LysArg: 3.567 ± 1.041
7.134LysSer: 7.134 ± 0.787
5.86LysThr: 5.86 ± 0.324
4.586LysVal: 4.586 ± 1.151
0.764LysTrp: 0.764 ± 0.435
4.331LysTyr: 4.331 ± 0.761
0.0LysXaa: 0.0 ± 0.0
Leu
4.076LeuAla: 4.076 ± 1.879
2.803LeuCys: 2.803 ± 1.605
3.312LeuAsp: 3.312 ± 1.246
5.096LeuGlu: 5.096 ± 1.545
4.586LeuPhe: 4.586 ± 0.919
2.548LeuGly: 2.548 ± 0.514
2.038LeuHis: 2.038 ± 0.411
5.35LeuIle: 5.35 ± 0.381
9.427LeuLys: 9.427 ± 0.65
7.134LeuLeu: 7.134 ± 1.407
1.529LeuMet: 1.529 ± 0.359
4.841LeuAsn: 4.841 ± 0.814
4.076LeuPro: 4.076 ± 0.391
2.038LeuGln: 2.038 ± 0.54
2.548LeuArg: 2.548 ± 0.261
9.682LeuSer: 9.682 ± 1.787
5.86LeuThr: 5.86 ± 1.246
2.548LeuVal: 2.548 ± 0.514
0.51LeuTrp: 0.51 ± 0.29
4.586LeuTyr: 4.586 ± 0.926
0.0LeuXaa: 0.0 ± 0.0
Met
0.764MetAla: 0.764 ± 0.539
1.019MetCys: 1.019 ± 0.206
1.783MetAsp: 1.783 ± 1.099
1.529MetGlu: 1.529 ± 0.288
1.529MetPhe: 1.529 ± 0.566
0.255MetGly: 0.255 ± 0.145
1.019MetHis: 1.019 ± 0.825
2.548MetIle: 2.548 ± 1.127
1.529MetLys: 1.529 ± 0.718
2.038MetLeu: 2.038 ± 0.332
0.51MetMet: 0.51 ± 0.548
1.783MetAsn: 1.783 ± 0.484
1.529MetPro: 1.529 ± 0.566
1.274MetGln: 1.274 ± 0.475
1.274MetArg: 1.274 ± 0.442
3.822MetSer: 3.822 ± 0.648
2.293MetThr: 2.293 ± 0.998
0.0MetVal: 0.0 ± 0.0
0.255MetTrp: 0.255 ± 0.206
1.019MetTyr: 1.019 ± 0.569
0.0MetXaa: 0.0 ± 0.0
Asn
3.567AsnAla: 3.567 ± 0.696
1.529AsnCys: 1.529 ± 0.584
4.076AsnAsp: 4.076 ± 1.318
2.803AsnGlu: 2.803 ± 0.646
3.822AsnPhe: 3.822 ± 0.946
2.803AsnGly: 2.803 ± 1.44
1.274AsnHis: 1.274 ± 0.993
6.879AsnIle: 6.879 ± 2.322
6.369AsnLys: 6.369 ± 0.907
5.86AsnLeu: 5.86 ± 1.048
2.038AsnMet: 2.038 ± 0.332
4.076AsnAsn: 4.076 ± 0.294
3.567AsnPro: 3.567 ± 1.128
1.783AsnGln: 1.783 ± 0.53
2.293AsnArg: 2.293 ± 0.73
5.605AsnSer: 5.605 ± 1.12
4.331AsnThr: 4.331 ± 0.782
1.783AsnVal: 1.783 ± 1.111
1.274AsnTrp: 1.274 ± 0.386
3.567AsnTyr: 3.567 ± 1.707
0.0AsnXaa: 0.0 ± 0.0
Pro
1.529ProAla: 1.529 ± 1.054
0.0ProCys: 0.0 ± 0.0
1.783ProAsp: 1.783 ± 1.016
2.293ProGlu: 2.293 ± 0.583
0.764ProPhe: 0.764 ± 0.539
1.529ProGly: 1.529 ± 1.079
0.0ProHis: 0.0 ± 0.0
4.331ProIle: 4.331 ± 1.277
2.038ProLys: 2.038 ± 1.576
2.038ProLeu: 2.038 ± 0.332
1.274ProMet: 1.274 ± 0.475
2.803ProAsn: 2.803 ± 0.968
0.255ProPro: 0.255 ± 0.145
0.764ProGln: 0.764 ± 0.292
0.51ProArg: 0.51 ± 0.548
2.548ProSer: 2.548 ± 0.408
2.038ProThr: 2.038 ± 0.338
3.567ProVal: 3.567 ± 0.578
0.51ProTrp: 0.51 ± 0.29
1.274ProTyr: 1.274 ± 0.204
0.0ProXaa: 0.0 ± 0.0
Gln
1.274GlnAla: 1.274 ± 0.475
0.255GlnCys: 0.255 ± 0.206
2.293GlnAsp: 2.293 ± 0.43
1.019GlnGlu: 1.019 ± 0.464
2.038GlnPhe: 2.038 ± 0.54
1.783GlnGly: 1.783 ± 0.786
0.51GlnHis: 0.51 ± 0.413
2.548GlnIle: 2.548 ± 0.53
3.312GlnLys: 3.312 ± 0.691
1.529GlnLeu: 1.529 ± 0.288
0.0GlnMet: 0.0 ± 0.0
1.529GlnAsn: 1.529 ± 0.566
1.274GlnPro: 1.274 ± 0.386
0.764GlnGln: 0.764 ± 0.144
1.274GlnArg: 1.274 ± 0.409
1.019GlnSer: 1.019 ± 0.27
2.293GlnThr: 2.293 ± 0.43
1.783GlnVal: 1.783 ± 0.53
0.0GlnTrp: 0.0 ± 0.0
1.019GlnTyr: 1.019 ± 0.27
0.0GlnXaa: 0.0 ± 0.0
Arg
1.274ArgAla: 1.274 ± 0.631
1.529ArgCys: 1.529 ± 0.466
1.783ArgAsp: 1.783 ± 0.53
3.312ArgGlu: 3.312 ± 0.539
1.274ArgPhe: 1.274 ± 0.386
1.019ArgGly: 1.019 ± 0.495
2.038ArgHis: 2.038 ± 0.338
4.331ArgIle: 4.331 ± 0.437
1.783ArgLys: 1.783 ± 0.348
3.567ArgLeu: 3.567 ± 0.816
1.274ArgMet: 1.274 ± 0.475
3.822ArgAsn: 3.822 ± 0.495
0.764ArgPro: 0.764 ± 0.292
0.51ArgGln: 0.51 ± 0.548
1.783ArgArg: 1.783 ± 0.348
2.548ArgSer: 2.548 ± 0.852
1.783ArgThr: 1.783 ± 0.695
0.51ArgVal: 0.51 ± 0.548
0.0ArgTrp: 0.0 ± 0.0
1.274ArgTyr: 1.274 ± 0.726
0.0ArgXaa: 0.0 ± 0.0
Ser
2.293SerAla: 2.293 ± 0.429
3.312SerCys: 3.312 ± 1.689
3.822SerAsp: 3.822 ± 1.533
4.076SerGlu: 4.076 ± 1.081
3.822SerPhe: 3.822 ± 1.274
2.548SerGly: 2.548 ± 1.145
1.783SerHis: 1.783 ± 1.347
8.408SerIle: 8.408 ± 1.351
9.172SerLys: 9.172 ± 1.162
7.898SerLeu: 7.898 ± 2.465
1.274SerMet: 1.274 ± 0.475
5.86SerAsn: 5.86 ± 1.55
1.529SerPro: 1.529 ± 0.551
2.038SerGln: 2.038 ± 0.839
3.312SerArg: 3.312 ± 0.567
6.879SerSer: 6.879 ± 1.291
4.076SerThr: 4.076 ± 0.671
4.076SerVal: 4.076 ± 1.081
1.019SerTrp: 1.019 ± 0.27
1.783SerTyr: 1.783 ± 0.289
0.0SerXaa: 0.0 ± 0.0
Thr
1.783ThrAla: 1.783 ± 0.408
1.274ThrCys: 1.274 ± 0.386
3.567ThrAsp: 3.567 ± 0.332
5.096ThrGlu: 5.096 ± 0.445
4.331ThrPhe: 4.331 ± 1.554
2.038ThrGly: 2.038 ± 0.338
1.529ThrHis: 1.529 ± 0.905
7.389ThrIle: 7.389 ± 1.645
4.841ThrLys: 4.841 ± 1.402
4.076ThrLeu: 4.076 ± 0.671
1.274ThrMet: 1.274 ± 0.386
3.567ThrAsn: 3.567 ± 0.72
1.783ThrPro: 1.783 ± 0.348
0.764ThrGln: 0.764 ± 0.144
0.51ThrArg: 0.51 ± 0.103
4.586ThrSer: 4.586 ± 0.766
2.803ThrThr: 2.803 ± 0.19
3.567ThrVal: 3.567 ± 0.059
1.274ThrTrp: 1.274 ± 1.14
3.822ThrTyr: 3.822 ± 0.68
0.0ThrXaa: 0.0 ± 0.0
Val
1.783ValAla: 1.783 ± 1.016
1.274ValCys: 1.274 ± 0.442
2.803ValAsp: 2.803 ± 0.332
2.803ValGlu: 2.803 ± 0.442
3.822ValPhe: 3.822 ± 0.871
3.312ValGly: 3.312 ± 1.004
1.529ValHis: 1.529 ± 0.309
3.312ValIle: 3.312 ± 1.062
4.331ValLys: 4.331 ± 0.995
4.076ValLeu: 4.076 ± 0.675
2.293ValMet: 2.293 ± 0.43
3.822ValAsn: 3.822 ± 0.697
0.764ValPro: 0.764 ± 0.619
1.274ValGln: 1.274 ± 0.204
0.764ValArg: 0.764 ± 0.435
4.331ValSer: 4.331 ± 0.259
1.783ValThr: 1.783 ± 0.348
3.057ValVal: 3.057 ± 0.299
0.255ValTrp: 0.255 ± 0.206
3.057ValTyr: 3.057 ± 0.81
0.0ValXaa: 0.0 ± 0.0
Trp
0.255TrpAla: 0.255 ± 0.593
0.0TrpCys: 0.0 ± 0.0
0.51TrpAsp: 0.51 ± 0.103
0.764TrpGlu: 0.764 ± 0.144
0.764TrpPhe: 0.764 ± 0.144
0.764TrpGly: 0.764 ± 0.144
0.255TrpHis: 0.255 ± 0.206
0.255TrpIle: 0.255 ± 0.206
0.255TrpLys: 0.255 ± 0.145
1.783TrpLeu: 1.783 ± 0.348
0.255TrpMet: 0.255 ± 0.145
1.019TrpAsn: 1.019 ± 0.27
0.51TrpPro: 0.51 ± 0.413
0.51TrpGln: 0.51 ± 0.29
0.0TrpArg: 0.0 ± 0.0
1.529TrpSer: 1.529 ± 0.288
0.255TrpThr: 0.255 ± 0.593
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.255TrpTyr: 0.255 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.51TyrAla: 0.51 ± 0.29
1.783TyrCys: 1.783 ± 0.484
2.293TyrAsp: 2.293 ± 0.43
2.548TyrGlu: 2.548 ± 0.549
3.312TyrPhe: 3.312 ± 0.58
1.019TyrGly: 1.019 ± 0.27
0.764TyrHis: 0.764 ± 0.619
5.605TyrIle: 5.605 ± 0.895
4.586TyrLys: 4.586 ± 0.926
3.822TyrLeu: 3.822 ± 0.697
2.038TyrMet: 2.038 ± 0.839
4.841TyrAsn: 4.841 ± 0.397
1.274TyrPro: 1.274 ± 0.475
1.529TyrGln: 1.529 ± 0.288
1.019TyrArg: 1.019 ± 0.27
2.548TyrSer: 2.548 ± 0.818
4.586TyrThr: 4.586 ± 0.802
2.548TyrVal: 2.548 ± 0.261
0.0TyrTrp: 0.0 ± 0.0
0.255TyrTyr: 0.255 ± 0.145
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3926 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski