Amino acid dipepetide frequency for Penicillium chrysogenum virus (isolate Caston/2003) (PcV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.482AlaAla: 6.482 ± 2.03
1.037AlaCys: 1.037 ± 0.339
3.37AlaAsp: 3.37 ± 1.017
7.519AlaGlu: 7.519 ± 1.439
2.074AlaPhe: 2.074 ± 0.662
5.185AlaGly: 5.185 ± 1.115
2.074AlaHis: 2.074 ± 0.863
4.148AlaIle: 4.148 ± 1.072
3.37AlaLys: 3.37 ± 0.559
6.741AlaLeu: 6.741 ± 0.562
3.111AlaMet: 3.111 ± 0.812
1.815AlaAsn: 1.815 ± 0.608
2.852AlaPro: 2.852 ± 0.459
3.111AlaGln: 3.111 ± 0.833
5.445AlaArg: 5.445 ± 0.684
4.408AlaSer: 4.408 ± 1.171
5.185AlaThr: 5.185 ± 0.637
4.667AlaVal: 4.667 ± 0.333
1.037AlaTrp: 1.037 ± 0.056
1.296AlaTyr: 1.296 ± 0.172
0.0AlaXaa: 0.0 ± 0.0
Cys
1.556CysAla: 1.556 ± 0.924
0.0CysCys: 0.0 ± 0.0
1.815CysAsp: 1.815 ± 0.521
0.519CysGlu: 0.519 ± 0.236
0.259CysPhe: 0.259 ± 0.239
1.815CysGly: 1.815 ± 0.244
0.778CysHis: 0.778 ± 0.224
0.259CysIle: 0.259 ± 0.217
1.296CysLys: 1.296 ± 0.172
1.296CysLeu: 1.296 ± 0.423
0.519CysMet: 0.519 ± 0.283
0.259CysAsn: 0.259 ± 0.247
0.519CysPro: 0.519 ± 0.283
1.037CysGln: 1.037 ± 0.383
2.074CysArg: 2.074 ± 0.428
1.556CysSer: 1.556 ± 0.518
0.778CysThr: 0.778 ± 0.397
0.519CysVal: 0.519 ± 0.261
0.259CysTrp: 0.259 ± 0.219
0.778CysTyr: 0.778 ± 0.4
0.0CysXaa: 0.0 ± 0.0
Asp
4.148AspAla: 4.148 ± 0.896
1.037AspCys: 1.037 ± 0.566
2.333AspAsp: 2.333 ± 0.432
3.37AspGlu: 3.37 ± 1.261
2.074AspPhe: 2.074 ± 0.895
5.963AspGly: 5.963 ± 1.025
0.778AspHis: 0.778 ± 0.656
4.408AspIle: 4.408 ± 0.109
3.37AspLys: 3.37 ± 0.617
5.963AspLeu: 5.963 ± 1.088
1.815AspMet: 1.815 ± 0.644
2.074AspAsn: 2.074 ± 0.462
1.815AspPro: 1.815 ± 0.739
1.037AspGln: 1.037 ± 0.335
2.593AspArg: 2.593 ± 0.51
3.37AspSer: 3.37 ± 0.783
2.074AspThr: 2.074 ± 0.439
2.852AspVal: 2.852 ± 0.248
2.333AspTrp: 2.333 ± 0.958
3.37AspTyr: 3.37 ± 0.923
0.0AspXaa: 0.0 ± 0.0
Glu
6.741GluAla: 6.741 ± 0.911
1.556GluCys: 1.556 ± 0.836
3.63GluAsp: 3.63 ± 0.897
6.222GluGlu: 6.222 ± 0.838
2.852GluPhe: 2.852 ± 0.694
5.704GluGly: 5.704 ± 1.173
2.333GluHis: 2.333 ± 0.5
2.593GluIle: 2.593 ± 1.121
4.667GluLys: 4.667 ± 0.965
5.704GluLeu: 5.704 ± 0.738
3.63GluMet: 3.63 ± 0.738
2.074GluAsn: 2.074 ± 0.63
1.815GluPro: 1.815 ± 0.379
3.63GluGln: 3.63 ± 0.694
5.704GluArg: 5.704 ± 0.593
3.111GluSer: 3.111 ± 0.549
2.074GluThr: 2.074 ± 0.428
5.185GluVal: 5.185 ± 1.874
1.556GluTrp: 1.556 ± 0.601
2.074GluTyr: 2.074 ± 0.112
0.0GluXaa: 0.0 ± 0.0
Phe
1.556PheAla: 1.556 ± 0.793
0.519PheCys: 0.519 ± 0.437
3.37PheAsp: 3.37 ± 0.799
2.852PheGlu: 2.852 ± 0.653
1.037PhePhe: 1.037 ± 0.875
3.111PheGly: 3.111 ± 0.386
0.778PheHis: 0.778 ± 0.229
1.296PheIle: 1.296 ± 0.525
1.556PheLys: 1.556 ± 0.731
2.593PheLeu: 2.593 ± 0.462
1.037PheMet: 1.037 ± 0.393
1.296PheAsn: 1.296 ± 0.393
1.037PhePro: 1.037 ± 0.471
0.259PheGln: 0.259 ± 0.219
2.333PheArg: 2.333 ± 0.638
3.111PheSer: 3.111 ± 0.975
3.111PheThr: 3.111 ± 0.468
2.074PheVal: 2.074 ± 0.826
0.519PheTrp: 0.519 ± 0.261
0.259PheTyr: 0.259 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
5.185GlyAla: 5.185 ± 0.783
0.778GlyCys: 0.778 ± 0.656
4.148GlyAsp: 4.148 ± 0.504
4.408GlyGlu: 4.408 ± 0.355
2.333GlyPhe: 2.333 ± 0.429
5.185GlyGly: 5.185 ± 0.565
1.037GlyHis: 1.037 ± 0.34
2.852GlyIle: 2.852 ± 0.957
4.667GlyLys: 4.667 ± 0.955
8.037GlyLeu: 8.037 ± 0.69
3.37GlyMet: 3.37 ± 0.86
2.593GlyAsn: 2.593 ± 0.448
3.111GlyPro: 3.111 ± 1.201
2.593GlyGln: 2.593 ± 0.278
6.482GlyArg: 6.482 ± 0.939
4.667GlySer: 4.667 ± 1.125
4.408GlyThr: 4.408 ± 0.109
5.704GlyVal: 5.704 ± 1.408
2.074GlyTrp: 2.074 ± 0.112
1.556GlyTyr: 1.556 ± 0.538
0.0GlyXaa: 0.0 ± 0.0
His
0.778HisAla: 0.778 ± 0.229
0.0HisCys: 0.0 ± 0.0
1.556HisAsp: 1.556 ± 0.547
2.074HisGlu: 2.074 ± 0.309
1.037HisPhe: 1.037 ± 0.056
1.815HisGly: 1.815 ± 0.185
0.259HisHis: 0.259 ± 0.217
0.519HisIle: 0.519 ± 0.236
1.037HisLys: 1.037 ± 0.056
3.889HisLeu: 3.889 ± 0.652
1.556HisMet: 1.556 ± 0.681
0.778HisAsn: 0.778 ± 0.656
0.778HisPro: 0.778 ± 0.451
0.259HisGln: 0.259 ± 0.217
1.815HisArg: 1.815 ± 0.737
2.074HisSer: 2.074 ± 0.798
1.556HisThr: 1.556 ± 0.547
2.593HisVal: 2.593 ± 0.756
0.778HisTrp: 0.778 ± 0.224
0.259HisTyr: 0.259 ± 0.247
0.0HisXaa: 0.0 ± 0.0
Ile
4.408IleAla: 4.408 ± 0.895
1.037IleCys: 1.037 ± 0.34
3.111IleAsp: 3.111 ± 0.812
3.37IleGlu: 3.37 ± 0.417
0.778IlePhe: 0.778 ± 0.397
2.593IleGly: 2.593 ± 0.37
1.296IleHis: 1.296 ± 0.643
2.074IleIle: 2.074 ± 0.662
1.037IleLys: 1.037 ± 0.523
3.111IleLeu: 3.111 ± 0.996
0.519IleMet: 0.519 ± 0.377
3.111IleAsn: 3.111 ± 0.636
1.815IlePro: 1.815 ± 0.321
1.037IleGln: 1.037 ± 0.594
2.852IleArg: 2.852 ± 1.099
3.889IleSer: 3.889 ± 0.428
2.333IleThr: 2.333 ± 0.628
2.593IleVal: 2.593 ± 0.626
0.519IleTrp: 0.519 ± 0.437
1.296IleTyr: 1.296 ± 0.172
0.0IleXaa: 0.0 ± 0.0
Lys
2.593LysAla: 2.593 ± 0.37
1.296LysCys: 1.296 ± 0.663
3.63LysAsp: 3.63 ± 0.604
4.926LysGlu: 4.926 ± 1.177
2.593LysPhe: 2.593 ± 0.683
2.333LysGly: 2.333 ± 0.747
2.593LysHis: 2.593 ± 0.761
1.815LysIle: 1.815 ± 0.662
3.889LysLys: 3.889 ± 0.672
6.222LysLeu: 6.222 ± 0.842
2.852LysMet: 2.852 ± 0.374
0.519LysAsn: 0.519 ± 0.27
1.556LysPro: 1.556 ± 0.601
2.074LysGln: 2.074 ± 0.428
2.593LysArg: 2.593 ± 1.051
2.852LysSer: 2.852 ± 0.248
2.852LysThr: 2.852 ± 0.649
2.074LysVal: 2.074 ± 0.5
1.037LysTrp: 1.037 ± 0.523
1.815LysTyr: 1.815 ± 0.232
0.0LysXaa: 0.0 ± 0.0
Leu
8.556LeuAla: 8.556 ± 1.304
1.296LeuCys: 1.296 ± 0.44
3.37LeuAsp: 3.37 ± 0.923
7.519LeuGlu: 7.519 ± 0.664
3.889LeuPhe: 3.889 ± 0.843
6.741LeuGly: 6.741 ± 2.177
2.074LeuHis: 2.074 ± 0.77
2.333LeuIle: 2.333 ± 0.555
4.148LeuLys: 4.148 ± 1.226
8.037LeuLeu: 8.037 ± 1.697
3.111LeuMet: 3.111 ± 0.167
2.852LeuAsn: 2.852 ± 0.892
4.148LeuPro: 4.148 ± 0.811
2.852LeuGln: 2.852 ± 0.459
6.741LeuArg: 6.741 ± 0.348
6.222LeuSer: 6.222 ± 0.969
5.963LeuThr: 5.963 ± 0.861
8.037LeuVal: 8.037 ± 1.256
0.778LeuTrp: 0.778 ± 0.229
3.889LeuTyr: 3.889 ± 0.896
0.0LeuXaa: 0.0 ± 0.0
Met
3.63MetAla: 3.63 ± 0.642
1.815MetCys: 1.815 ± 0.482
2.593MetAsp: 2.593 ± 0.37
0.259MetGlu: 0.259 ± 0.219
0.778MetPhe: 0.778 ± 0.656
3.63MetGly: 3.63 ± 1.022
0.778MetHis: 0.778 ± 0.45
0.519MetIle: 0.519 ± 0.495
1.815MetLys: 1.815 ± 0.59
2.593MetLeu: 2.593 ± 0.682
1.556MetMet: 1.556 ± 0.924
1.556MetAsn: 1.556 ± 0.731
2.333MetPro: 2.333 ± 0.432
0.778MetGln: 0.778 ± 0.651
2.333MetArg: 2.333 ± 0.244
3.111MetSer: 3.111 ± 0.666
1.556MetThr: 1.556 ± 0.333
3.63MetVal: 3.63 ± 1.655
0.519MetTrp: 0.519 ± 0.495
1.556MetTyr: 1.556 ± 0.448
0.0MetXaa: 0.0 ± 0.0
Asn
3.111AsnAla: 3.111 ± 0.976
0.0AsnCys: 0.0 ± 0.0
0.778AsnAsp: 0.778 ± 0.462
2.333AsnGlu: 2.333 ± 0.859
1.556AsnPhe: 1.556 ± 0.826
2.333AsnGly: 2.333 ± 0.337
0.778AsnHis: 0.778 ± 0.496
1.037AsnIle: 1.037 ± 0.362
1.815AsnLys: 1.815 ± 1.531
3.37AsnLeu: 3.37 ± 1.031
1.037AsnMet: 1.037 ± 0.415
1.037AsnAsn: 1.037 ± 0.471
1.037AsnPro: 1.037 ± 0.644
1.296AsnGln: 1.296 ± 0.619
2.593AsnArg: 2.593 ± 0.448
2.852AsnSer: 2.852 ± 1.14
2.852AsnThr: 2.852 ± 1.196
2.074AsnVal: 2.074 ± 1.156
1.037AsnTrp: 1.037 ± 0.673
0.519AsnTyr: 0.519 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
2.852ProAla: 2.852 ± 0.66
0.259ProCys: 0.259 ± 0.239
1.815ProAsp: 1.815 ± 0.608
2.333ProGlu: 2.333 ± 0.672
0.519ProPhe: 0.519 ± 0.236
4.148ProGly: 4.148 ± 0.605
1.037ProHis: 1.037 ± 0.393
1.815ProIle: 1.815 ± 0.66
3.111ProLys: 3.111 ± 0.482
1.556ProLeu: 1.556 ± 0.923
0.778ProMet: 0.778 ± 0.211
1.296ProAsn: 1.296 ± 0.44
2.593ProPro: 2.593 ± 0.756
2.074ProGln: 2.074 ± 1.006
2.593ProArg: 2.593 ± 0.679
4.667ProSer: 4.667 ± 1.018
3.111ProThr: 3.111 ± 0.975
2.852ProVal: 2.852 ± 0.806
1.037ProTrp: 1.037 ± 0.34
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.296GlnAla: 1.296 ± 0.423
0.778GlnCys: 0.778 ± 0.475
1.037GlnAsp: 1.037 ± 0.398
1.556GlnGlu: 1.556 ± 0.733
1.296GlnPhe: 1.296 ± 0.393
3.37GlnGly: 3.37 ± 1.206
1.296GlnHis: 1.296 ± 0.643
3.889GlnIle: 3.889 ± 0.484
1.037GlnLys: 1.037 ± 0.398
1.815GlnLeu: 1.815 ± 0.607
2.593GlnMet: 2.593 ± 0.659
1.296GlnAsn: 1.296 ± 0.172
0.778GlnPro: 0.778 ± 0.462
1.556GlnGln: 1.556 ± 0.264
0.778GlnArg: 0.778 ± 0.475
1.296GlnSer: 1.296 ± 0.508
1.556GlnThr: 1.556 ± 0.328
3.111GlnVal: 3.111 ± 0.316
0.519GlnTrp: 0.519 ± 0.283
1.556GlnTyr: 1.556 ± 0.524
0.0GlnXaa: 0.0 ± 0.0
Arg
5.963ArgAla: 5.963 ± 1.149
0.519ArgCys: 0.519 ± 0.236
5.704ArgAsp: 5.704 ± 0.85
7.0ArgGlu: 7.0 ± 1.886
1.296ArgPhe: 1.296 ± 0.172
4.148ArgGly: 4.148 ± 0.799
2.333ArgHis: 2.333 ± 0.415
3.889ArgIle: 3.889 ± 0.974
2.852ArgLys: 2.852 ± 1.173
7.519ArgLeu: 7.519 ± 0.803
1.296ArgMet: 1.296 ± 0.582
2.333ArgAsn: 2.333 ± 0.293
2.074ArgPro: 2.074 ± 0.751
2.852ArgGln: 2.852 ± 0.374
5.963ArgArg: 5.963 ± 1.852
3.63ArgSer: 3.63 ± 0.711
3.111ArgThr: 3.111 ± 0.846
5.445ArgVal: 5.445 ± 0.786
1.556ArgTrp: 1.556 ± 0.733
1.556ArgTyr: 1.556 ± 0.558
0.0ArgXaa: 0.0 ± 0.0
Ser
4.408SerAla: 4.408 ± 1.265
2.074SerCys: 2.074 ± 0.751
3.37SerAsp: 3.37 ± 0.708
4.408SerGlu: 4.408 ± 0.527
1.037SerPhe: 1.037 ± 0.362
4.408SerGly: 4.408 ± 0.355
1.815SerHis: 1.815 ± 0.521
2.333SerIle: 2.333 ± 0.867
3.889SerLys: 3.889 ± 1.069
7.778SerLeu: 7.778 ± 0.738
1.556SerMet: 1.556 ± 0.448
1.815SerAsn: 1.815 ± 0.346
3.111SerPro: 3.111 ± 0.683
1.037SerGln: 1.037 ± 0.362
5.963SerArg: 5.963 ± 1.309
4.926SerSer: 4.926 ± 1.082
4.148SerThr: 4.148 ± 1.9
4.667SerVal: 4.667 ± 0.783
1.037SerTrp: 1.037 ± 0.523
3.37SerTyr: 3.37 ± 0.73
0.0SerXaa: 0.0 ± 0.0
Thr
3.37ThrAla: 3.37 ± 1.071
1.037ThrCys: 1.037 ± 0.339
2.852ThrAsp: 2.852 ± 0.638
3.111ThrGlu: 3.111 ± 0.313
1.296ThrPhe: 1.296 ± 0.461
3.63ThrGly: 3.63 ± 1.222
0.259ThrHis: 0.259 ± 0.217
3.111ThrIle: 3.111 ± 1.531
3.37ThrLys: 3.37 ± 0.448
6.222ThrLeu: 6.222 ± 0.952
2.333ThrMet: 2.333 ± 0.337
1.556ThrAsn: 1.556 ± 0.547
2.333ThrPro: 2.333 ± 0.921
1.815ThrGln: 1.815 ± 0.608
4.926ThrArg: 4.926 ± 0.872
3.63ThrSer: 3.63 ± 0.566
3.37ThrThr: 3.37 ± 0.292
4.148ThrVal: 4.148 ± 0.314
0.519ThrTrp: 0.519 ± 0.261
1.815ThrTyr: 1.815 ± 0.537
0.0ThrXaa: 0.0 ± 0.0
Val
4.926ValAla: 4.926 ± 0.513
1.815ValCys: 1.815 ± 0.847
4.148ValAsp: 4.148 ± 1.352
5.185ValGlu: 5.185 ± 1.111
4.408ValPhe: 4.408 ± 1.27
4.408ValGly: 4.408 ± 0.956
2.333ValHis: 2.333 ± 0.638
2.852ValIle: 2.852 ± 0.528
2.852ValLys: 2.852 ± 1.331
5.704ValLeu: 5.704 ± 1.988
2.852ValMet: 2.852 ± 0.746
2.333ValAsn: 2.333 ± 0.337
5.185ValPro: 5.185 ± 0.858
1.556ValGln: 1.556 ± 0.643
4.667ValArg: 4.667 ± 0.791
4.926ValSer: 4.926 ± 0.959
3.37ValThr: 3.37 ± 0.553
5.963ValVal: 5.963 ± 0.9
0.778ValTrp: 0.778 ± 0.45
1.296ValTyr: 1.296 ± 0.525
0.0ValXaa: 0.0 ± 0.0
Trp
0.778TrpAla: 0.778 ± 0.451
0.778TrpCys: 0.778 ± 0.451
1.815TrpAsp: 1.815 ± 0.644
1.037TrpGlu: 1.037 ± 0.523
1.296TrpPhe: 1.296 ± 0.663
1.037TrpGly: 1.037 ± 0.702
0.259TrpHis: 0.259 ± 0.219
1.037TrpIle: 1.037 ± 0.34
1.296TrpLys: 1.296 ± 0.525
2.593TrpLeu: 2.593 ± 0.726
0.519TrpMet: 0.519 ± 0.261
1.037TrpAsn: 1.037 ± 0.385
0.259TrpPro: 0.259 ± 0.239
0.259TrpGln: 0.259 ± 0.219
1.296TrpArg: 1.296 ± 0.525
1.037TrpSer: 1.037 ± 0.615
0.519TrpThr: 0.519 ± 0.495
0.259TrpVal: 0.259 ± 0.217
0.0TrpTrp: 0.0 ± 0.0
0.778TrpTyr: 0.778 ± 0.459
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.593TyrAla: 2.593 ± 0.784
0.259TyrCys: 0.259 ± 0.247
2.333TyrAsp: 2.333 ± 0.687
2.852TyrGlu: 2.852 ± 0.649
1.296TyrPhe: 1.296 ± 0.477
3.63TyrGly: 3.63 ± 0.791
0.519TyrHis: 0.519 ± 0.437
0.259TyrIle: 0.259 ± 0.239
1.037TyrLys: 1.037 ± 0.607
1.815TyrLeu: 1.815 ± 0.346
1.037TyrMet: 1.037 ± 0.393
1.815TyrAsn: 1.815 ± 1.015
1.296TyrPro: 1.296 ± 0.508
1.296TyrGln: 1.296 ± 0.261
1.296TyrArg: 1.296 ± 0.423
1.815TyrSer: 1.815 ± 0.725
0.778TyrThr: 0.778 ± 0.459
3.111TyrVal: 3.111 ± 1.411
0.0TyrTrp: 0.0 ± 0.0
0.778TyrTyr: 0.778 ± 0.419
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3858 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski