Amino acid dipepetide frequency for Plesiomonas phage phiP4-7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.985AlaAla: 9.985 ± 1.175
1.208AlaCys: 1.208 ± 0.326
6.2AlaAsp: 6.2 ± 0.746
6.603AlaGlu: 6.603 ± 1.199
3.623AlaPhe: 3.623 ± 0.397
6.844AlaGly: 6.844 ± 0.742
1.208AlaHis: 1.208 ± 0.415
5.717AlaIle: 5.717 ± 0.586
6.442AlaLys: 6.442 ± 0.758
8.052AlaLeu: 8.052 ± 1.113
3.06AlaMet: 3.06 ± 0.674
3.865AlaAsn: 3.865 ± 0.628
2.657AlaPro: 2.657 ± 0.436
4.429AlaGln: 4.429 ± 0.773
4.67AlaArg: 4.67 ± 0.61
5.878AlaSer: 5.878 ± 0.938
5.234AlaThr: 5.234 ± 0.691
6.281AlaVal: 6.281 ± 0.633
1.208AlaTrp: 1.208 ± 0.361
2.577AlaTyr: 2.577 ± 0.544
0.0AlaXaa: 0.0 ± 0.0
Cys
1.208CysAla: 1.208 ± 0.329
0.242CysCys: 0.242 ± 0.145
1.047CysAsp: 1.047 ± 0.276
0.564CysGlu: 0.564 ± 0.232
0.564CysPhe: 0.564 ± 0.24
1.691CysGly: 1.691 ± 0.413
0.483CysHis: 0.483 ± 0.27
0.805CysIle: 0.805 ± 0.268
1.208CysLys: 1.208 ± 0.348
0.966CysLeu: 0.966 ± 0.268
0.483CysMet: 0.483 ± 0.183
0.886CysAsn: 0.886 ± 0.256
0.644CysPro: 0.644 ± 0.191
0.644CysGln: 0.644 ± 0.226
1.047CysArg: 1.047 ± 0.318
0.886CysSer: 0.886 ± 0.272
0.242CysThr: 0.242 ± 0.14
1.288CysVal: 1.288 ± 0.333
0.081CysTrp: 0.081 ± 0.081
0.081CysTyr: 0.081 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
6.2AspAla: 6.2 ± 0.8
0.966AspCys: 0.966 ± 0.293
4.026AspAsp: 4.026 ± 0.552
3.865AspGlu: 3.865 ± 0.515
2.094AspPhe: 2.094 ± 0.488
5.395AspGly: 5.395 ± 0.622
1.127AspHis: 1.127 ± 0.24
3.382AspIle: 3.382 ± 0.442
2.255AspLys: 2.255 ± 0.457
4.992AspLeu: 4.992 ± 0.505
2.335AspMet: 2.335 ± 0.45
2.255AspAsn: 2.255 ± 0.338
2.738AspPro: 2.738 ± 0.533
2.496AspGln: 2.496 ± 0.496
2.899AspArg: 2.899 ± 0.501
2.738AspSer: 2.738 ± 0.435
2.577AspThr: 2.577 ± 0.359
4.992AspVal: 4.992 ± 0.56
1.53AspTrp: 1.53 ± 0.344
2.818AspTyr: 2.818 ± 0.677
0.0AspXaa: 0.0 ± 0.0
Glu
4.509GluAla: 4.509 ± 0.857
0.966GluCys: 0.966 ± 0.326
2.013GluAsp: 2.013 ± 0.394
2.899GluGlu: 2.899 ± 0.387
2.496GluPhe: 2.496 ± 0.407
2.577GluGly: 2.577 ± 0.514
1.127GluHis: 1.127 ± 0.333
3.946GluIle: 3.946 ± 0.481
4.429GluLys: 4.429 ± 0.597
5.717GluLeu: 5.717 ± 0.557
3.06GluMet: 3.06 ± 0.474
2.094GluAsn: 2.094 ± 0.368
2.255GluPro: 2.255 ± 0.425
3.865GluGln: 3.865 ± 0.619
4.509GluArg: 4.509 ± 0.562
3.704GluSer: 3.704 ± 0.601
3.221GluThr: 3.221 ± 0.449
3.462GluVal: 3.462 ± 0.498
1.208GluTrp: 1.208 ± 0.295
1.691GluTyr: 1.691 ± 0.39
0.0GluXaa: 0.0 ± 0.0
Phe
3.543PheAla: 3.543 ± 0.507
0.322PheCys: 0.322 ± 0.169
2.255PheAsp: 2.255 ± 0.413
2.335PheGlu: 2.335 ± 0.482
1.691PhePhe: 1.691 ± 0.353
2.657PheGly: 2.657 ± 0.457
0.805PheHis: 0.805 ± 0.245
3.221PheIle: 3.221 ± 0.479
1.691PheLys: 1.691 ± 0.331
1.852PheLeu: 1.852 ± 0.393
1.208PheMet: 1.208 ± 0.274
2.094PheAsn: 2.094 ± 0.431
0.805PhePro: 0.805 ± 0.216
0.886PheGln: 0.886 ± 0.261
2.335PheArg: 2.335 ± 0.451
2.738PheSer: 2.738 ± 0.396
1.933PheThr: 1.933 ± 0.45
3.06PheVal: 3.06 ± 0.583
0.242PheTrp: 0.242 ± 0.142
1.047PheTyr: 1.047 ± 0.226
0.0PheXaa: 0.0 ± 0.0
Gly
6.2GlyAla: 6.2 ± 0.741
1.449GlyCys: 1.449 ± 0.312
3.704GlyAsp: 3.704 ± 0.452
4.992GlyGlu: 4.992 ± 0.569
3.865GlyPhe: 3.865 ± 0.635
5.798GlyGly: 5.798 ± 0.72
1.047GlyHis: 1.047 ± 0.305
3.865GlyIle: 3.865 ± 0.571
5.556GlyLys: 5.556 ± 0.723
4.026GlyLeu: 4.026 ± 0.482
2.818GlyMet: 2.818 ± 0.435
2.818GlyAsn: 2.818 ± 0.45
0.725GlyPro: 0.725 ± 0.285
3.14GlyGln: 3.14 ± 0.661
3.14GlyArg: 3.14 ± 0.574
4.912GlySer: 4.912 ± 0.556
3.543GlyThr: 3.543 ± 0.825
5.959GlyVal: 5.959 ± 0.682
0.886GlyTrp: 0.886 ± 0.237
2.416GlyTyr: 2.416 ± 0.401
0.0GlyXaa: 0.0 ± 0.0
His
1.369HisAla: 1.369 ± 0.404
0.644HisCys: 0.644 ± 0.245
1.288HisAsp: 1.288 ± 0.346
1.127HisGlu: 1.127 ± 0.331
0.483HisPhe: 0.483 ± 0.242
1.127HisGly: 1.127 ± 0.301
0.805HisHis: 0.805 ± 0.227
1.047HisIle: 1.047 ± 0.308
0.805HisLys: 0.805 ± 0.286
1.288HisLeu: 1.288 ± 0.332
0.483HisMet: 0.483 ± 0.206
0.725HisAsn: 0.725 ± 0.228
0.966HisPro: 0.966 ± 0.288
0.564HisGln: 0.564 ± 0.235
1.127HisArg: 1.127 ± 0.301
0.483HisSer: 0.483 ± 0.171
1.047HisThr: 1.047 ± 0.27
0.966HisVal: 0.966 ± 0.342
0.242HisTrp: 0.242 ± 0.147
1.047HisTyr: 1.047 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
5.556IleAla: 5.556 ± 0.536
0.483IleCys: 0.483 ± 0.186
4.912IleAsp: 4.912 ± 0.666
4.107IleGlu: 4.107 ± 0.569
1.53IlePhe: 1.53 ± 0.325
4.348IleGly: 4.348 ± 0.649
1.288IleHis: 1.288 ± 0.277
3.704IleIle: 3.704 ± 0.505
6.281IleLys: 6.281 ± 0.68
3.301IleLeu: 3.301 ± 0.569
1.61IleMet: 1.61 ± 0.34
2.899IleAsn: 2.899 ± 0.547
2.899IlePro: 2.899 ± 0.439
2.174IleGln: 2.174 ± 0.545
3.462IleArg: 3.462 ± 0.55
3.785IleSer: 3.785 ± 0.542
5.234IleThr: 5.234 ± 0.735
3.865IleVal: 3.865 ± 0.457
0.564IleTrp: 0.564 ± 0.207
2.335IleTyr: 2.335 ± 0.425
0.0IleXaa: 0.0 ± 0.0
Lys
7.247LysAla: 7.247 ± 0.764
1.127LysCys: 1.127 ± 0.328
3.462LysAsp: 3.462 ± 0.467
3.382LysGlu: 3.382 ± 0.727
2.094LysPhe: 2.094 ± 0.51
3.623LysGly: 3.623 ± 0.55
1.449LysHis: 1.449 ± 0.381
3.704LysIle: 3.704 ± 0.566
4.268LysLys: 4.268 ± 0.699
5.717LysLeu: 5.717 ± 0.719
2.657LysMet: 2.657 ± 0.393
2.738LysAsn: 2.738 ± 0.549
2.818LysPro: 2.818 ± 0.381
3.06LysGln: 3.06 ± 0.532
4.67LysArg: 4.67 ± 0.721
3.946LysSer: 3.946 ± 0.538
3.623LysThr: 3.623 ± 0.558
4.187LysVal: 4.187 ± 0.551
0.322LysTrp: 0.322 ± 0.128
2.174LysTyr: 2.174 ± 0.364
0.0LysXaa: 0.0 ± 0.0
Leu
7.489LeuAla: 7.489 ± 0.731
1.208LeuCys: 1.208 ± 0.356
5.153LeuAsp: 5.153 ± 0.656
4.751LeuGlu: 4.751 ± 0.77
2.738LeuPhe: 2.738 ± 0.429
3.946LeuGly: 3.946 ± 0.485
1.208LeuHis: 1.208 ± 0.263
4.59LeuIle: 4.59 ± 0.533
5.314LeuLys: 5.314 ± 0.64
5.234LeuLeu: 5.234 ± 0.666
1.691LeuMet: 1.691 ± 0.341
4.026LeuAsn: 4.026 ± 0.568
2.657LeuPro: 2.657 ± 0.522
3.301LeuGln: 3.301 ± 0.549
3.623LeuArg: 3.623 ± 0.508
5.314LeuSer: 5.314 ± 0.65
6.361LeuThr: 6.361 ± 0.732
4.187LeuVal: 4.187 ± 0.507
0.805LeuTrp: 0.805 ± 0.21
1.771LeuTyr: 1.771 ± 0.344
0.0LeuXaa: 0.0 ± 0.0
Met
4.107MetAla: 4.107 ± 0.634
0.644MetCys: 0.644 ± 0.242
1.208MetAsp: 1.208 ± 0.276
1.449MetGlu: 1.449 ± 0.328
0.644MetPhe: 0.644 ± 0.198
1.691MetGly: 1.691 ± 0.339
0.725MetHis: 0.725 ± 0.265
2.416MetIle: 2.416 ± 0.362
2.174MetLys: 2.174 ± 0.38
2.335MetLeu: 2.335 ± 0.33
0.966MetMet: 0.966 ± 0.258
1.852MetAsn: 1.852 ± 0.393
1.288MetPro: 1.288 ± 0.33
1.127MetGln: 1.127 ± 0.271
1.771MetArg: 1.771 ± 0.372
2.335MetSer: 2.335 ± 0.583
2.657MetThr: 2.657 ± 0.479
2.094MetVal: 2.094 ± 0.407
0.483MetTrp: 0.483 ± 0.153
0.564MetTyr: 0.564 ± 0.219
0.0MetXaa: 0.0 ± 0.0
Asn
4.751AsnAla: 4.751 ± 0.807
0.725AsnCys: 0.725 ± 0.241
3.301AsnAsp: 3.301 ± 0.491
2.416AsnGlu: 2.416 ± 0.452
1.369AsnPhe: 1.369 ± 0.304
4.026AsnGly: 4.026 ± 0.63
0.725AsnHis: 0.725 ± 0.209
2.577AsnIle: 2.577 ± 0.4
2.899AsnLys: 2.899 ± 0.425
3.06AsnLeu: 3.06 ± 0.52
0.725AsnMet: 0.725 ± 0.242
2.657AsnAsn: 2.657 ± 0.449
1.852AsnPro: 1.852 ± 0.489
1.852AsnGln: 1.852 ± 0.313
1.933AsnArg: 1.933 ± 0.393
2.013AsnSer: 2.013 ± 0.345
2.255AsnThr: 2.255 ± 0.286
2.577AsnVal: 2.577 ± 0.392
0.644AsnTrp: 0.644 ± 0.221
1.208AsnTyr: 1.208 ± 0.365
0.0AsnXaa: 0.0 ± 0.0
Pro
4.348ProAla: 4.348 ± 0.678
0.322ProCys: 0.322 ± 0.149
1.852ProAsp: 1.852 ± 0.495
3.623ProGlu: 3.623 ± 0.386
1.288ProPhe: 1.288 ± 0.342
0.805ProGly: 0.805 ± 0.236
0.564ProHis: 0.564 ± 0.205
2.094ProIle: 2.094 ± 0.463
1.369ProLys: 1.369 ± 0.259
2.738ProLeu: 2.738 ± 0.451
1.047ProMet: 1.047 ± 0.214
1.369ProAsn: 1.369 ± 0.295
1.127ProPro: 1.127 ± 0.309
2.416ProGln: 2.416 ± 0.505
2.094ProArg: 2.094 ± 0.491
3.382ProSer: 3.382 ± 0.628
2.255ProThr: 2.255 ± 0.391
1.852ProVal: 1.852 ± 0.343
0.483ProTrp: 0.483 ± 0.244
0.966ProTyr: 0.966 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
3.623GlnAla: 3.623 ± 0.48
0.725GlnCys: 0.725 ± 0.221
2.416GlnAsp: 2.416 ± 0.468
2.335GlnGlu: 2.335 ± 0.439
2.174GlnPhe: 2.174 ± 0.445
2.818GlnGly: 2.818 ± 0.507
1.047GlnHis: 1.047 ± 0.293
3.462GlnIle: 3.462 ± 0.645
3.06GlnLys: 3.06 ± 0.484
4.509GlnLeu: 4.509 ± 0.626
1.933GlnMet: 1.933 ± 0.372
1.53GlnAsn: 1.53 ± 0.343
2.174GlnPro: 2.174 ± 0.509
3.221GlnGln: 3.221 ± 0.562
2.255GlnArg: 2.255 ± 0.381
2.094GlnSer: 2.094 ± 0.393
1.852GlnThr: 1.852 ± 0.406
1.933GlnVal: 1.933 ± 0.405
0.725GlnTrp: 0.725 ± 0.259
1.53GlnTyr: 1.53 ± 0.381
0.0GlnXaa: 0.0 ± 0.0
Arg
4.59ArgAla: 4.59 ± 0.555
0.886ArgCys: 0.886 ± 0.286
3.382ArgAsp: 3.382 ± 0.525
3.301ArgGlu: 3.301 ± 0.596
1.852ArgPhe: 1.852 ± 0.348
4.107ArgGly: 4.107 ± 0.531
0.886ArgHis: 0.886 ± 0.272
3.946ArgIle: 3.946 ± 0.524
4.107ArgLys: 4.107 ± 0.576
3.865ArgLeu: 3.865 ± 0.563
2.174ArgMet: 2.174 ± 0.368
2.416ArgAsn: 2.416 ± 0.425
1.61ArgPro: 1.61 ± 0.394
2.496ArgGln: 2.496 ± 0.419
2.577ArgArg: 2.577 ± 0.397
2.255ArgSer: 2.255 ± 0.45
2.899ArgThr: 2.899 ± 0.525
3.221ArgVal: 3.221 ± 0.515
0.725ArgTrp: 0.725 ± 0.193
1.771ArgTyr: 1.771 ± 0.376
0.0ArgXaa: 0.0 ± 0.0
Ser
5.314SerAla: 5.314 ± 0.747
0.564SerCys: 0.564 ± 0.192
4.509SerAsp: 4.509 ± 0.492
2.818SerGlu: 2.818 ± 0.499
2.577SerPhe: 2.577 ± 0.524
6.12SerGly: 6.12 ± 0.783
0.805SerHis: 0.805 ± 0.296
3.704SerIle: 3.704 ± 0.545
3.543SerLys: 3.543 ± 0.495
4.67SerLeu: 4.67 ± 0.632
1.288SerMet: 1.288 ± 0.29
1.933SerAsn: 1.933 ± 0.39
1.691SerPro: 1.691 ± 0.399
2.496SerGln: 2.496 ± 0.443
3.06SerArg: 3.06 ± 0.542
2.738SerSer: 2.738 ± 0.47
3.543SerThr: 3.543 ± 0.418
4.348SerVal: 4.348 ± 0.576
1.047SerTrp: 1.047 ± 0.236
1.53SerTyr: 1.53 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
6.12ThrAla: 6.12 ± 0.843
0.725ThrCys: 0.725 ± 0.266
3.462ThrAsp: 3.462 ± 0.606
3.704ThrGlu: 3.704 ± 0.461
1.691ThrPhe: 1.691 ± 0.372
4.429ThrGly: 4.429 ± 0.484
0.886ThrHis: 0.886 ± 0.225
4.509ThrIle: 4.509 ± 0.888
3.946ThrLys: 3.946 ± 0.514
5.475ThrLeu: 5.475 ± 0.653
1.208ThrMet: 1.208 ± 0.233
2.094ThrAsn: 2.094 ± 0.34
3.704ThrPro: 3.704 ± 0.598
2.657ThrGln: 2.657 ± 0.494
2.094ThrArg: 2.094 ± 0.438
2.335ThrSer: 2.335 ± 0.326
2.818ThrThr: 2.818 ± 0.548
4.509ThrVal: 4.509 ± 0.547
0.483ThrTrp: 0.483 ± 0.18
1.208ThrTyr: 1.208 ± 0.363
0.0ThrXaa: 0.0 ± 0.0
Val
5.717ValAla: 5.717 ± 0.604
1.208ValCys: 1.208 ± 0.323
4.268ValAsp: 4.268 ± 0.599
3.543ValGlu: 3.543 ± 0.439
2.335ValPhe: 2.335 ± 0.357
4.831ValGly: 4.831 ± 0.507
0.564ValHis: 0.564 ± 0.253
4.912ValIle: 4.912 ± 0.706
5.153ValLys: 5.153 ± 0.684
4.429ValLeu: 4.429 ± 0.606
2.657ValMet: 2.657 ± 0.435
3.865ValAsn: 3.865 ± 0.606
1.61ValPro: 1.61 ± 0.394
2.174ValGln: 2.174 ± 0.346
3.06ValArg: 3.06 ± 0.529
4.348ValSer: 4.348 ± 0.529
3.865ValThr: 3.865 ± 0.538
4.59ValVal: 4.59 ± 0.63
0.725ValTrp: 0.725 ± 0.23
2.174ValTyr: 2.174 ± 0.407
0.0ValXaa: 0.0 ± 0.0
Trp
0.725TrpAla: 0.725 ± 0.24
0.242TrpCys: 0.242 ± 0.134
0.725TrpAsp: 0.725 ± 0.21
0.564TrpGlu: 0.564 ± 0.205
0.644TrpPhe: 0.644 ± 0.207
0.725TrpGly: 0.725 ± 0.22
0.242TrpHis: 0.242 ± 0.13
0.403TrpIle: 0.403 ± 0.163
0.403TrpLys: 0.403 ± 0.162
0.966TrpLeu: 0.966 ± 0.25
0.403TrpMet: 0.403 ± 0.172
0.644TrpAsn: 0.644 ± 0.173
0.644TrpPro: 0.644 ± 0.235
1.047TrpGln: 1.047 ± 0.271
1.127TrpArg: 1.127 ± 0.341
0.886TrpSer: 0.886 ± 0.261
0.966TrpThr: 0.966 ± 0.258
0.725TrpVal: 0.725 ± 0.214
0.242TrpTrp: 0.242 ± 0.122
0.725TrpTyr: 0.725 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.14TyrAla: 3.14 ± 0.68
0.403TyrCys: 0.403 ± 0.176
2.496TyrAsp: 2.496 ± 0.497
1.369TyrGlu: 1.369 ± 0.318
0.966TyrPhe: 0.966 ± 0.258
3.301TyrGly: 3.301 ± 0.53
0.644TyrHis: 0.644 ± 0.276
2.335TyrIle: 2.335 ± 0.432
1.449TyrLys: 1.449 ± 0.406
2.174TyrLeu: 2.174 ± 0.368
0.644TyrMet: 0.644 ± 0.206
0.805TyrAsn: 0.805 ± 0.202
1.127TyrPro: 1.127 ± 0.244
1.369TyrGln: 1.369 ± 0.308
1.61TyrArg: 1.61 ± 0.345
1.53TyrSer: 1.53 ± 0.307
1.933TyrThr: 1.933 ± 0.339
1.933TyrVal: 1.933 ± 0.346
0.403TyrTrp: 0.403 ± 0.168
0.966TyrTyr: 0.966 ± 0.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (12420 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski