Amino acid dipeptide frequency for Pseudoalteromonas phage SL25

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
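
As an illustration of the counting described above, here is a minimal Python sketch. It is not the database's own code: the function names (`normalise`, `dipeptide_per_mille`, `classify`), the toy sequences, the mapping of every non-standard letter to `X` (Xaa), and the choice to normalise by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (assumed mapping for anything else)

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / 400


def normalise(seq):
    """Upper-case a sequence and replace every non-standard letter with X."""
    return "".join(c if c in STANDARD else "X" for c in seq.upper())


def dipeptide_per_mille(proteins):
    """Return a dict of all 441 dipeptides with their frequency in per mille.

    Dipeptides are counted within each protein only, never across
    protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        seq = normalise(seq)
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "equal"


# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_per_mille(["AAAC", "CA"])
print(len(freqs), freqs["AA"], classify(freqs["AA"]))
```

With this threshold, a table value such as 9.023 for AlaAla would be labelled "over" and 0.618 for AlaCys "under".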

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
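
The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical and serve only to show the arithmetic: a per-mille value times 10^-3 gives a fraction, and a per-mille value divided by 10 gives a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 9.023 per mille
ala_ala_fraction = per_mille_to_fraction(9.023)
ala_ala_percent = per_mille_to_percent(9.023)

# The uniform expectation of 0.25 % corresponds to 2.5 per mille
uniform_percent = per_mille_to_percent(2.5)
```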

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.023AlaAla: 9.023 ± 1.679
0.618AlaCys: 0.618 ± 0.289
4.697AlaAsp: 4.697 ± 1.085
5.439AlaGlu: 5.439 ± 0.911
2.967AlaPhe: 2.967 ± 0.461
6.304AlaGly: 6.304 ± 1.032
0.742AlaHis: 0.742 ± 0.283
5.933AlaIle: 5.933 ± 0.867
5.068AlaLys: 5.068 ± 0.795
8.282AlaLeu: 8.282 ± 0.815
3.708AlaMet: 3.708 ± 0.995
5.562AlaAsn: 5.562 ± 0.802
2.843AlaPro: 2.843 ± 0.579
2.596AlaGln: 2.596 ± 0.561
3.585AlaArg: 3.585 ± 0.635
5.562AlaSer: 5.562 ± 0.561
5.192AlaThr: 5.192 ± 0.791
5.562AlaVal: 5.562 ± 0.744
0.742AlaTrp: 0.742 ± 0.328
2.596AlaTyr: 2.596 ± 0.604
0.0AlaXaa: 0.0 ± 0.0
Cys
0.742CysAla: 0.742 ± 0.313
0.371CysCys: 0.371 ± 0.192
0.494CysAsp: 0.494 ± 0.207
1.236CysGlu: 1.236 ± 0.432
0.124CysPhe: 0.124 ± 0.147
0.618CysGly: 0.618 ± 0.281
0.371CysHis: 0.371 ± 0.299
0.742CysIle: 0.742 ± 0.334
0.618CysLys: 0.618 ± 0.235
0.618CysLeu: 0.618 ± 0.246
0.247CysMet: 0.247 ± 0.162
0.494CysAsn: 0.494 ± 0.262
0.124CysPro: 0.124 ± 0.117
0.371CysGln: 0.371 ± 0.201
0.742CysArg: 0.742 ± 0.27
0.742CysSer: 0.742 ± 0.467
0.865CysThr: 0.865 ± 0.285
0.494CysVal: 0.494 ± 0.234
0.371CysTrp: 0.371 ± 0.18
0.865CysTyr: 0.865 ± 0.347
0.0CysXaa: 0.0 ± 0.0
Asp
4.45AspAla: 4.45 ± 0.517
0.742AspCys: 0.742 ± 0.293
3.832AspAsp: 3.832 ± 0.642
4.203AspGlu: 4.203 ± 0.838
2.101AspPhe: 2.101 ± 0.388
4.326AspGly: 4.326 ± 0.52
0.371AspHis: 0.371 ± 0.158
4.45AspIle: 4.45 ± 0.914
3.585AspLys: 3.585 ± 0.523
5.562AspLeu: 5.562 ± 0.927
1.36AspMet: 1.36 ± 0.354
3.708AspAsn: 3.708 ± 0.585
2.225AspPro: 2.225 ± 0.525
1.854AspGln: 1.854 ± 0.383
1.854AspArg: 1.854 ± 0.371
4.079AspSer: 4.079 ± 0.764
3.708AspThr: 3.708 ± 0.588
2.349AspVal: 2.349 ± 0.377
1.36AspTrp: 1.36 ± 0.378
2.843AspTyr: 2.843 ± 0.509
0.0AspXaa: 0.0 ± 0.0
Glu
5.315GluAla: 5.315 ± 1.053
0.494GluCys: 0.494 ± 0.239
3.09GluAsp: 3.09 ± 0.628
3.956GluGlu: 3.956 ± 0.438
2.596GluPhe: 2.596 ± 0.411
3.461GluGly: 3.461 ± 0.618
0.865GluHis: 0.865 ± 0.303
4.574GluIle: 4.574 ± 0.739
5.686GluLys: 5.686 ± 0.862
10.878GluLeu: 10.878 ± 1.472
2.101GluMet: 2.101 ± 0.492
2.967GluAsn: 2.967 ± 0.58
0.989GluPro: 0.989 ± 0.345
3.214GluGln: 3.214 ± 0.6
2.719GluArg: 2.719 ± 0.739
4.944GluSer: 4.944 ± 0.758
2.349GluThr: 2.349 ± 0.677
4.697GluVal: 4.697 ± 0.645
1.236GluTrp: 1.236 ± 0.36
2.719GluTyr: 2.719 ± 0.63
0.0GluXaa: 0.0 ± 0.0
Phe
3.09PheAla: 3.09 ± 0.481
0.742PheCys: 0.742 ± 0.269
3.09PheAsp: 3.09 ± 0.636
2.101PheGlu: 2.101 ± 0.456
1.112PhePhe: 1.112 ± 0.31
2.596PheGly: 2.596 ± 0.543
0.247PheHis: 0.247 ± 0.179
2.596PheIle: 2.596 ± 0.401
2.967PheLys: 2.967 ± 0.551
1.731PheLeu: 1.731 ± 0.303
1.112PheMet: 1.112 ± 0.379
1.978PheAsn: 1.978 ± 0.47
1.112PhePro: 1.112 ± 0.404
0.989PheGln: 0.989 ± 0.279
1.36PheArg: 1.36 ± 0.356
3.214PheSer: 3.214 ± 0.504
2.843PheThr: 2.843 ± 0.525
1.236PheVal: 1.236 ± 0.312
0.494PheTrp: 0.494 ± 0.237
1.36PheTyr: 1.36 ± 0.378
0.0PheXaa: 0.0 ± 0.0
Gly
6.304GlyAla: 6.304 ± 1.165
0.618GlyCys: 0.618 ± 0.251
3.585GlyAsp: 3.585 ± 0.457
5.562GlyGlu: 5.562 ± 0.658
2.967GlyPhe: 2.967 ± 0.617
8.282GlyGly: 8.282 ± 2.262
0.618GlyHis: 0.618 ± 0.275
3.214GlyIle: 3.214 ± 0.6
4.821GlyLys: 4.821 ± 0.668
6.675GlyLeu: 6.675 ± 0.926
1.112GlyMet: 1.112 ± 0.314
2.596GlyAsn: 2.596 ± 0.575
0.124GlyPro: 0.124 ± 0.117
3.832GlyGln: 3.832 ± 0.67
2.101GlyArg: 2.101 ± 0.506
4.326GlySer: 4.326 ± 0.89
4.574GlyThr: 4.574 ± 0.833
5.192GlyVal: 5.192 ± 0.938
0.742GlyTrp: 0.742 ± 0.291
3.214GlyTyr: 3.214 ± 0.761
0.0GlyXaa: 0.0 ± 0.0
His
1.36HisAla: 1.36 ± 0.348
0.0HisCys: 0.0 ± 0.0
0.865HisAsp: 0.865 ± 0.327
0.989HisGlu: 0.989 ± 0.34
0.371HisPhe: 0.371 ± 0.17
0.865HisGly: 0.865 ± 0.299
0.0HisHis: 0.0 ± 0.0
0.742HisIle: 0.742 ± 0.316
0.989HisLys: 0.989 ± 0.355
0.989HisLeu: 0.989 ± 0.304
0.247HisMet: 0.247 ± 0.149
0.618HisAsn: 0.618 ± 0.244
0.124HisPro: 0.124 ± 0.132
0.618HisGln: 0.618 ± 0.249
1.236HisArg: 1.236 ± 0.453
0.494HisSer: 0.494 ± 0.223
0.865HisThr: 0.865 ± 0.284
0.494HisVal: 0.494 ± 0.244
0.247HisTrp: 0.247 ± 0.183
0.865HisTyr: 0.865 ± 0.384
0.0HisXaa: 0.0 ± 0.0
Ile
4.45IleAla: 4.45 ± 0.597
0.494IleCys: 0.494 ± 0.212
3.708IleAsp: 3.708 ± 0.555
5.686IleGlu: 5.686 ± 0.833
1.112IlePhe: 1.112 ± 0.388
3.832IleGly: 3.832 ± 0.458
0.989IleHis: 0.989 ± 0.318
3.708IleIle: 3.708 ± 0.756
4.326IleLys: 4.326 ± 0.781
3.461IleLeu: 3.461 ± 0.646
1.483IleMet: 1.483 ± 0.353
4.203IleAsn: 4.203 ± 0.824
2.225IlePro: 2.225 ± 0.463
1.483IleGln: 1.483 ± 0.383
3.956IleArg: 3.956 ± 0.779
3.956IleSer: 3.956 ± 0.947
6.057IleThr: 6.057 ± 0.796
3.461IleVal: 3.461 ± 0.803
0.371IleTrp: 0.371 ± 0.162
2.843IleTyr: 2.843 ± 0.593
0.0IleXaa: 0.0 ± 0.0
Lys
7.046LysAla: 7.046 ± 0.915
0.989LysCys: 0.989 ± 0.339
3.461LysAsp: 3.461 ± 0.697
4.326LysGlu: 4.326 ± 0.742
2.349LysPhe: 2.349 ± 0.575
4.45LysGly: 4.45 ± 0.762
1.112LysHis: 1.112 ± 0.364
3.708LysIle: 3.708 ± 0.555
4.944LysLys: 4.944 ± 0.865
6.18LysLeu: 6.18 ± 0.908
2.101LysMet: 2.101 ± 0.589
3.337LysAsn: 3.337 ± 0.564
3.214LysPro: 3.214 ± 0.715
2.967LysGln: 2.967 ± 0.623
3.337LysArg: 3.337 ± 0.793
4.326LysSer: 4.326 ± 0.618
4.574LysThr: 4.574 ± 0.632
4.326LysVal: 4.326 ± 0.794
1.36LysTrp: 1.36 ± 0.349
1.36LysTyr: 1.36 ± 0.371
0.0LysXaa: 0.0 ± 0.0
Leu
6.799LeuAla: 6.799 ± 1.079
0.865LeuCys: 0.865 ± 0.339
4.821LeuAsp: 4.821 ± 0.738
5.315LeuGlu: 5.315 ± 0.784
2.596LeuPhe: 2.596 ± 0.518
4.944LeuGly: 4.944 ± 0.675
1.236LeuHis: 1.236 ± 0.365
5.933LeuIle: 5.933 ± 0.555
6.922LeuLys: 6.922 ± 1.021
6.18LeuLeu: 6.18 ± 0.871
3.09LeuMet: 3.09 ± 0.707
4.821LeuAsn: 4.821 ± 0.615
4.203LeuPro: 4.203 ± 0.594
2.967LeuGln: 2.967 ± 0.609
2.843LeuArg: 2.843 ± 0.59
7.293LeuSer: 7.293 ± 0.754
4.203LeuThr: 4.203 ± 0.819
4.079LeuVal: 4.079 ± 0.613
0.618LeuTrp: 0.618 ± 0.301
2.719LeuTyr: 2.719 ± 0.508
0.0LeuXaa: 0.0 ± 0.0
Met
2.472MetAla: 2.472 ± 0.504
0.0MetCys: 0.0 ± 0.0
0.618MetAsp: 0.618 ± 0.239
2.101MetGlu: 2.101 ± 0.478
0.618MetPhe: 0.618 ± 0.262
2.349MetGly: 2.349 ± 0.505
0.371MetHis: 0.371 ± 0.271
1.978MetIle: 1.978 ± 0.551
2.101MetLys: 2.101 ± 0.441
2.101MetLeu: 2.101 ± 0.486
0.371MetMet: 0.371 ± 0.169
2.101MetAsn: 2.101 ± 0.445
0.989MetPro: 0.989 ± 0.264
2.472MetGln: 2.472 ± 0.603
1.854MetArg: 1.854 ± 0.405
1.978MetSer: 1.978 ± 0.559
2.101MetThr: 2.101 ± 0.575
0.742MetVal: 0.742 ± 0.304
0.618MetTrp: 0.618 ± 0.293
0.371MetTyr: 0.371 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
4.944AsnAla: 4.944 ± 0.766
0.371AsnCys: 0.371 ± 0.228
4.326AsnAsp: 4.326 ± 0.801
4.203AsnGlu: 4.203 ± 0.721
2.225AsnPhe: 2.225 ± 0.36
4.697AsnGly: 4.697 ± 0.705
0.865AsnHis: 0.865 ± 0.494
3.09AsnIle: 3.09 ± 0.7
3.708AsnLys: 3.708 ± 0.607
3.708AsnLeu: 3.708 ± 0.564
1.36AsnMet: 1.36 ± 0.36
3.214AsnAsn: 3.214 ± 0.813
2.349AsnPro: 2.349 ± 0.521
2.101AsnGln: 2.101 ± 0.653
2.101AsnArg: 2.101 ± 0.379
2.967AsnSer: 2.967 ± 0.487
3.708AsnThr: 3.708 ± 0.568
2.101AsnVal: 2.101 ± 0.581
0.989AsnTrp: 0.989 ± 0.417
1.854AsnTyr: 1.854 ± 0.301
0.0AsnXaa: 0.0 ± 0.0
Pro
2.472ProAla: 2.472 ± 0.574
0.247ProCys: 0.247 ± 0.163
2.349ProAsp: 2.349 ± 0.569
1.731ProGlu: 1.731 ± 0.439
1.731ProPhe: 1.731 ± 0.6
0.742ProGly: 0.742 ± 0.32
0.371ProHis: 0.371 ± 0.166
1.236ProIle: 1.236 ± 0.324
1.978ProLys: 1.978 ± 0.491
3.09ProLeu: 3.09 ± 0.461
1.36ProMet: 1.36 ± 0.385
2.596ProAsn: 2.596 ± 0.722
1.112ProPro: 1.112 ± 0.351
2.101ProGln: 2.101 ± 0.684
1.607ProArg: 1.607 ± 0.55
1.731ProSer: 1.731 ± 0.336
2.472ProThr: 2.472 ± 0.565
2.843ProVal: 2.843 ± 0.572
0.618ProTrp: 0.618 ± 0.247
1.112ProTyr: 1.112 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
5.315GlnAla: 5.315 ± 0.828
0.618GlnCys: 0.618 ± 0.353
1.483GlnAsp: 1.483 ± 0.435
2.349GlnGlu: 2.349 ± 0.573
1.607GlnPhe: 1.607 ± 0.436
1.978GlnGly: 1.978 ± 0.476
1.36GlnHis: 1.36 ± 0.415
3.461GlnIle: 3.461 ± 0.699
1.483GlnLys: 1.483 ± 0.578
3.461GlnLeu: 3.461 ± 0.776
1.36GlnMet: 1.36 ± 0.317
2.719GlnAsn: 2.719 ± 0.932
1.36GlnPro: 1.36 ± 0.344
2.719GlnGln: 2.719 ± 1.305
1.236GlnArg: 1.236 ± 0.354
3.832GlnSer: 3.832 ± 0.912
2.225GlnThr: 2.225 ± 0.467
2.719GlnVal: 2.719 ± 0.523
0.0GlnTrp: 0.0 ± 0.0
2.101GlnTyr: 2.101 ± 0.567
0.0GlnXaa: 0.0 ± 0.0
Arg
3.585ArgAla: 3.585 ± 0.626
0.742ArgCys: 0.742 ± 0.282
2.843ArgAsp: 2.843 ± 0.435
3.585ArgGlu: 3.585 ± 1.08
2.225ArgPhe: 2.225 ± 0.482
2.101ArgGly: 2.101 ± 0.562
0.247ArgHis: 0.247 ± 0.156
2.349ArgIle: 2.349 ± 0.512
2.843ArgLys: 2.843 ± 0.584
3.214ArgLeu: 3.214 ± 0.671
1.112ArgMet: 1.112 ± 0.316
1.978ArgAsn: 1.978 ± 0.444
1.36ArgPro: 1.36 ± 0.594
2.719ArgGln: 2.719 ± 0.622
1.978ArgArg: 1.978 ± 0.534
2.472ArgSer: 2.472 ± 0.449
1.731ArgThr: 1.731 ± 0.457
3.337ArgVal: 3.337 ± 0.819
0.742ArgTrp: 0.742 ± 0.295
1.36ArgTyr: 1.36 ± 0.36
0.0ArgXaa: 0.0 ± 0.0
Ser
4.821SerAla: 4.821 ± 1.184
0.865SerCys: 0.865 ± 0.335
6.18SerAsp: 6.18 ± 0.765
4.079SerGlu: 4.079 ± 0.769
3.09SerPhe: 3.09 ± 0.571
4.821SerGly: 4.821 ± 0.719
1.112SerHis: 1.112 ± 0.393
4.079SerIle: 4.079 ± 0.845
5.439SerLys: 5.439 ± 0.881
4.944SerLeu: 4.944 ± 0.711
1.607SerMet: 1.607 ± 0.417
3.337SerAsn: 3.337 ± 0.585
2.843SerPro: 2.843 ± 0.828
3.461SerGln: 3.461 ± 0.648
3.461SerArg: 3.461 ± 0.487
4.574SerSer: 4.574 ± 0.796
2.596SerThr: 2.596 ± 0.479
3.461SerVal: 3.461 ± 0.516
0.618SerTrp: 0.618 ± 0.27
2.225SerTyr: 2.225 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
5.81ThrAla: 5.81 ± 0.878
0.618ThrCys: 0.618 ± 0.255
2.843ThrAsp: 2.843 ± 0.548
4.079ThrGlu: 4.079 ± 0.662
2.719ThrPhe: 2.719 ± 0.523
6.18ThrGly: 6.18 ± 0.885
0.742ThrHis: 0.742 ± 0.388
3.337ThrIle: 3.337 ± 0.606
4.326ThrLys: 4.326 ± 0.612
4.079ThrLeu: 4.079 ± 0.886
1.112ThrMet: 1.112 ± 0.404
3.214ThrAsn: 3.214 ± 0.71
2.349ThrPro: 2.349 ± 0.584
3.09ThrGln: 3.09 ± 0.579
1.731ThrArg: 1.731 ± 0.399
3.337ThrSer: 3.337 ± 0.479
3.832ThrThr: 3.832 ± 0.661
5.315ThrVal: 5.315 ± 0.944
0.989ThrTrp: 0.989 ± 0.323
2.843ThrTyr: 2.843 ± 0.558
0.0ThrXaa: 0.0 ± 0.0
Val
5.068ValAla: 5.068 ± 1.124
1.236ValCys: 1.236 ± 0.491
3.832ValAsp: 3.832 ± 0.704
3.832ValGlu: 3.832 ± 0.504
1.978ValPhe: 1.978 ± 0.485
3.461ValGly: 3.461 ± 0.737
0.742ValHis: 0.742 ± 0.316
3.585ValIle: 3.585 ± 0.559
5.068ValLys: 5.068 ± 0.756
3.956ValLeu: 3.956 ± 0.761
1.854ValMet: 1.854 ± 0.497
2.967ValAsn: 2.967 ± 0.556
1.978ValPro: 1.978 ± 0.749
1.483ValGln: 1.483 ± 0.342
2.596ValArg: 2.596 ± 0.393
4.203ValSer: 4.203 ± 0.719
4.944ValThr: 4.944 ± 0.96
3.337ValVal: 3.337 ± 0.667
0.618ValTrp: 0.618 ± 0.314
1.483ValTyr: 1.483 ± 0.414
0.0ValXaa: 0.0 ± 0.0
Trp
0.742TrpAla: 0.742 ± 0.315
0.0TrpCys: 0.0 ± 0.0
1.112TrpAsp: 1.112 ± 0.383
0.865TrpGlu: 0.865 ± 0.364
0.618TrpPhe: 0.618 ± 0.256
0.865TrpGly: 0.865 ± 0.31
0.494TrpHis: 0.494 ± 0.221
1.112TrpIle: 1.112 ± 0.32
0.494TrpLys: 0.494 ± 0.283
0.989TrpLeu: 0.989 ± 0.268
0.124TrpMet: 0.124 ± 0.123
0.371TrpAsn: 0.371 ± 0.192
0.865TrpPro: 0.865 ± 0.316
0.494TrpGln: 0.494 ± 0.307
0.989TrpArg: 0.989 ± 0.268
0.742TrpSer: 0.742 ± 0.296
0.989TrpThr: 0.989 ± 0.295
0.742TrpVal: 0.742 ± 0.319
0.247TrpTrp: 0.247 ± 0.167
0.494TrpTyr: 0.494 ± 0.22
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.214TyrAla: 3.214 ± 0.558
0.618TyrCys: 0.618 ± 0.25
1.854TyrAsp: 1.854 ± 0.474
3.214TyrGlu: 3.214 ± 0.567
0.865TyrPhe: 0.865 ± 0.359
3.585TyrGly: 3.585 ± 0.582
0.124TyrHis: 0.124 ± 0.129
1.978TyrIle: 1.978 ± 0.492
2.101TyrLys: 2.101 ± 0.511
2.349TyrLeu: 2.349 ± 0.497
1.36TyrMet: 1.36 ± 0.385
2.101TyrAsn: 2.101 ± 0.443
1.112TyrPro: 1.112 ± 0.421
1.854TyrGln: 1.854 ± 0.511
1.236TyrArg: 1.236 ± 0.355
2.843TyrSer: 2.843 ± 0.583
2.719TyrThr: 2.719 ± 0.519
1.731TyrVal: 1.731 ± 0.326
0.371TyrTrp: 0.371 ± 0.182
1.236TyrTyr: 1.236 ± 0.457
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (8091 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
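
One way to picture protein-level bootstrapping is the Python sketch below. The source does not specify the exact estimator, so this assumes the reported error is the standard deviation across 100 replicates, each built by resampling whole proteins with replacement. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def single_dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counted within proteins only."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(single_dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(values)


# Toy proteome (hypothetical sequences)
toy = ["AAAC", "CA", "GGAA", "ACAC"]
error_aa = bootstrap_error(toy, "AA")
error_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has frequency 0 in every replicate, so its error is 0 as well, which is consistent with the "0.0 ± 0.0" entries in the table.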

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski