Amino acid dipepetide frequency for Puma lentivirus 14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.358AlaAla: 3.358 ± 2.179
1.119AlaCys: 1.119 ± 1.429
1.866AlaAsp: 1.866 ± 1.311
4.104AlaGlu: 4.104 ± 0.888
1.866AlaPhe: 1.866 ± 0.821
3.731AlaGly: 3.731 ± 0.122
0.373AlaHis: 0.373 ± 0.284
4.478AlaIle: 4.478 ± 1.073
2.239AlaLys: 2.239 ± 0.705
7.09AlaLeu: 7.09 ± 1.114
1.119AlaMet: 1.119 ± 0.39
2.239AlaAsn: 2.239 ± 0.633
2.612AlaPro: 2.612 ± 1.176
2.612AlaGln: 2.612 ± 0.756
1.866AlaArg: 1.866 ± 1.243
1.866AlaSer: 1.866 ± 1.311
1.866AlaThr: 1.866 ± 0.81
3.358AlaVal: 3.358 ± 0.62
0.746AlaTrp: 0.746 ± 0.567
2.985AlaTyr: 2.985 ± 0.837
0.0AlaXaa: 0.0 ± 0.0
Cys
0.373CysAla: 0.373 ± 0.476
0.373CysCys: 0.373 ± 0.589
0.373CysAsp: 0.373 ± 0.262
0.0CysGlu: 0.0 ± 0.0
1.493CysPhe: 1.493 ± 0.467
1.119CysGly: 1.119 ± 1.039
0.0CysHis: 0.0 ± 0.0
2.612CysIle: 2.612 ± 0.547
2.239CysLys: 2.239 ± 1.615
1.493CysLeu: 1.493 ± 0.721
0.746CysMet: 0.746 ± 0.59
1.493CysAsn: 1.493 ± 0.467
1.119CysPro: 1.119 ± 0.484
1.866CysGln: 1.866 ± 0.81
1.119CysArg: 1.119 ± 0.939
1.119CysSer: 1.119 ± 0.447
2.239CysThr: 2.239 ± 0.64
0.746CysVal: 0.746 ± 0.59
0.746CysTrp: 0.746 ± 1.177
0.746CysTyr: 0.746 ± 0.504
0.0CysXaa: 0.0 ± 0.0
Asp
1.866AspAla: 1.866 ± 0.721
1.493AspCys: 1.493 ± 1.111
1.866AspAsp: 1.866 ± 0.366
1.493AspGlu: 1.493 ± 1.111
2.239AspPhe: 2.239 ± 0.429
2.612AspGly: 2.612 ± 0.629
0.746AspHis: 0.746 ± 0.399
4.478AspIle: 4.478 ± 0.728
5.224AspLys: 5.224 ± 1.677
4.851AspLeu: 4.851 ± 1.381
1.493AspMet: 1.493 ± 0.467
3.358AspAsn: 3.358 ± 1.204
3.358AspPro: 3.358 ± 0.519
1.866AspGln: 1.866 ± 0.702
1.866AspArg: 1.866 ± 0.48
2.985AspSer: 2.985 ± 0.856
1.119AspThr: 1.119 ± 0.851
1.866AspVal: 1.866 ± 0.884
3.358AspTrp: 3.358 ± 1.803
1.866AspTyr: 1.866 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
3.731GluAla: 3.731 ± 1.682
1.493GluCys: 1.493 ± 1.134
4.478GluAsp: 4.478 ± 1.322
5.597GluGlu: 5.597 ± 1.816
1.493GluPhe: 1.493 ± 1.134
6.716GluGly: 6.716 ± 1.046
0.746GluHis: 0.746 ± 0.267
4.851GluIle: 4.851 ± 1.252
4.851GluLys: 4.851 ± 1.074
4.478GluLeu: 4.478 ± 1.316
1.119GluMet: 1.119 ± 0.329
3.731GluAsn: 3.731 ± 0.705
3.358GluPro: 3.358 ± 0.769
2.612GluGln: 2.612 ± 0.824
1.866GluArg: 1.866 ± 0.729
3.358GluSer: 3.358 ± 1.116
3.358GluThr: 3.358 ± 0.769
5.97GluVal: 5.97 ± 1.71
0.373GluTrp: 0.373 ± 0.589
1.119GluTyr: 1.119 ± 1.149
0.0GluXaa: 0.0 ± 0.0
Phe
0.373PheAla: 0.373 ± 0.262
0.746PheCys: 0.746 ± 0.59
0.746PheAsp: 0.746 ± 0.525
1.119PheGlu: 1.119 ± 0.447
0.746PhePhe: 0.746 ± 0.567
2.985PheGly: 2.985 ± 0.633
0.746PheHis: 0.746 ± 0.525
2.985PheIle: 2.985 ± 0.369
1.119PheLys: 1.119 ± 0.484
4.478PheLeu: 4.478 ± 0.857
0.373PheMet: 0.373 ± 0.462
2.612PheAsn: 2.612 ± 1.09
0.373PhePro: 0.373 ± 0.284
2.612PheGln: 2.612 ± 1.118
1.119PheArg: 1.119 ± 0.329
3.358PheSer: 3.358 ± 1.006
3.731PheThr: 3.731 ± 1.398
1.866PheVal: 1.866 ± 1.06
0.746PheTrp: 0.746 ± 0.267
1.119PheTyr: 1.119 ± 0.851
0.0PheXaa: 0.0 ± 0.0
Gly
2.612GlyAla: 2.612 ± 0.944
0.373GlyCys: 0.373 ± 0.476
2.239GlyAsp: 2.239 ± 0.687
5.597GlyGlu: 5.597 ± 1.014
2.239GlyPhe: 2.239 ± 0.705
7.463GlyGly: 7.463 ± 2.719
1.866GlyHis: 1.866 ± 1.224
6.343GlyIle: 6.343 ± 1.506
5.597GlyLys: 5.597 ± 1.15
5.597GlyLeu: 5.597 ± 0.51
1.119GlyMet: 1.119 ± 0.851
4.104GlyAsn: 4.104 ± 1.882
5.224GlyPro: 5.224 ± 1.604
1.493GlyGln: 1.493 ± 0.661
5.597GlyArg: 5.597 ± 2.502
3.731GlySer: 3.731 ± 1.285
3.731GlyThr: 3.731 ± 1.204
3.358GlyVal: 3.358 ± 0.768
1.119GlyTrp: 1.119 ± 1.149
2.239GlyTyr: 2.239 ± 0.802
0.0GlyXaa: 0.0 ± 0.0
His
0.746HisAla: 0.746 ± 0.525
0.373HisCys: 0.373 ± 0.284
0.373HisAsp: 0.373 ± 0.262
1.119HisGlu: 1.119 ± 0.329
0.746HisPhe: 0.746 ± 0.525
0.373HisGly: 0.373 ± 0.262
0.373HisHis: 0.373 ± 0.284
0.746HisIle: 0.746 ± 0.567
1.493HisLys: 1.493 ± 1.05
3.358HisLeu: 3.358 ± 2.116
0.0HisMet: 0.0 ± 0.247
1.119HisAsn: 1.119 ± 0.851
1.119HisPro: 1.119 ± 0.329
1.119HisGln: 1.119 ± 0.329
0.373HisArg: 0.373 ± 0.284
1.119HisSer: 1.119 ± 0.51
1.119HisThr: 1.119 ± 1.134
0.373HisVal: 0.373 ± 0.476
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.612IleAla: 2.612 ± 0.986
2.612IleCys: 2.612 ± 0.944
5.597IleAsp: 5.597 ± 1.26
4.104IleGlu: 4.104 ± 1.975
2.239IlePhe: 2.239 ± 0.802
4.851IleGly: 4.851 ± 0.416
1.866IleHis: 1.866 ± 0.479
5.97IleIle: 5.97 ± 1.657
9.328IleLys: 9.328 ± 0.475
2.985IleLeu: 2.985 ± 0.834
2.612IleMet: 2.612 ± 0.546
4.104IleAsn: 4.104 ± 1.51
5.597IlePro: 5.597 ± 1.924
2.985IleGln: 2.985 ± 1.045
3.358IleArg: 3.358 ± 2.765
2.985IleSer: 2.985 ± 0.426
2.985IleThr: 2.985 ± 0.641
5.224IleVal: 5.224 ± 1.526
1.119IleTrp: 1.119 ± 0.479
2.239IleTyr: 2.239 ± 0.895
0.0IleXaa: 0.0 ± 0.0
Lys
5.597LysAla: 5.597 ± 1.489
1.493LysCys: 1.493 ± 0.899
4.478LysAsp: 4.478 ± 0.762
6.343LysGlu: 6.343 ± 1.175
3.358LysPhe: 3.358 ± 0.349
5.597LysGly: 5.597 ± 2.236
0.746LysHis: 0.746 ± 0.59
7.463LysIle: 7.463 ± 2.564
7.09LysLys: 7.09 ± 0.805
8.582LysLeu: 8.582 ± 0.964
1.119LysMet: 1.119 ± 0.329
1.866LysAsn: 1.866 ± 0.48
1.866LysPro: 1.866 ± 0.81
4.104LysGln: 4.104 ± 0.71
2.612LysArg: 2.612 ± 0.76
1.866LysSer: 1.866 ± 0.562
1.866LysThr: 1.866 ± 0.699
2.239LysVal: 2.239 ± 0.429
2.612LysTrp: 2.612 ± 1.045
3.731LysTyr: 3.731 ± 0.734
0.0LysXaa: 0.0 ± 0.0
Leu
4.104LeuAla: 4.104 ± 1.213
1.493LeuCys: 1.493 ± 0.577
7.463LeuAsp: 7.463 ± 2.249
6.343LeuGlu: 6.343 ± 0.683
2.612LeuPhe: 2.612 ± 1.638
7.836LeuGly: 7.836 ± 0.905
1.493LeuHis: 1.493 ± 0.894
5.597LeuIle: 5.597 ± 1.924
8.582LeuLys: 8.582 ± 1.244
6.343LeuLeu: 6.343 ± 1.349
1.493LeuMet: 1.493 ± 0.661
5.224LeuAsn: 5.224 ± 1.181
3.358LeuPro: 3.358 ± 0.768
7.836LeuGln: 7.836 ± 1.781
5.224LeuArg: 5.224 ± 1.098
4.104LeuSer: 4.104 ± 1.281
6.716LeuThr: 6.716 ± 0.832
4.478LeuVal: 4.478 ± 0.77
2.985LeuTrp: 2.985 ± 0.947
2.239LeuTyr: 2.239 ± 0.669
0.0LeuXaa: 0.0 ± 0.0
Met
2.239MetAla: 2.239 ± 0.442
0.746MetCys: 0.746 ± 0.567
1.866MetAsp: 1.866 ± 0.721
1.493MetGlu: 1.493 ± 1.395
0.0MetPhe: 0.0 ± 0.0
2.239MetGly: 2.239 ± 1.124
0.746MetHis: 0.746 ± 0.58
1.866MetIle: 1.866 ± 0.689
1.493MetLys: 1.493 ± 0.683
4.104MetLeu: 4.104 ± 0.71
0.746MetMet: 0.746 ± 0.267
1.493MetAsn: 1.493 ± 0.683
1.119MetPro: 1.119 ± 0.698
1.866MetGln: 1.866 ± 1.311
1.119MetArg: 1.119 ± 0.484
0.373MetSer: 0.373 ± 0.262
0.373MetThr: 0.373 ± 0.476
0.746MetVal: 0.746 ± 0.567
0.746MetTrp: 0.746 ± 0.697
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.612AsnAla: 2.612 ± 0.59
2.612AsnCys: 2.612 ± 1.449
0.746AsnAsp: 0.746 ± 0.504
3.358AsnGlu: 3.358 ± 1.768
3.358AsnPhe: 3.358 ± 1.367
2.612AsnGly: 2.612 ± 1.059
1.119AsnHis: 1.119 ± 0.484
3.358AsnIle: 3.358 ± 1.342
4.104AsnLys: 4.104 ± 1.437
4.104AsnLeu: 4.104 ± 0.751
1.493AsnMet: 1.493 ± 0.534
3.358AsnAsn: 3.358 ± 0.769
2.985AsnPro: 2.985 ± 0.793
3.731AsnGln: 3.731 ± 0.949
2.612AsnArg: 2.612 ± 1.157
1.119AsnSer: 1.119 ± 0.851
2.612AsnThr: 2.612 ± 0.538
3.358AsnVal: 3.358 ± 0.769
1.493AsnTrp: 1.493 ± 0.577
1.866AsnTyr: 1.866 ± 1.128
0.0AsnXaa: 0.0 ± 0.0
Pro
2.239ProAla: 2.239 ± 1.678
1.493ProCys: 1.493 ± 0.467
1.119ProAsp: 1.119 ± 0.479
4.478ProGlu: 4.478 ± 0.889
1.493ProPhe: 1.493 ± 1.134
3.358ProGly: 3.358 ± 0.786
1.493ProHis: 1.493 ± 0.534
3.731ProIle: 3.731 ± 0.883
3.358ProLys: 3.358 ± 0.62
5.97ProLeu: 5.97 ± 1.065
1.866ProMet: 1.866 ± 1.137
2.985ProAsn: 2.985 ± 1.747
2.985ProPro: 2.985 ± 0.29
1.866ProGln: 1.866 ± 0.841
2.239ProArg: 2.239 ± 1.124
1.493ProSer: 1.493 ± 1.111
1.493ProThr: 1.493 ± 0.317
2.239ProVal: 2.239 ± 0.57
2.239ProTrp: 2.239 ± 1.02
2.239ProTyr: 2.239 ± 0.878
0.0ProXaa: 0.0 ± 0.0
Gln
4.104GlnAla: 4.104 ± 0.73
1.493GlnCys: 1.493 ± 0.721
3.731GlnAsp: 3.731 ± 1.74
6.716GlnGlu: 6.716 ± 1.639
1.493GlnPhe: 1.493 ± 0.467
3.731GlnGly: 3.731 ± 0.959
0.373GlnHis: 0.373 ± 0.476
2.985GlnIle: 2.985 ± 0.993
3.731GlnLys: 3.731 ± 0.959
4.851GlnLeu: 4.851 ± 1.829
1.493GlnMet: 1.493 ± 0.982
3.358GlnAsn: 3.358 ± 1.442
2.239GlnPro: 2.239 ± 0.658
4.478GlnGln: 4.478 ± 0.19
2.239GlnArg: 2.239 ± 0.275
2.239GlnSer: 2.239 ± 0.669
2.985GlnThr: 2.985 ± 0.758
5.224GlnVal: 5.224 ± 1.513
1.119GlnTrp: 1.119 ± 0.697
2.612GlnTyr: 2.612 ± 0.76
0.0GlnXaa: 0.0 ± 0.0
Arg
2.612ArgAla: 2.612 ± 1.059
0.746ArgCys: 0.746 ± 0.267
2.612ArgAsp: 2.612 ± 0.538
3.731ArgGlu: 3.731 ± 1.124
1.866ArgPhe: 1.866 ± 1.022
5.97ArgGly: 5.97 ± 4.718
0.373ArgHis: 0.373 ± 0.262
3.358ArgIle: 3.358 ± 1.342
2.985ArgLys: 2.985 ± 0.856
4.104ArgLeu: 4.104 ± 0.244
1.866ArgMet: 1.866 ± 1.085
1.493ArgAsn: 1.493 ± 1.134
2.612ArgPro: 2.612 ± 1.752
4.851ArgGln: 4.851 ± 1.308
4.104ArgArg: 4.104 ± 0.808
1.119ArgSer: 1.119 ± 0.665
1.493ArgThr: 1.493 ± 1.111
1.493ArgVal: 1.493 ± 0.577
0.373ArgTrp: 0.373 ± 0.284
1.493ArgTyr: 1.493 ± 0.467
0.0ArgXaa: 0.0 ± 0.0
Ser
1.493SerAla: 1.493 ± 0.774
0.373SerCys: 0.373 ± 0.284
1.866SerAsp: 1.866 ± 0.364
1.493SerGlu: 1.493 ± 0.661
0.746SerPhe: 0.746 ± 0.697
2.239SerGly: 2.239 ± 0.429
0.0SerHis: 0.0 ± 0.0
3.731SerIle: 3.731 ± 0.935
1.866SerLys: 1.866 ± 1.224
6.716SerLeu: 6.716 ± 1.74
1.119SerMet: 1.119 ± 0.484
3.358SerAsn: 3.358 ± 1.006
0.746SerPro: 0.746 ± 0.59
2.985SerGln: 2.985 ± 0.624
1.866SerArg: 1.866 ± 1.728
1.119SerSer: 1.119 ± 0.787
2.239SerThr: 2.239 ± 0.669
0.746SerVal: 0.746 ± 0.567
1.866SerTrp: 1.866 ± 0.562
2.612SerTyr: 2.612 ± 0.95
0.0SerXaa: 0.0 ± 0.0
Thr
5.597ThrAla: 5.597 ± 3.481
1.119ThrCys: 1.119 ± 0.484
3.358ThrAsp: 3.358 ± 0.795
4.104ThrGlu: 4.104 ± 1.386
1.119ThrPhe: 1.119 ± 0.698
2.612ThrGly: 2.612 ± 0.244
0.746ThrHis: 0.746 ± 0.399
2.985ThrIle: 2.985 ± 1.495
1.866ThrLys: 1.866 ± 0.729
7.463ThrLeu: 7.463 ± 0.997
1.119ThrMet: 1.119 ± 0.329
1.493ThrAsn: 1.493 ± 0.566
3.731ThrPro: 3.731 ± 1.124
2.985ThrGln: 2.985 ± 0.641
1.493ThrArg: 1.493 ± 0.577
2.239ThrSer: 2.239 ± 0.925
2.239ThrThr: 2.239 ± 1.3
1.866ThrVal: 1.866 ± 0.81
1.493ThrTrp: 1.493 ± 0.534
1.119ThrTyr: 1.119 ± 0.698
0.0ThrXaa: 0.0 ± 0.0
Val
2.239ValAla: 2.239 ± 0.633
0.746ValCys: 0.746 ± 0.267
1.866ValAsp: 1.866 ± 0.933
1.866ValGlu: 1.866 ± 0.48
1.119ValPhe: 1.119 ± 0.484
2.612ValGly: 2.612 ± 0.547
1.866ValHis: 1.866 ± 0.48
3.731ValIle: 3.731 ± 2.13
2.985ValLys: 2.985 ± 1.039
4.851ValLeu: 4.851 ± 0.915
1.493ValMet: 1.493 ± 0.748
1.866ValAsn: 1.866 ± 0.699
2.239ValPro: 2.239 ± 0.705
3.731ValGln: 3.731 ± 1.348
3.731ValArg: 3.731 ± 0.883
1.493ValSer: 1.493 ± 0.467
5.597ValThr: 5.597 ± 0.648
3.358ValVal: 3.358 ± 1.074
1.866ValTrp: 1.866 ± 0.562
4.104ValTyr: 4.104 ± 1.437
0.0ValXaa: 0.0 ± 0.0
Trp
1.119TrpAla: 1.119 ± 0.329
0.373TrpCys: 0.373 ± 0.262
1.493TrpAsp: 1.493 ± 1.16
0.746TrpGlu: 0.746 ± 0.267
2.239TrpPhe: 2.239 ± 0.934
2.239TrpGly: 2.239 ± 0.802
0.746TrpHis: 0.746 ± 1.177
1.866TrpIle: 1.866 ± 0.753
1.493TrpLys: 1.493 ± 0.577
1.119TrpLeu: 1.119 ± 1.766
1.866TrpMet: 1.866 ± 0.702
1.119TrpAsn: 1.119 ± 1.149
1.493TrpPro: 1.493 ± 1.068
1.493TrpGln: 1.493 ± 0.566
2.612TrpArg: 2.612 ± 1.567
0.373TrpSer: 0.373 ± 0.589
1.119TrpThr: 1.119 ± 0.787
2.985TrpVal: 2.985 ± 0.868
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.866TyrAla: 1.866 ± 0.882
0.746TyrCys: 0.746 ± 0.59
1.866TyrAsp: 1.866 ± 1.06
1.493TyrGlu: 1.493 ± 0.317
1.493TyrPhe: 1.493 ± 0.661
0.746TyrGly: 0.746 ± 0.399
0.0TyrHis: 0.0 ± 0.0
2.239TyrIle: 2.239 ± 1.315
2.985TyrLys: 2.985 ± 0.983
2.985TyrLeu: 2.985 ± 0.641
0.746TyrMet: 0.746 ± 0.59
2.239TyrAsn: 2.239 ± 1.3
2.239TyrPro: 2.239 ± 0.669
4.104TyrGln: 4.104 ± 1.158
2.239TyrArg: 2.239 ± 0.687
1.119TyrSer: 1.119 ± 0.484
1.866TyrThr: 1.866 ± 0.364
1.866TyrVal: 1.866 ± 1.312
1.493TyrTrp: 1.493 ± 0.566
1.119TyrTyr: 1.119 ± 0.787
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2681 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski