Amino acid dipeptide frequency for Ntepes virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
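As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs within each protein and normalizes them to per mille. The function name, the toy sequences, the one-letter codes, and the choice to normalize by the total number of dipeptides are all assumptions made for the example, not a description of how this page was actually generated.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter codes (not the Ntepes virus proteins).
toy_proteome = ["MAAL", "ALS"]
freqs = dipeptide_per_mille(toy_proteome)
```

Here the pairs are MA, AA, AL from the first sequence and AL, LS from the second, so AL accounts for two of the five dipeptides.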

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
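The expectation above is simple arithmetic and can be checked in a few lines. The sketch below assumes that "more than expected" means a plain comparison against the uniform value for the 20 standard amino acids; the exact rule used for the colouring is not stated on this page, so the classify helper is hypothetical. The three sample values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"LeuSer": 10.899, "AlaAla": 4.706, "TrpTrp": 0.248}
labels = {name: classify(value) for name, value in examples.items()}
```

A comparison that also takes the bootstrap error into account would be a reasonable refinement, but it is not something this page describes.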

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
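For instance, the unit conversion can be written as follows; the helper names are illustrative only, and the sample value is the LeuSer entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_ser_fraction = per_mille_to_fraction(10.899)
leu_ser_percent = per_mille_to_percent(10.899)
```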

For more information see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.706AlaAla: 4.706 ± 1.821
1.734AlaCys: 1.734 ± 0.292
1.734AlaAsp: 1.734 ± 0.264
5.697AlaGlu: 5.697 ± 1.156
2.725AlaPhe: 2.725 ± 0.543
2.973AlaGly: 2.973 ± 0.403
2.229AlaHis: 2.229 ± 0.337
4.211AlaIle: 4.211 ± 1.24
3.716AlaLys: 3.716 ± 0.605
6.193AlaLeu: 6.193 ± 0.846
2.229AlaMet: 2.229 ± 0.258
0.743AlaAsn: 0.743 ± 0.58
1.486AlaPro: 1.486 ± 0.34
2.229AlaGln: 2.229 ± 0.567
1.982AlaArg: 1.982 ± 0.908
5.697AlaSer: 5.697 ± 0.826
3.22AlaThr: 3.22 ± 0.67
3.716AlaVal: 3.716 ± 0.779
0.248AlaTrp: 0.248 ± 0.383
1.486AlaTyr: 1.486 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
0.743CysAla: 0.743 ± 0.368
0.495CysCys: 0.495 ± 0.305
0.743CysAsp: 0.743 ± 0.17
0.991CysGlu: 0.991 ± 0.611
1.982CysPhe: 1.982 ± 0.219
0.743CysGly: 0.743 ± 0.17
0.743CysHis: 0.743 ± 0.58
0.743CysIle: 0.743 ± 0.267
2.229CysLys: 2.229 ± 0.8
3.716CysLeu: 3.716 ± 1.305
0.248CysMet: 0.248 ± 0.153
1.734CysAsn: 1.734 ± 0.444
0.743CysPro: 0.743 ± 0.58
0.743CysGln: 0.743 ± 0.267
0.743CysArg: 0.743 ± 0.17
3.22CysSer: 3.22 ± 0.43
2.477CysThr: 2.477 ± 0.706
1.734CysVal: 1.734 ± 1.21
0.0CysTrp: 0.0 ± 0.0
0.495CysTyr: 0.495 ± 0.099
0.0CysXaa: 0.0 ± 0.0
Asp
2.725AspAla: 2.725 ± 1.008
0.991AspCys: 0.991 ± 0.198
4.459AspAsp: 4.459 ± 1.019
3.468AspGlu: 3.468 ± 0.278
1.734AspPhe: 1.734 ± 0.621
3.468AspGly: 3.468 ± 0.368
1.486AspHis: 1.486 ± 0.712
2.973AspIle: 2.973 ± 0.679
1.486AspLys: 1.486 ± 0.605
4.706AspLeu: 4.706 ± 0.614
1.486AspMet: 1.486 ± 0.34
1.734AspAsn: 1.734 ± 0.314
3.22AspPro: 3.22 ± 0.697
1.486AspGln: 1.486 ± 0.74
2.973AspArg: 2.973 ± 0.697
3.468AspSer: 3.468 ± 1.366
2.973AspThr: 2.973 ± 0.339
4.954AspVal: 4.954 ± 0.642
0.991AspTrp: 0.991 ± 0.198
1.486AspTyr: 1.486 ± 0.829
0.0AspXaa: 0.0 ± 0.0
Glu
5.945GluAla: 5.945 ± 0.851
1.982GluCys: 1.982 ± 0.396
2.725GluAsp: 2.725 ± 1.059
8.422GluGlu: 8.422 ± 0.794
4.211GluPhe: 4.211 ± 1.071
4.706GluGly: 4.706 ± 0.355
0.991GluHis: 0.991 ± 0.567
7.184GluIle: 7.184 ± 1.774
4.211GluLys: 4.211 ± 0.651
4.954GluLeu: 4.954 ± 0.479
1.734GluMet: 1.734 ± 0.243
2.973GluAsn: 2.973 ± 0.594
1.239GluPro: 1.239 ± 0.326
1.239GluGln: 1.239 ± 0.353
3.22GluArg: 3.22 ± 0.809
4.954GluSer: 4.954 ± 0.672
4.706GluThr: 4.706 ± 0.604
4.459GluVal: 4.459 ± 0.461
0.743GluTrp: 0.743 ± 0.267
1.734GluTyr: 1.734 ± 0.409
0.0GluXaa: 0.0 ± 0.0
Phe
4.211PheAla: 4.211 ± 1.071
1.486PheCys: 1.486 ± 0.34
3.963PheAsp: 3.963 ± 1.516
1.982PheGlu: 1.982 ± 0.248
1.486PhePhe: 1.486 ± 0.605
3.22PheGly: 3.22 ± 1.05
0.248PheHis: 0.248 ± 0.153
1.239PheIle: 1.239 ± 0.326
2.973PheLys: 2.973 ± 0.679
4.459PheLeu: 4.459 ± 0.739
1.982PheMet: 1.982 ± 0.219
1.982PheAsn: 1.982 ± 0.396
2.229PhePro: 2.229 ± 0.803
0.248PheGln: 0.248 ± 0.153
2.477PheArg: 2.477 ± 1.05
3.22PheSer: 3.22 ± 0.43
2.477PheThr: 2.477 ± 0.64
3.22PheVal: 3.22 ± 0.75
0.248PheTrp: 0.248 ± 0.153
0.495PheTyr: 0.495 ± 0.371
0.0PheXaa: 0.0 ± 0.0
Gly
3.468GlyAla: 3.468 ± 0.73
0.743GlyCys: 0.743 ± 0.267
2.725GlyAsp: 2.725 ± 0.562
4.954GlyGlu: 4.954 ± 0.487
4.211GlyPhe: 4.211 ± 0.626
4.459GlyGly: 4.459 ± 0.634
0.991GlyHis: 0.991 ± 0.307
2.725GlyIle: 2.725 ± 0.288
3.468GlyLys: 3.468 ± 0.402
5.45GlyLeu: 5.45 ± 0.313
1.239GlyMet: 1.239 ± 0.379
2.973GlyAsn: 2.973 ± 0.898
2.229GlyPro: 2.229 ± 0.404
1.982GlyGln: 1.982 ± 0.219
2.973GlyArg: 2.973 ± 0.561
7.679GlySer: 7.679 ± 1.424
1.734GlyThr: 1.734 ± 0.444
4.459GlyVal: 4.459 ± 0.655
1.486GlyTrp: 1.486 ± 0.439
1.239GlyTyr: 1.239 ± 0.353
0.0GlyXaa: 0.0 ± 0.0
His
0.991HisAla: 0.991 ± 0.774
0.248HisCys: 0.248 ± 0.153
0.991HisAsp: 0.991 ± 0.774
0.743HisGlu: 0.743 ± 0.267
1.734HisPhe: 1.734 ± 0.621
1.734HisGly: 1.734 ± 0.473
0.248HisHis: 0.248 ± 0.193
1.486HisIle: 1.486 ± 0.605
1.239HisLys: 1.239 ± 0.326
1.486HisLeu: 1.486 ± 0.533
0.743HisMet: 0.743 ± 0.578
1.239HisAsn: 1.239 ± 0.302
1.982HisPro: 1.982 ± 0.806
0.743HisGln: 0.743 ± 0.368
1.982HisArg: 1.982 ± 0.432
1.486HisSer: 1.486 ± 0.247
1.982HisThr: 1.982 ± 0.219
1.239HisVal: 1.239 ± 0.232
0.0HisTrp: 0.0 ± 0.0
1.486HisTyr: 1.486 ± 0.533
0.0HisXaa: 0.0 ± 0.0
Ile
2.477IleAla: 2.477 ± 0.609
0.743IleCys: 0.743 ± 0.267
4.459IleAsp: 4.459 ± 0.842
5.45IleGlu: 5.45 ± 0.73
1.486IlePhe: 1.486 ± 0.916
3.963IleGly: 3.963 ± 1.235
0.991IleHis: 0.991 ± 0.198
5.945IleIle: 5.945 ± 0.881
5.202IleLys: 5.202 ± 1.601
5.45IleLeu: 5.45 ± 0.953
2.477IleMet: 2.477 ± 0.465
1.982IleAsn: 1.982 ± 0.643
2.973IlePro: 2.973 ± 0.922
1.486IleGln: 1.486 ± 0.34
5.697IleArg: 5.697 ± 0.655
5.697IleSer: 5.697 ± 0.333
3.716IleThr: 3.716 ± 1.059
4.211IleVal: 4.211 ± 0.372
0.495IleTrp: 0.495 ± 0.305
1.734IleTyr: 1.734 ± 0.509
0.0IleXaa: 0.0 ± 0.0
Lys
3.963LysAla: 3.963 ± 0.941
1.982LysCys: 1.982 ± 0.618
2.725LysAsp: 2.725 ± 0.652
4.211LysGlu: 4.211 ± 1.708
3.22LysPhe: 3.22 ± 1.068
3.22LysGly: 3.22 ± 0.566
1.734LysHis: 1.734 ± 0.567
4.459LysIle: 4.459 ± 1.119
4.706LysLys: 4.706 ± 0.334
5.202LysLeu: 5.202 ± 0.953
3.963LysMet: 3.963 ± 1.277
1.982LysAsn: 1.982 ± 0.396
2.477LysPro: 2.477 ± 0.833
1.486LysGln: 1.486 ± 0.34
3.716LysArg: 3.716 ± 0.697
5.202LysSer: 5.202 ± 0.766
2.973LysThr: 2.973 ± 0.837
5.202LysVal: 5.202 ± 0.677
1.239LysTrp: 1.239 ± 0.587
2.477LysTyr: 2.477 ± 0.706
0.0LysXaa: 0.0 ± 0.0
Leu
5.202LeuAla: 5.202 ± 0.471
1.486LeuCys: 1.486 ± 0.605
3.716LeuAsp: 3.716 ± 0.45
6.193LeuGlu: 6.193 ± 1.354
3.963LeuPhe: 3.963 ± 1.185
4.706LeuGly: 4.706 ± 0.548
1.982LeuHis: 1.982 ± 0.395
6.688LeuIle: 6.688 ± 1.736
7.184LeuLys: 7.184 ± 1.183
6.688LeuLeu: 6.688 ± 0.916
2.477LeuMet: 2.477 ± 1.071
2.973LeuAsn: 2.973 ± 1.076
2.229LeuPro: 2.229 ± 0.37
3.22LeuGln: 3.22 ± 0.94
5.202LeuArg: 5.202 ± 0.449
10.899LeuSer: 10.899 ± 0.846
5.45LeuThr: 5.45 ± 1.374
3.716LeuVal: 3.716 ± 0.672
0.248LeuTrp: 0.248 ± 0.153
2.725LeuTyr: 2.725 ± 0.539
0.0LeuXaa: 0.0 ± 0.0
Met
1.239MetAla: 1.239 ± 0.326
0.743MetCys: 0.743 ± 0.458
1.486MetAsp: 1.486 ± 0.455
1.982MetGlu: 1.982 ± 0.636
0.495MetPhe: 0.495 ± 0.305
2.725MetGly: 2.725 ± 0.465
1.486MetHis: 1.486 ± 0.455
1.486MetIle: 1.486 ± 0.829
2.229MetLys: 2.229 ± 0.49
1.982MetLeu: 1.982 ± 0.615
1.982MetMet: 1.982 ± 0.795
2.229MetAsn: 2.229 ± 0.803
0.248MetPro: 0.248 ± 0.193
1.239MetGln: 1.239 ± 0.587
1.486MetArg: 1.486 ± 0.391
3.468MetSer: 3.468 ± 0.528
2.725MetThr: 2.725 ± 0.363
1.239MetVal: 1.239 ± 0.353
0.495MetTrp: 0.495 ± 0.438
0.248MetTyr: 0.248 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
1.486AsnAla: 1.486 ± 0.722
0.991AsnCys: 0.991 ± 0.69
1.982AsnAsp: 1.982 ± 0.896
2.973AsnGlu: 2.973 ± 0.294
1.982AsnPhe: 1.982 ± 0.527
0.991AsnGly: 0.991 ± 0.198
0.991AsnHis: 0.991 ± 0.307
2.477AsnIle: 2.477 ± 0.24
2.725AsnLys: 2.725 ± 1.012
5.202AsnLeu: 5.202 ± 0.816
0.495AsnMet: 0.495 ± 0.305
0.991AsnAsn: 0.991 ± 0.413
3.22AsnPro: 3.22 ± 0.487
0.743AsnGln: 0.743 ± 0.17
2.477AsnArg: 2.477 ± 0.706
4.954AsnSer: 4.954 ± 0.612
1.982AsnThr: 1.982 ± 0.396
1.734AsnVal: 1.734 ± 0.509
0.248AsnTrp: 0.248 ± 0.153
0.991AsnTyr: 0.991 ± 0.567
0.0AsnXaa: 0.0 ± 0.0
Pro
1.734ProAla: 1.734 ± 0.314
0.743ProCys: 0.743 ± 0.267
2.229ProAsp: 2.229 ± 1.06
4.211ProGlu: 4.211 ± 1.037
1.982ProPhe: 1.982 ± 0.432
4.706ProGly: 4.706 ± 0.334
0.991ProHis: 0.991 ± 0.198
2.229ProIle: 2.229 ± 0.404
2.973ProLys: 2.973 ± 0.718
1.982ProLeu: 1.982 ± 0.354
0.743ProMet: 0.743 ± 0.267
1.982ProAsn: 1.982 ± 0.78
0.743ProPro: 0.743 ± 0.267
0.743ProGln: 0.743 ± 0.17
2.229ProArg: 2.229 ± 0.644
3.22ProSer: 3.22 ± 0.797
1.239ProThr: 1.239 ± 0.686
2.725ProVal: 2.725 ± 0.272
0.743ProTrp: 0.743 ± 0.458
0.743ProTyr: 0.743 ± 0.458
0.0ProXaa: 0.0 ± 0.0
Gln
1.486GlnAla: 1.486 ± 0.241
0.991GlnCys: 0.991 ± 0.297
1.486GlnAsp: 1.486 ± 0.455
1.239GlnGlu: 1.239 ± 0.455
0.248GlnPhe: 0.248 ± 0.193
2.229GlnGly: 2.229 ± 0.37
0.743GlnHis: 0.743 ± 0.17
3.22GlnIle: 3.22 ± 1.05
4.211GlnLys: 4.211 ± 1.093
1.486GlnLeu: 1.486 ± 0.411
0.743GlnMet: 0.743 ± 0.17
1.734GlnAsn: 1.734 ± 0.509
1.486GlnPro: 1.486 ± 0.247
1.239GlnGln: 1.239 ± 0.455
0.743GlnArg: 0.743 ± 0.267
1.734GlnSer: 1.734 ± 0.292
1.239GlnThr: 1.239 ± 0.232
0.743GlnVal: 0.743 ± 0.17
0.0GlnTrp: 0.0 ± 0.0
0.743GlnTyr: 0.743 ± 0.17
0.0GlnXaa: 0.0 ± 0.0
Arg
2.725ArgAla: 2.725 ± 0.272
1.982ArgCys: 1.982 ± 0.618
4.706ArgAsp: 4.706 ± 0.775
5.45ArgGlu: 5.45 ± 1.398
0.991ArgPhe: 0.991 ± 0.297
3.963ArgGly: 3.963 ± 0.545
0.248ArgHis: 0.248 ± 0.153
2.973ArgIle: 2.973 ± 0.544
2.725ArgLys: 2.725 ± 0.281
5.45ArgLeu: 5.45 ± 0.724
2.725ArgMet: 2.725 ± 1.059
3.468ArgAsn: 3.468 ± 0.528
2.229ArgPro: 2.229 ± 0.404
1.734ArgGln: 1.734 ± 0.292
2.477ArgArg: 2.477 ± 0.495
4.954ArgSer: 4.954 ± 0.903
2.229ArgThr: 2.229 ± 1.06
3.468ArgVal: 3.468 ± 0.506
0.991ArgTrp: 0.991 ± 0.478
1.734ArgTyr: 1.734 ± 0.473
0.0ArgXaa: 0.0 ± 0.0
Ser
5.202SerAla: 5.202 ± 1.071
2.725SerCys: 2.725 ± 1.626
4.459SerAsp: 4.459 ± 0.728
5.202SerGlu: 5.202 ± 0.584
4.954SerPhe: 4.954 ± 0.875
6.44SerGly: 6.44 ± 0.689
2.229SerHis: 2.229 ± 0.258
5.45SerIle: 5.45 ± 0.725
5.697SerLys: 5.697 ± 0.623
8.918SerLeu: 8.918 ± 1.693
2.229SerMet: 2.229 ± 0.603
3.716SerAsn: 3.716 ± 0.567
3.963SerPro: 3.963 ± 0.419
1.486SerGln: 1.486 ± 0.62
6.193SerArg: 6.193 ± 0.714
7.679SerSer: 7.679 ± 1.356
5.945SerThr: 5.945 ± 1.061
4.211SerVal: 4.211 ± 0.849
2.229SerTrp: 2.229 ± 0.337
1.486SerTyr: 1.486 ± 0.413
0.0SerXaa: 0.0 ± 0.0
Thr
3.468ThrAla: 3.468 ± 1.215
1.734ThrCys: 1.734 ± 0.601
1.982ThrAsp: 1.982 ± 0.615
4.459ThrGlu: 4.459 ± 0.655
0.743ThrPhe: 0.743 ± 0.267
3.22ThrGly: 3.22 ± 1.303
1.982ThrHis: 1.982 ± 1.087
3.963ThrIle: 3.963 ± 1.23
3.22ThrLys: 3.22 ± 0.97
6.193ThrLeu: 6.193 ± 0.959
0.991ThrMet: 0.991 ± 0.741
1.486ThrAsn: 1.486 ± 0.34
2.973ThrPro: 2.973 ± 0.339
1.486ThrGln: 1.486 ± 0.34
4.211ThrArg: 4.211 ± 0.405
4.954ThrSer: 4.954 ± 0.913
4.954ThrThr: 4.954 ± 1.724
3.716ThrVal: 3.716 ± 1.94
0.495ThrTrp: 0.495 ± 0.371
2.477ThrTyr: 2.477 ± 0.603
0.0ThrXaa: 0.0 ± 0.0
Val
4.954ValAla: 4.954 ± 1.106
2.973ValCys: 2.973 ± 1.067
3.468ValAsp: 3.468 ± 0.694
3.468ValGlu: 3.468 ± 0.888
4.211ValPhe: 4.211 ± 1.036
1.982ValGly: 1.982 ± 0.395
2.725ValHis: 2.725 ± 0.97
4.211ValIle: 4.211 ± 0.755
3.963ValLys: 3.963 ± 1.171
5.202ValLeu: 5.202 ± 1.123
1.734ValMet: 1.734 ± 0.677
1.734ValAsn: 1.734 ± 0.601
1.239ValPro: 1.239 ± 0.302
2.477ValGln: 2.477 ± 0.437
3.963ValArg: 3.963 ± 0.556
4.706ValSer: 4.706 ± 1.004
3.716ValThr: 3.716 ± 0.864
3.963ValVal: 3.963 ± 1.013
0.0ValTrp: 0.0 ± 0.0
0.743ValTyr: 0.743 ± 0.17
0.0ValXaa: 0.0 ± 0.0
Trp
0.991TrpAla: 0.991 ± 0.307
0.248TrpCys: 0.248 ± 0.402
0.991TrpAsp: 0.991 ± 0.198
0.248TrpGlu: 0.248 ± 0.153
0.248TrpPhe: 0.248 ± 0.153
0.991TrpGly: 0.991 ± 0.297
0.0TrpHis: 0.0 ± 0.0
1.239TrpIle: 1.239 ± 0.232
0.0TrpLys: 0.0 ± 0.0
0.495TrpLeu: 0.495 ± 0.099
0.495TrpMet: 0.495 ± 0.305
0.743TrpAsn: 0.743 ± 0.267
0.495TrpPro: 0.495 ± 0.371
0.743TrpGln: 0.743 ± 0.368
0.495TrpArg: 0.495 ± 0.099
0.495TrpSer: 0.495 ± 0.371
1.239TrpThr: 1.239 ± 0.491
1.239TrpVal: 1.239 ± 0.314
0.248TrpTrp: 0.248 ± 0.153
0.248TrpTyr: 0.248 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.239TyrAla: 1.239 ± 0.883
0.248TyrCys: 0.248 ± 0.193
1.239TyrAsp: 1.239 ± 1.124
0.991TyrGlu: 0.991 ± 0.198
1.239TyrPhe: 1.239 ± 0.455
0.991TyrGly: 0.991 ± 0.307
0.743TyrHis: 0.743 ± 0.458
1.734TyrIle: 1.734 ± 0.444
1.982TyrLys: 1.982 ± 0.354
1.734TyrLeu: 1.734 ± 0.409
0.248TyrMet: 0.248 ± 0.153
1.239TyrAsn: 1.239 ± 0.232
1.486TyrPro: 1.486 ± 0.455
0.991TyrGln: 0.991 ± 0.567
1.982TyrArg: 1.982 ± 0.593
2.725TyrSer: 2.725 ± 0.97
1.734TyrThr: 1.734 ± 0.531
1.486TyrVal: 1.486 ± 0.62
0.743TyrTrp: 0.743 ± 0.267
0.743TyrTyr: 0.743 ± 0.336
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4038 amino acids)
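Each data line in the listing above carries the displayed value immediately followed by the dipeptide name, the value again, and its error (for example "4.706AlaAla: 4.706 ± 1.821"). If the listing needs to be processed programmatically, a regular expression along these lines could pull the fields apart. This is a sketch that assumes the exact line layout shown above; the function name is hypothetical, and the CSV file mentioned below is likely a more convenient source.

```python
import re

# Displayed value, first residue, second residue, value, error.
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, value, error), or None for row labels and other text."""
    match = CELL.match(line.strip())
    if match is None:
        return None
    return (
        match["first"],
        match["second"],
        float(match["value"]),
        float(match["error"]),
    )


parsed = parse_cell("4.706AlaAla: 4.706 ± 1.821")
row_label = parse_cell("Ala")
```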

Note: The error was estimated by bootstrapping (×100) at the protein level.
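One plausible reading of protein-level bootstrapping is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of the resulting estimates. The sketch below follows that reading with 100 rounds. The function names, the fixed seed, the use of the population standard deviation, and the toy sequences are assumptions for illustration, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per mille frequency of each overlapping residue pair across the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, dipeptide, rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(estimates)


# Hypothetical toy proteomes in one-letter codes.
identical = ["MAAL", "MAAL", "MAAL"]
mixed = ["MAAL", "GGGG", "ALSK"]
error_identical = bootstrap_error(identical, "AA")
error_mixed = bootstrap_error(mixed, "AA")
```

With only a handful of proteins, as here, each resample can differ substantially from the original proteome, which is consistent with some errors in the table being large relative to the values.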

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski