
Getting LinkedIn recruitment data always blocked? Try this.
Recruitment friends have been complaining recently, using crawlers to grab LinkedIn data to move the block. There is a headhunter brother is even worse, even changed 5 accounts are blacked out. In fact, this matter is really not to blame the platform, the anti-climbing mechanism is now very fine, the same IP high-frequency access immediately red light.
Why do you need a proxy IP for data?
For example, if you live in the sunrise area and go to the same convenience store every day to buy water, on the third day the boss should suspect that you are here to step on the spot.LinkedIn's backcrawl is also the same reasoning, theSingle-IP high-frequency access will be targetedThe first thing you need to do is to use a proxy IP to get into the store. Using a proxy IP is equivalent to entering the store in a different outfit every day, and the boss can't remember you at all.
Here's the point:
- Dynamic IP pool ready for at least 200+ IP rotations
- Don't be too regular in the interval of each visit, like a human manual operation with a bit of randomness
- It's best to use a residential IP, server room IPs are easy to identify
Hands on teaching you how to play with proxy IPs
Here's a chestnut with ipipgo's service, who specializes in this. First, open an account in the background, selectDynamic Residential AgentsPackage. Pay attention to these two parameters:
| parameters | recommended value |
|---|---|
| IP Survival Time | 3-5 minutes |
| concurrency | ≤5/sec |
Remember to add these three lines of code when configuring the script:
proxy = {
'http': 'http://用户名:密码@gateway.ipipgo.com:端口',
'https': 'http://用户名:密码@gateway.ipipgo.com:端口'
}
Don't step on these potholes.
Last year, a customer was greedy for cheap free proxy, the result of climbing to the data are all phishing sites fake page. Here to remind three points:
- Don't use the IP, the recognition rate is up to 90%
- IP switching interval of less than 30 seconds
- Note the browser fingerprint in the request header
If you are not sure about the parameter settings, directly find ipipgo technical customer service, they can remotely help you adjust the configuration. The last time a customer tossed three days to get it done, customer service ten minutes to the whole understanding.
Frequently Asked Questions QA
Q: Do I have to use a paid proxy?
A: Temporary use can find a shared IP pool, but long-term stability must also be a professional service. ipipgo new users have a 3-day free trial, try it yourself to know the difference.
Q: How much IP volume is needed per day?
A: Look at the size of the data. Ordinary headhunters 200-500 IP per day is enough, if you do big data analysis has to be thousands. It is recommended to buy a small package to test, ipipgo support upgrade at any time.
Q: Will I be sued by LinkedIn?
A: Pay attention not to climb personal privacy data, only collect public post information. ipipgo's IP pool comes with compliance attributes, as long as do not die touching sensitive fields on the line.
Tell the truth.
Now do data collection is like playing guerrilla warfare, platform algorithms are upgraded every month. With ipipgo this kind of service is mainly a figure of mind, their IP library automatically updated weekly 15%, encountered blocking can also be cut in seconds spare line. After the last update, more IP nodes in the Middle East, digging for oil industry recruitment information is particularly good.
Last reminder: don't use retarded delays like sleep(1) in your crawler program, learn from people who use random numbers. For example, random.uniform (0.5,3.5), so that the rhythm of access is more like a real person operation. These details ipipgo technical documentation have written, more look can be less detour.

